Initial support for Implicit TPMs #105

isacdaavid · 2023-03-26T20:06:02Z

This will probably need further polishing, and I have only just started writing tests and documentation for the new code. That said, it's mature enough to ask for review and suggestions. Test results are on par with the 4.0 branch, and the examples I ran are working. I also want to get a sense of the merge conflicts.

This is what the code is supposed to be doing (also, a mini guide for users):

The previous TPM format and Network creation remain supported. Users shouldn't need to update their old scripts 90% of the time. API breakage is mostly contained within node.py. The new TPM format obviously looks different, but most of the existing API has been either inherited or re-implemented for it.
Regular (explicit) TPMs are automatically converted to implicit ones upon Network creation. ImplicitTPMs are supposed to be the new common currency throughout pyphi (help us spot leftovers and subpar conversions!).
The way to create an ImplicitTPM is with a list or tuple of node TPMs. Each node TPM looks very much like a multidimensional explicit TPM, with one dimension per node in the network (inputs to the node contribute nonsingleton dimensions, non-inputs contribute singletons), plus a last dimension containing the probabilities for this node at t+1. Instead of only providing probabilities for the ON state, that last dimension must contain entries for all states (to simplify our work regardless of whether the node is binary or not). Users can look at the existing my_subsystem.nodes[i].tpm to get a sense of what node TPMs look like.

Example using the 2nd system in fig. 7C in the IIT 4.0 paper:

import pyphi
import numpy as np

node_labels = ("A", "B", "C")

connectivity_matrix = np.array([
    [1, 1, 0,],
    [0, 1, 1,],
    [1, 1, 1,],
])

explicit_tpm = np.array([
    [1, 0, 0],
    [0, 1, 0],
    [1, 1, 1],
    [0, 1, 1],
    [0, 0, 0],
    [1, 1, 0],
    [0, 0, 1],
    [1, 0, 1],
])

implicit_tpm = [
    np.array(
        [[[[0., 1.],
            [1., 0.]]],
         [[[1., 0.],
            [0., 1.]]]]
    ),
    np.array(
         [[[[1., 0.],
            [1., 0.]],
           [[0., 1.],
            [1., 0.]]],
          [[[0., 1.],
            [0., 1.]],
           [[0., 1.],
            [1., 0.]]]]
    ),
    np.array(
        [[[[1., 0.],
            [1., 0.]],
           [[0., 1.],
            [0., 1.]]]]
    )
]
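
To make the node TPM format above concrete, here is a minimal numpy-only sketch of how the node TPMs in this example relate to the explicit TPM. The helper `node_tpms_from_explicit` is hypothetical (it is not part of pyphi) and assumes binary units and little-endian row ordering of the state-by-node TPM.

```python
import numpy as np

connectivity_matrix = np.array([
    [1, 1, 0],
    [0, 1, 1],
    [1, 1, 1],
])

explicit_tpm = np.array([
    [1, 0, 0],
    [0, 1, 0],
    [1, 1, 1],
    [0, 1, 1],
    [0, 0, 0],
    [1, 1, 0],
    [0, 0, 1],
    [1, 0, 1],
])


def node_tpms_from_explicit(explicit_tpm, cm):
    """Hypothetical helper (not pyphi API): split a binary state-by-node
    TPM into one multidimensional TPM per node."""
    explicit_tpm = np.asarray(explicit_tpm, dtype=float)
    cm = np.asarray(cm)
    n = cm.shape[0]
    # Rows are little-endian (the first node varies fastest), so Fortran
    # order gives an array indexed as [A, B, C, ..., node].
    md = explicit_tpm.reshape([2] * n + [n], order="F")
    node_tpms = []
    for i in range(n):
        p_on = md[..., i]
        # cm[j, i] == 1 means node j is an input to node i. Non-inputs are
        # collapsed to singleton dimensions; the mean is exact here because
        # the node's probabilities do not depend on its non-inputs.
        non_inputs = tuple(j for j in range(n) if not cm[j, i])
        if non_inputs:
            p_on = p_on.mean(axis=non_inputs, keepdims=True)
        # The last dimension holds the probabilities of all states (OFF, ON).
        node_tpms.append(np.stack([1.0 - p_on, p_on], axis=-1))
    return node_tpms


for label, tpm in zip("ABC", node_tpms_from_explicit(explicit_tpm, connectivity_matrix)):
    print(label, tpm.shape)
# A (2, 1, 2, 2)
# B (2, 2, 2, 2)
# C (1, 2, 2, 2)
```

The resulting arrays should match the `implicit_tpm` list written out by hand above; in practice, Network performs this conversion for you.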

How do I convert an ExplicitTPM to an ImplicitTPM? You can do it indirectly, by defining a Network and extracting its .tpm attribute. Also see [node.tpm for node in my_subsystem.nodes] (assuming the candidate system is the whole network) and, more involved, pyphi.node.generate_nodes.
How do I convert an ImplicitTPM back to an ExplicitTPM? Use pyphi.tpm.reconstitute_tpm.
The connectivity matrix is now optional when passing an implicit TPM to Network. If absent, pyphi will infer the cm from the node TPMs, or report inconsistencies in the TPM. If passed, the cm will be used to validate that it matches the TPM. When passing an explicit TPM, the behavior of the cm parameter is as before (assumes all-to-all if absent).
There's a new optional parameter to Network (as well as a corresponding attribute): state_space. This can be used to define state labels for each node. If absent, pyphi will create a default state space using ints as state indices (like 0 for OFF, 1 for ON as previously implied).
Internally, node labels and the state space are used to create xarray DataArrays with appropriate dimension and coordinate names.
For users, ImplicitTPMs have fancier indexing. In addition to the regular numpy syntax using positional indexing with integers and slices, there's also pandas/xarray-like indexing by name:

>>> network = pyphi.Network(implicit_tpm, node_labels=node_labels, state_space=(("OFF", "ON"),) * 3)
>>> network
Network(
ImplicitTPM((A, B, C)),
cm=[[1 1 0]
 [0 1 1]
 [1 1 1]],
node_labels=NodeLabels(('A', 'B', 'C')),
state_space={'B': ['OFF', 'ON'], 'A': ['OFF', 'ON'], 'C': ['OFF', 'ON']}
)

# P(A_{t+1} | A=0, B=0, C=0)

>>> network.tpm[0, 0, 0]
ImplicitTPM((A, B, C))

# Behind the scenes, that result contains the indexed node TPMs. To verify this, we can repeat the indexing and then inspect node A:

>>> network.tpm[0, 0, 0].nodes[0].tpm
ExplicitTPM(
[0. 1.]
)

# That means P(A_{t+1}=OFF | A=0, B=0, C=0) = 0, and P(A_{t+1}=ON | A=0, B=0, C=0) = 1.

# Using state space labels, that would be the same as:

>>> network.tpm[{"B": "OFF", "A": "OFF", "C": "OFF"}].nodes[0].tpm
ExplicitTPM(
[0. 1.]
)

# A different example: P(A_{t+1}=ON) for every input state. This can be achieved by indexing the last dimension, called "Pr".

>>> network.tpm[{"Pr": "ON"}].nodes[0].tpm
ExplicitTPM(
[[[1. 0.]]
 [[0. 1.]]]
)
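
The cm inference and default state space described earlier can both be read off the shapes of the node TPMs. The following is a rough sketch under that assumption; `infer_cm` and `default_state_space` are hypothetical names, not pyphi functions, and the actual implementation may validate more than this.

```python
import numpy as np


def infer_cm(node_tpms):
    """Hypothetical helper: a nonsingleton dimension in a node's TPM is
    taken to mean that the corresponding node is one of its inputs."""
    n = len(node_tpms)
    cm = np.zeros((n, n), dtype=int)
    for target, tpm in enumerate(node_tpms):
        # Every dimension except the last one ("Pr") stands for a node.
        for source, size in enumerate(tpm.shape[:-1]):
            cm[source, target] = int(size > 1)
    return cm


def default_state_space(node_tpms, node_labels):
    """Hypothetical helper: integer state indices, one per entry along the
    last dimension of each node TPM."""
    return {
        label: list(range(tpm.shape[-1]))
        for label, tpm in zip(node_labels, node_tpms)
    }


# Placeholder node TPMs with the same shapes as the A, B, C example;
# only the shapes matter for this sketch.
shapes = [(2, 1, 2, 2), (2, 2, 2, 2), (1, 2, 2, 2)]
node_tpms = [np.full(shape, 0.5) for shape in shapes]

print(infer_cm(node_tpms))
# [[1 1 0]
#  [0 1 1]
#  [1 1 1]]
print(default_state_space(node_tpms, ("A", "B", "C")))
# {'A': [0, 1], 'B': [0, 1], 'C': [0, 1]}
```

Because the state space is derived from the size of the last dimension, the same sketch would also produce labels for nonbinary or heterogeneous nodes.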

We still have to work around the 32-node limit coming from numpy. However, the fact that we now use xarray DataArrays and named dimensions on top means that it should be relatively easy to come up with a workaround.
Nonbinary networks are still unsupported, beyond being able to define nonbinary, possibly heterogeneous Networks. There are several places throughout the source code that still assume binary units, so correct analyses are neither guaranteed nor tested. This patch paves the way, though.
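
For intuition about the reverse conversion (implicit back to explicit), here is a numpy-only sketch that rebuilds the state-by-node TPM of the example by broadcasting each node TPM over its singleton dimensions. It is not the code behind pyphi.tpm.reconstitute_tpm; `to_state_by_node` is a hypothetical helper, and it assumes binary units, in line with the caveat above.

```python
import numpy as np

# Node TPMs for A, B, C, copied from the example above.
implicit_tpm = [
    np.array([[[[0., 1.], [1., 0.]]],
              [[[1., 0.], [0., 1.]]]]),
    np.array([[[[1., 0.], [1., 0.]], [[0., 1.], [1., 0.]]],
              [[[0., 1.], [0., 1.]], [[0., 1.], [1., 0.]]]]),
    np.array([[[[1., 0.], [1., 0.]], [[0., 1.], [0., 1.]]]]),
]


def to_state_by_node(node_tpms):
    """Hypothetical helper: rebuild a binary state-by-node TPM."""
    n = len(node_tpms)
    full_shape = (2,) * n
    # Index 1 along the last dimension is P(node = ON). Broadcasting
    # expands the singleton (non-input) dimensions to the full state space.
    p_on = [np.broadcast_to(tpm[..., 1], full_shape) for tpm in node_tpms]
    md = np.stack(p_on, axis=-1)  # shape (2,) * n + (n,)
    # Fortran order restores little-endian row ordering.
    return md.reshape((2 ** n, n), order="F")


print(to_state_by_node(implicit_tpm))
```

The printed array should reproduce `explicit_tpm` from the example, row for row.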

In `subsystem.find_mice`, computing potential purviews can be very expensive in some situations, and if the user has provided a short iterable of purviews, then computing the potential purviews is not worth it. So, we simply use the user-provided purviews directly, allowing the user to decide whether to filter out reducible purviews.

isacdaavid and others added 30 commits January 3, 2023 15:15

tpm.ProxyMetaclass: Change list to generator expression

a9be5c2

tpm: Improve documentation

d0000b1

Move TPM documentation to tpm.ExplicitTPM class

39796ca

tpm.ExplicitTPM: Remove superfluous call to super constructor

a357031

tpm.ExplicitTPM: Preserve column alignment in string representation.

b15924a

subsystem: Complete ExplicitTPM.enforce cleanup

9c349e2

Remove inconsequential assignments introduced in bfd62c and 99ad3f.

Refactor Node to use xarray DataArray (accessor namespace for now)

91772ac

Node: Add support for multivalued and heterogeneous units

db0a505

Node: Fix docstring

b77efbb

Node.node: Singleton coords aren't necessarily due to marginalization

890ed62

tpm: Add abstract class ImplicitTPM and decorate for xr.Dataset

f31a38a

Refactor Network and Node for future implicit TPM support

5c8c626

Send duplicate state space filtering to utils.build_state_space()

7e43f6f

Network.__init__: Avoid transient storage of nodes beyond the TPM

b3687c7

Network.to_json Include state_space attribute.

34e7032

Network.__len__: Go back to counting nodes in the TPM (now a Dataset).

0dd6036

build_state_space(): Fix Union declaration

ed3bce3

Fix widespread Union declaration typo

3c9760c

Squeeze singleton dimensions in Node

5c1f1b5

Implement ImplicitTPM validation

5351aec

Merge remote-tracking branch

bf64cfc

Update Network __eq__() method.

955fe69

Network: revert to a __repr__ ammenable to JSON serialization.

03f6a04

Miscellaneous bug fixes in Node attribute setters and getters.

d192cbf

tpm.ImplicitTPM: Implement condition_tpm()

60e67ec

Minor import and docstring cleanup in subsystem

6301380

Change state space structure to use a dictionary.

f4b7d74

Thus allowing xarray DataArrays to have anonymous singleton dimensions.

Revert to using dummy singleton dimensions in multidimensional node TPMs

1687293

It turns out that, while allowed on a DataArray-level, nameless singleton dimensions cannot be aligned at the Dataset level.

Node._hash: temporary workaround when the TPM array is mutable.

719cb7c

Implement function to properly unalign DataArray's

db89127

isacdaavid and others added 30 commits November 21, 2023 17:17

tpm: Simplify subtpm method

89bad27

Refactor TPM validation

20808fe

subsystem: Fix proper_tpm

e687f26

Tidy documentation

0454ad7

node.py: Improve code formatting

460e9c8

Rename DataArray accessor to something more explicit

0f8bafc

validate: Disable state reachability warning for now

74ac75f

tpm: add number_of_units getter in the ImplicitTPM case

6593f21

ImplicitTPM.number_of_units: more efficient implementation

ac5fc86

test_tpm: validate arguments in implicit_tpm test

685cb7f

tpm: implement probability_of_current_state() and backward_tpm() for ImplicitTPMs

e4f1757

Refactor backward_tpm as instance methods

1014cec

Rename DataArray accessor to something more expressive

1d5939a

test_tpm: Add test_backward_tpm

f69f7a4

Move Fig 8 counter network to examples

a357ee4

utils: Add equivalent_states generator

1740585

Rework state reachability validation to avoid computing joint prob.

53b56c3

Merge branch feature/iit-4.0.

5ae32c5

Merge branch 'feature/iit-4.0' into feature/tpm-class

34aae42

Merge remote-tracking branch 'refs/remotes/origin/feature/tpm-class' into feature/tpm-class

78ff393

validate.state_reachable: Shortcircuit intersection as soon as possible

d44b760

validate.state_reachable: Don't expand equivalence classes, use symbolic intersection.

8083a71

validate.state_reachable: parallelize intersection between symbolic equivalence classes

fa9a3e4

Merge branch 'feature/tpm-class' of github.com:wmayner/pyphi into feature/tpm-class

484f3b4

validate: Refactor scope and add documentation/tests.

a08fc2b

Improve state validation when creating subsystems

4b93ca1

validate.state_value: Fix off-by-one error

34010f6

Remove parallelization of state_reachable for now

ab4c202

Backport unified subsystem interface to tpm-class branch

bec9e9b
