GitHub - meffmadd/pandas-visual-analysis: A package for interactive visual analysis in jupyter notebooks

Status

Release

Code

Generates an interactive visual analysis widget to analyze a pandas DataFrame in Jupyter notebooks. It can display various different types of graphs with support for linked-brushing in interactive widgets. This allows data exploration and cognition to be simple, even with complex multivariate datasets.

There is no need to create and style plots or interactivity - it's all ready without any configuration.

Installation

Using pip

To install this package with pip run:

pip install pandas-visual-analysis

Using conda

To install this package with conda run:

conda install -c meffmadd pandas-visual-analysis

From Source

To install this package from source, clone into the repository or download the zip file and run:

python setup.py install

JupyterLab Support

Run the following commands to install the necessary JupyterLab extensions:

# plotly.py renderer support
jupyter labextension install jupyterlab-plotly

# Jupyter widgets extension for plotly.py
jupyter labextension install @jupyter-widgets/jupyterlab-manager plotlywidget

Note that this requires node to be installed!

Usage

Basic Usage

Having a DataFrame, for example:

import pandas as pd
import ssl
ssl._create_default_https_context = ssl._create_unverified_context
url = "https://raw.githubusercontent.com/mwaskom/seaborn-data/master/mpg.csv"

df = pd.read_csv(url)

you can just pass it to VisualAnalysis to display the default layout:

from pandas_visual_analysis import VisualAnalysis
VisualAnalysis(df)

If you want to explicitly specify which columns of the DataFrame are categorical, just pass the categorical_columns option:

from pandas_visual_analysis import VisualAnalysis
categorical = ["name", "origin", "model_year", "cylinders"]
VisualAnalysis(df, categorical_columns=categorical)

Selection Types

By default a new selection replaces the old selection, however, it is also possible to add data points to the existing selection by selecting the Additive selection type. By choosing the Subtractive selection, newly selected data points are removed from the selection.

Using DataSource

Instead of passing the DataFrame object directly to VisualAnalysis it is possible to use a DataSource object. This enables linked-brushing across multiple notebook cells if the object is used across cells.

from pandas_visual_analysis import VisualAnalysis, DataSource

data = DataSource(df)
VisualAnalysis(data)

Later you can create a new analysis with the brushing still preserved simply by using the same data object created earlier.

VisualAnalysis(data)

Using Layouts

If you want to specify your own layout, you can do that by passing the layout parameter. The parameter is a list of rows, where each row is in turn a list specifying the Widgets in that row.

from pandas_visual_analysis import VisualAnalysis

VisualAnalysis(df,
    layout=[["Scatter", "Scatter"],
            ["ParallelCoordinates"]]
)

Here, two scatter plots will share the first row while the second row only contains a parallel coordinates plot. In order to see all the possible options you can call the widgets class-method of VisualAnalysis.

VisualAnalysis.widgets()

This outputs the following list of possible plots:

['Scatter',
 'ParallelCoordinates',
 'BrushSummary',
 'Histogram',
 'ParallelCategories',
 'BoxPlot']

Any of those can be part of the layout specification. See also: widgets Documentation.

For more advanced features of the VisualAnalysis class see: VisualAnalysis Documentation

Documentation

For more details see the Official Documentation.

Name		Name	Last commit message	Last commit date
Latest commit History 87 Commits
.github/workflows		.github/workflows
conda		conda
docs		docs
src/pandas_visual_analysis		src/pandas_visual_analysis
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.readthedocs.yml		.readthedocs.yml
LICENSE		LICENSE
README.rst		README.rst
requirements.txt		requirements.txt
requirements_dev.txt		requirements_dev.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Installation

Using pip

Using conda

From Source

JupyterLab Support

Usage

Basic Usage

Selection Types

Using DataSource

Using Layouts

Documentation

About

Releases 3

Packages

Contributors 2

Languages

License

meffmadd/pandas-visual-analysis

Folders and files

Latest commit

History

Repository files navigation

Installation

Using pip

Using conda

From Source

JupyterLab Support

Usage

Basic Usage

Selection Types

Using DataSource

Using Layouts

Documentation

About

Resources

License

Stars

Watchers

Forks

Releases 3

Packages 0

Contributors 2

Languages

Packages