data chimp is a programmable data analysis assistant that automatically shows contextual data visualizations, tables, and data docs as you work in your Jupyter notebook. Use it to spot unexpected features in your data, get oriented in a new data set quickly, or enforce best practices on your team.
Connect to an existing Jupyter kernel by running the
Once connected, click the "scatterplots" button within the data chimp view:
Then run the following code:
You should see a scatterplot for every combination of numeric columns within the titanic data set:
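To make "every combination of numeric columns" concrete, here is a minimal sketch of how such column pairs could be enumerated. It is not data chimp's actual implementation: the `table` dictionary is a hypothetical, made-up stand-in for a few Titanic-style columns, `numeric_columns` is an illustrative helper, and treating "combination" as unordered pairs is an assumption.

```python
from itertools import combinations
from numbers import Number

# Hypothetical miniature of a Titanic-style table.
# Column names and values are illustrative only, not the real data set.
table = {
    "age": [22.0, 38.0, 26.0],
    "fare": [7.25, 71.28, 7.92],
    "pclass": [3, 1, 3],
    "name": ["A", "B", "C"],
}


def numeric_columns(data):
    """Return the names of columns whose values are all numbers (bools excluded)."""
    return [
        name
        for name, values in data.items()
        if all(isinstance(v, Number) and not isinstance(v, bool) for v in values)
    ]


# Assumption: one scatterplot per unordered pair of numeric columns.
pairs = list(combinations(numeric_columns(table), 2))
print(pairs)
```

With three numeric columns this yields three pairs; the non-numeric `name` column is skipped.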
You can get the code that generated a particular visualization by pressing the Send Code button:
data chimp is configured via Jupyter notebooks that live in the
Pro tip: move the data chimp view to your secondary side bar so you can see data chimp results alongside any other side bar view:
Quickly visualize your data with code-aware visualizations
The notebooks you keep in the
Automatically check for data quality or analysis issues
The cells of the
You can see this cell at work in the titanic data set:
Once you've installed data chimp, you can check out
Loop previous results for feedback as you wrangle your data
data chimp adds a loop button to each notebook cell toolbar. Once you press this button, the cell re-runs automatically each time you execute another cell, with the data frame it references replaced by the data frame you're working with in the current cell.
For example, imagine you've found a typo in some penguins data by running
As you fix these typos, you want this table to update so you can get feedback on the correctness of your string replace code. So you can loop this result, and it'll update as you run your string replace code:
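The feedback cycle described above can be sketched in plain Python. This is only an illustration of the idea, not how data chimp implements looping: the `species` values are made up, and `value_counts` is a hypothetical stand-in for whatever summary cell you choose to loop.

```python
from collections import Counter

# Hypothetical stand-in for a "species" column in a penguins data set,
# containing one misspelled value. These values are made up for illustration.
species = ["Adelie", "Adelie", "Gentoo", "Chinstrap", "Adeli", "Gentoo"]


def value_counts(column):
    """The 'looped' summary: count how often each value appears."""
    return dict(Counter(column))


# First run of the summary reveals the typo.
before = value_counts(species)

# The wrangling step: replace the misspelled label with the correct one.
fixed = ["Adelie" if value == "Adeli" else value for value in species]

# Re-running the same summary on the new data mimics what a looped cell does.
after = value_counts(fixed)

print(before)
print(after)
```

Comparing the two printed summaries shows whether the replacement worked: the misspelled label disappears and its count moves to the correct one.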
Pull data docs into your notebook
With data chimp, it's easy to pull your data catalogue docs into your notebook. We already have a way to do this with dbt docs here, but you can easily adapt it to pull in docs from your data catalogue in just a few minutes. All you need to do is:
Make sure you've already installed the VSCode Jupyter extension before using data chimp.
Public beta launch