Skip to content

Latest commit

 

History

History

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 

README.md

✅ To check:

  • Is the data Independent and Identically Distributed (i.i.d.)?

📚 Books

🛠️ Tools

Title Description
Deepchecks Deepchecks is the leading tool for validating your machine learning models and data, and it enables doing so with minimal effort. Deepchecks accompanies you through various validation needs such as verifying your data’s integrity, inspecting its distributions, validating data splits, evaluating your model and comparing between different models
pandas_profiling Generates profile reports from a pandas DataFrame.
Great Expectations Great Expectations helps data teams eliminate pipeline debt, through data testing, documentation, and profiling.

📈 Visualization

Title Description
Vega-Altair: Declarative Visualization in Python With Vega-Altair, you can spend more time understanding your data and its meaning. Altair’s API is simple, friendly and consistent and built on top of the powerful Vega-Lite visualization grammar. This elegant simplicity produces beautiful and effective visualizations with a minimal amount of code.

❓ How To Select Columns Using Prefix/Suffix of Column Names in Pandas?

🔹 Select Columns with a Prefix using Pandas filter

For example, if we are interested in selecting columns starting with “lifeExp”, the regular expression for the pattern is “^lifeExp”. In the regular expression “^” represents we are interested in patterns that starts with. So our argument for “regexp” will be regexp=’^lifeExp’.

df.filter(regex='^lifeExp',axis=1).head()

🔹 Select Columns with a suffix using Pandas filter

Let us select columns with names ending with a suffix in Pandas dataframe using filter function. As before, we need to come up with regular expression for the pattern we are interested in. Here our pattern is column names ending with a suffix.

Let us select columns ending with “1957” and the regular expression pattern is ‘1957$’, where the dollar symbol at the end represents the pattern ending with “1957”.

We use regex=’1957$’ as argument to the Pandas’ filter function and addition to axis=1. We get a data frame with three columns that have names ending with 1957.

df.filter(regex='1957$',axis=1).head()

📰 Articles

💠 Missing Values

📰 Atricles