Pandas and Python Tips and Tricks for Data Science and Data Analysis | by Zoumana Keita | Dec, 2022

By Jessie Hobb On Dec 2, 2022

Take your efficiency to the next level with these Pandas and Python Tricks!

This blog regroups all the Pandas and Python tricks & tips I share on a basis on my LinkedIn page. I have decided to centralize them into a single blog to help you make the most out of your learning process by easily finding what you are looking for.

The content is is divided into two main sections:

Pandas tricks & tips are related to only Pandas.
Python tricks & tips related to Python.

This section provides a list of all the tricks

1. 𝗖𝗿𝗲𝗮𝘁𝗲 𝗮 𝗻𝗲𝘄 𝗰𝗼𝗹𝘂𝗺𝗻 𝗳𝗿𝗼𝗺 𝗺𝘂𝗹𝘁𝗶𝗽𝗹𝗲 𝗰𝗼𝗹𝘂𝗺𝗻𝘀 𝗶𝗻 𝘆𝗼𝘂𝗿 𝗱𝗮𝘁𝗮𝗳𝗿𝗮𝗺𝗲.

Performing simple arithmetic tasks such as creating a new column as the sum of two other columns can be straightforward.

🤔 But, what if you want to implement a more complex function and use it as the logic behind column creation? Here is where things can get a bit challenging.

Guess what…

✅ 𝙖𝙥𝙥𝙡𝙮 and 𝙡𝙖𝙢𝙗𝙙𝙖 can help you easily apply whatever logic to your columns using the following format:

𝙙𝙛[𝙣𝙚𝙬_𝙘𝙤𝙡] = 𝙙𝙛.𝙖𝙥𝙥𝙡𝙮(𝙡𝙖𝙢𝙗𝙙𝙖 𝙧𝙤𝙬: 𝙛𝙪𝙣𝙘(𝙧𝙤𝙬), 𝙖𝙭𝙞𝙨=1)

where:

➡ 𝙙𝙛 is your dataframe.

➡ 𝙧𝙤𝙬 will correspond to each row in your data frame.

➡ 𝙛𝙪𝙣𝙘 is the function you want to apply to your data frame.

➡ 𝙖𝙭𝙞𝙨=1 to apply the function to each row in your data frame.

💡 Below is an illustration.

Result of Pandas apply and lambda (Image by Author)

Take your efficiency to the next level with these Pandas and Python Tricks!

The content is is divided into two main sections:

Pandas tricks & tips are related to only Pandas.
Python tricks & tips related to Python.

This section provides a list of all the tricks

1. 𝗖𝗿𝗲𝗮𝘁𝗲 𝗮 𝗻𝗲𝘄 𝗰𝗼𝗹𝘂𝗺𝗻 𝗳𝗿𝗼𝗺 𝗺𝘂𝗹𝘁𝗶𝗽𝗹𝗲 𝗰𝗼𝗹𝘂𝗺𝗻𝘀 𝗶𝗻 𝘆𝗼𝘂𝗿 𝗱𝗮𝘁𝗮𝗳𝗿𝗮𝗺𝗲.

Performing simple arithmetic tasks such as creating a new column as the sum of two other columns can be straightforward.

🤔 But, what if you want to implement a more complex function and use it as the logic behind column creation? Here is where things can get a bit challenging.

Guess what…

✅ 𝙖𝙥𝙥𝙡𝙮 and 𝙡𝙖𝙢𝙗𝙙𝙖 can help you easily apply whatever logic to your columns using the following format:

𝙙𝙛[𝙣𝙚𝙬_𝙘𝙤𝙡] = 𝙙𝙛.𝙖𝙥𝙥𝙡𝙮(𝙡𝙖𝙢𝙗𝙙𝙖 𝙧𝙤𝙬: 𝙛𝙪𝙣𝙘(𝙧𝙤𝙬), 𝙖𝙭𝙞𝙨=1)

where:

➡ 𝙙𝙛 is your dataframe.

➡ 𝙧𝙤𝙬 will correspond to each row in your data frame.

➡ 𝙛𝙪𝙣𝙘 is the function you want to apply to your data frame.

➡ 𝙖𝙭𝙞𝙨=1 to apply the function to each row in your data frame.

💡 Below is an illustration.

Read original article here

Denial of responsibility! Techno Blender is an automatic aggregator of the all world’s media. In each content, the hyperlink to the primary source is specified. All trademarks belong to their rightful owners, all materials to their authors. If you are the owner of the content and do not want us to publish your materials, please contact us by email – [email protected]. The content will be deleted within 24 hours.

Pandas and Python Tips and Tricks for Data Science and Data Analysis | by Zoumana Keita | Dec, 2022

Take your efficiency to the next level with these Pandas and Python Tricks!

1. 𝗖𝗿𝗲𝗮𝘁𝗲 𝗮 𝗻𝗲𝘄 𝗰𝗼𝗹𝘂𝗺𝗻 𝗳𝗿𝗼𝗺 𝗺𝘂𝗹𝘁𝗶𝗽𝗹𝗲 𝗰𝗼𝗹𝘂𝗺𝗻𝘀 𝗶𝗻 𝘆𝗼𝘂𝗿 𝗱𝗮𝘁𝗮𝗳𝗿𝗮𝗺𝗲.

2. Convert categorical data into numerical ones

3. Select rows from a Pandas Dataframe based on column(s) values

4. Deal with zip files

5. Select 𝗮 𝘀𝘂𝗯𝘀𝗲𝘁 𝗼𝗳 𝘆𝗼𝘂𝗿 𝗣𝗮𝗻𝗱𝗮𝘀 𝗱𝗮𝘁𝗮𝗳𝗿𝗮𝗺𝗲 𝘄𝗶𝘁𝗵 𝘀𝗽𝗲𝗰𝗶𝗳𝗶𝗰 𝗰𝗼𝗹𝘂𝗺𝗻 𝘁𝘆𝗽𝗲𝘀

6. Remove comments from Pandas dataframe column

7. Print Pandas dataframe in Tabular format from consol

8. Highlight data points in Pandas

9. Reduce decimal points in your data

10. Replace some values in your data frame

11. Compare two data frames and get their differences

12. Get a subset of a very large dataset for quick analysis

13. Transform your data frame from a wide to a long format

14. Reduce the size of your Pandas data frame by ignoring the index

15. Parquet instead of CSV

16. Transform your data frame into a markdown

17. Format Date Time column

1. Create a progress bar with tqdm and rich

2. Get day, month, year, day of the week, the month of the year

3. Smallest and largest values of a column

4. Ignore the log output of the pip install command

5. Run multiple commands in a single notebook cell

6. Virtual environment.

7. Run multiple metrics at once

8. Chain multiple lists as a single sequence

9. Pretty print of JSON data

Take your efficiency to the next level with these Pandas and Python Tricks!

1. 𝗖𝗿𝗲𝗮𝘁𝗲 𝗮 𝗻𝗲𝘄 𝗰𝗼𝗹𝘂𝗺𝗻 𝗳𝗿𝗼𝗺 𝗺𝘂𝗹𝘁𝗶𝗽𝗹𝗲 𝗰𝗼𝗹𝘂𝗺𝗻𝘀 𝗶𝗻 𝘆𝗼𝘂𝗿 𝗱𝗮𝘁𝗮𝗳𝗿𝗮𝗺𝗲.

2. Convert categorical data into numerical ones

3. Select rows from a Pandas Dataframe based on column(s) values

4. Deal with zip files

5. Select 𝗮 𝘀𝘂𝗯𝘀𝗲𝘁 𝗼𝗳 𝘆𝗼𝘂𝗿 𝗣𝗮𝗻𝗱𝗮𝘀 𝗱𝗮𝘁𝗮𝗳𝗿𝗮𝗺𝗲 𝘄𝗶𝘁𝗵 𝘀𝗽𝗲𝗰𝗶𝗳𝗶𝗰 𝗰𝗼𝗹𝘂𝗺𝗻 𝘁𝘆𝗽𝗲𝘀

6. Remove comments from Pandas dataframe column

7. Print Pandas dataframe in Tabular format from consol

8. Highlight data points in Pandas

9. Reduce decimal points in your data

10. Replace some values in your data frame

11. Compare two data frames and get their differences

12. Get a subset of a very large dataset for quick analysis

13. Transform your data frame from a wide to a long format

14. Reduce the size of your Pandas data frame by ignoring the index

15. Parquet instead of CSV

16. Transform your data frame into a markdown

17. Format Date Time column

1. Create a progress bar with tqdm and rich

2. Get day, month, year, day of the week, the month of the year

3. Smallest and largest values of a column

4. Ignore the log output of the pip install command

5. Run multiple commands in a single notebook cell

6. Virtual environment.

7. Run multiple metrics at once

8. Chain multiple lists as a single sequence

9. Pretty print of JSON data