Open Access Open Access  Restricted Access Subscription Access

Data Visualization using Python and Matplotlib

Arshadali M Athani, Gajanan M Naik

Abstract


In today's data-focused world, the ability to analyze and communicate complex data is vital. Data visualization is a key link between data and information you can use. This case study investigates the practical application of Python and its effective Matplotlib library for producing insightful data visualizations. Using a real-world public dataset of global COVID-19 statistics,this paper carefully demonstrates the complete process of analyzing data. The process includes data cleaning and preprocessing with the Pandas library, subsequent arrangements of static, animated, and interactive visualizations completed with Matplotlib. The case study process highlights the creation of basic plots such as bar plots, line plots, scatter plots, and histograms with the objective of revealing visual trends, patterns, or anomalies. This study demonstrates how the capabilities of Matplotlib's landscape framework allow us to convert dense data into clear visual understandings. The study concludes that Python and Matplotlib work together to create a potent, accessible, and indispensable grid for researchers, analysts, and decision-makers, while also serving as a key resource in modern data science.


Full Text:

PDF

References


J. Brownlee, “A Gentle Introduction to Data Visualization in Python,” Machine Learning Mastery, 2020.

Kaggle, “COVID-19 Data Repository by Johns Hopkins University,” Kaggle Datasets, 2020.

J. D. Hunter, “Matplotlib: A 2D Graphics Environment,” Computing in Science & Engineering, vol. 9, no. 3, pp. 90–95, 2007, doi: 10.1109/MCSE.2007.55.

World Health Organization (WHO), “WHO Coronavirus (COVID-19) Dashboard,” World Health Organization, 2020. [Online].


Refbacks

  • There are currently no refbacks.