EXPLORATORY DATA ANALYSIS
Course Code BAI515E
CIE Marks 50
Teaching Hours/Week (L: T:P: S) 3:0:0:0
SEE Marks 50
Total Hours of Pedagogy 40
Total Marks 100
Credits 03
Exam Hours 3
Examination type (SEE) Theory
Module-1
Introduction to Python and NumPy: Getting Started in IPython and Jupyter, Enhanced Interactive
Features, The Basics of NumPy Arrays, Sorted Arrays, Structured Data: NumPy’s Structured Arrays
Textbook: Chapter 2, Chapter 5, Chapter 11, Chapter 12, Chapter 1(Not for CIE/SEE),
Module-2
Data Manipulation with Pandas - I: Introducing Pandas Objects, Handling Missing Data, Hierarchical
Indexing, Pivot Tables.
Module-3
Data Manipulation with Pandas - II: Vectorized String Operations, Working with Time Series, HighPerformance Pandas: eval and query
Module-4
Data Visualization with MatPlotlib: General Matplotlib Tips, Simple Line Plots, Simple Scatter Plots,
Visualization with Seaborn
Module-5
Introduction to Machine Learning: What Is Machine Learning?, Introducing Scikit-Learn,
Hyperparameters and Model Validation
Suggested Learning Resources:
Text Books:
1. Jake VanderPlas - Python Data Science Handbook: Essential Tools for Working with Data, Oreilly
2nd Edition, 2022.
Reference Book:
2. https://python4csip.com/files/download/Data%20Visualization.pdf

.png)