Course Overview
Data analysis drives decisions, reveals patterns, and solves problems across industries. This course offers a practical introduction to data analysis using two essential tools: Excel for quick, hands-on exploration, and R for powerful, reproducible statistical analysis and visualization.
Course Structure
Designed for beginners and those with minimal experience, the course covers:
Data import, cleaning, and wrangling in Excel and R
Exploratory data analysis with Excel functions and R packages
Data visualization using Excel charts and ggplot2 in R
Statistical tests and model building in Excel (ToolPak) and R
Advanced techniques and model evaluation with R
Each session pairs Excel warm-ups with R coding exercises. This balanced approach builds practical skills in a progressive, real-world workflow. You’ll complete a final project demonstrating integrated analysis and storytelling using both tools. Delivered through Muni Labs and the Graduate Studies Academy (GSA), the course equips you for data roles across sectors.
Course Details
Over 12 weeks, you will:
Use Excel to quickly clean, explore, and visualize data
Apply R to automate analysis, create advanced visuals, and perform rigorous statistical tests
Gain confidence with real-world datasets, through hands-on exercises and interactive discussion
Progress from foundational skills to advanced analysis and reproducibility practices
The course culminates in a capstone project where you apply both Excel and R to conduct a full data analysis cycle, delivering actionable insights in a business or research context.
Introduction to Data Analysis
Lesson Summary:
Welcome to data analysis! This week, you will:
Understand what data analysis is and why it is important in decision-making.
Get an overview of Excel and R as complementary tools for analyzing data.
Install R and RStudio on your computer and prepare Excel for data tasks.
Explore your first dataset in Excel (sorting, filtering, basic summaries) and in R (using head() and summary()).
By the end of this week, you will be able to open and explore a dataset in both Excel and R, and understand the role of these tools in solving real-world problems.
Assignment 1
Data Import in Excel
Lesson Summary:
This week, you will focus on bringing data into Excel and preparing it for analysis. You will learn to:
Import CSV, Excel, and text files into Excel.
Check and adjust data types (dates, numbers, text) to ensure accuracy.
Clean datasets by handling missing values, removing duplicates, and standardizing entries.
Save and document your cleaned datasets for future use.
By the end of this week, you will be able to confidently import, inspect, and clean datasets in Excel, laying the foundation for analysis.
Assignment 2:
Download the sample dataset provided.
In Excel:
Import the dataset (CSV or Excel format).
Check that each column has the correct data type (date, number, text).
Clean the dataset: remove duplicates, fill or flag missing values, and standardize entries if needed.
Save the cleaned dataset with a clear file name.
Submit the cleaned dataset and a short note describing the cleaning steps you performed.
Lesson Summary:
This week focuses on bringing data into R and preparing it for analysis. You will learn to:
· Import CSV, Excel, and text files into R using readr and readxl.
· Explore datasets with basic functions like head(), summary(), and str().
· Perform basic data wrangling using dplyr functions: select(), filter(), mutate().
· Handle missing values and perform simple transformations.
By the end of this week, you will be able to load and clean data in R, making it ready for analysis.
Assignment 3:
1. Import the same dataset you used in Excel into R.
2. Explore the dataset using head(), summary(), and str().
3. Use dplyr to:
o Select specific columns.
o Filter rows based on conditions.
o Create a new column with a transformation.
4. Identify and handle missing values.
5. Save your cleaned and transformed dataset as an R object (.RData) or CSV.
Assignment 3:
Import the same dataset you used in Excel into R.
Explore the dataset using head(), summary(), and str().
Use dplyr to:
Select specific columns.
Filter rows based on conditions.
Create a new column with a transformation.
Identify and handle missing values.
Save your cleaned and transformed dataset as an R object (.RData) or CSV.
EDA in Excel
Lesson Summary:
This week, you will explore your data in Excel to understand its structure and patterns. You will learn to:
Calculate descriptive statistics: AVERAGE, MEDIAN, MIN, MAX.
Summarize data using subtotals and pivot tables.
Identify missing values and outliers visually using conditional formatting and charts.
By the end of this week, you will be able to extract key insights from your dataset in Excel and spot data issues that need cleaning or further investigation.
Assignment 4:
Calculate summary statistics for your dataset (AVERAGE, MEDIAN, MIN, MAX).
Create pivot tables to summarize data by groups.
Highlight missing values and outliers using conditional formatting or visual inspection.
Prepare a brief summary of the patterns and issues you observed in your dataset.
EDA in R
Lesson Summary:
This week, you will explore your dataset in R to understand its structure and extract meaningful insights. You will learn to:
Calculate descriptive statistics using mean(), median(), and sd().
Create group summaries using group_by() and summarise().
Detect missing values and outliers programmatically.
By the end of this week, you will be able to perform a comprehensive exploratory analysis in R and identify key trends, missing values, and anomalies in your data.
Assignment 5:
Perform summary statistics on your dataset in R.
Use group_by() and summarise() to create summaries by categorical variables.
Identify missing values and outliers.
Write a short report summarizing the key findings and data issues detected.
Data Visualization in Excel
Lesson Summary:
This week, you will learn to turn raw data into clear, actionable visuals using Excel. You will:
Create bar, line, and column charts to highlight trends.
Apply conditional formatting to quickly spot key values.
Format and label charts effectively for presentations.
By the end of this week, you will be able to communicate insights visually and make your data easier to interpret for decision-making.
Assignment 6:
Create bar, line, or column charts for your dataset in Excel.
Apply conditional formatting to highlight top, bottom, or outlier values.
Ensure charts are properly labeled and formatted for clarity.
Submit your Excel file with the visualizations.
Data Visualization in R
Lesson Summary:
This week, you will learn to create and customize charts in R using ggplot2. You will:
Understand the syntax and structure of ggplot2 (aesthetics, geoms).
Create bar, line, and scatter plots.
Customize colors, labels, and themes to make charts presentation-ready.
By the end of this week, you will be able to recreate Excel visualizations in R and design advanced charts for data storytelling.
Assignment 7:
Use ggplot2 to create bar, line, or scatter plots from your dataset.
Customize your charts with appropriate colors, labels, and themes.
Compare your R visualizations with the ones you created in Excel.
Submit your R script and the resulting plots.
Statistical Analysis in Excel
Lesson Summary:
This week, you will learn how to perform basic statistical analysis using Excel. You will:
Use the Data Analysis ToolPak for correlation, t-tests, and regression.
Create scatterplots with regression lines.
Interpret p-values, R-squared, and other statistical outputs.
By the end of this week, you will be able to analyze relationships between variables and interpret results effectively in Excel.
Assignment 8:
Use the Data Analysis ToolPak to run correlation analysis on your dataset.
Conduct a t-test or regression analysis as appropriate.
Create scatterplots with regression lines.
Interpret the outputs and write a short summary of your findings.
Submit your Excel workbook with analysis and charts.
Statistical Analysis in R
Lesson Summary:
This week, you will perform statistical analysis using R. You will:
Calculate correlation and perform hypothesis tests using cor(), t.test(), and chisq.test().
Fit linear regression models using lm().
Visualize regression lines and relationships with geom_smooth() in ggplot2.
By the end of this week, you will be able to run statistical tests, interpret results, and create visualizations of relationships between variables in R.
Assignment 9:
Calculate correlations between variables in your dataset.
Perform appropriate t-tests and/or chi-square tests.
Fit a linear regression model using lm().
Visualize the regression line with ggplot2.
Write a brief interpretation of your results.
Submit your R script and any resulting plots.
Advanced Analysis Techniques
Lesson Summary:
This week focuses on more advanced tools for analyzing data in both Excel and R. You will learn:
Excel: Advanced formulas such as IF(), VLOOKUP(), and INDEX-MATCH for dynamic data analysis.
Excel: Time series basics, including trendlines and forecasting.
R: Time series analysis using ts(), decomposition, and the forecast package.
R: Introduction to ANOVA for comparing group means and clustering techniques for identifying patterns in data.
By the end of this week, you will be able to handle complex datasets, perform time-based analyses, and explore group differences and clustering in R.
Assignment 10:
Apply advanced formulas in Excel to analyze your dataset.
Create a time series plot and apply trendlines in Excel.
In R, create a time series object, decompose it, and forecast future values.
Run a simple ANOVA or clustering analysis in R.
Document your steps and insights clearly.
Submit both Excel and R outputs for review.
Model Evaluation (Conceptual)
Lesson: Introduction to model evaluation
Importance of evaluating models
Key evaluation metrics for regression: R-squared, Mean Absolute Error (MAE), Mean Squared Error (MSE)
Key evaluation metrics for classification: Accuracy, Precision, Recall, F1-score (brief overview)
Concept of overfitting and underfitting
Cross-validation overview (conceptual)
Choose one regression or classification scenario from your previous work. Write a short explanation of how you would evaluate the model using the metrics discussed. No calculations required; focus on reasoning and interpretation.
Case Study & Project Prep
Lesson Summary:
This week prepares you to integrate all the skills you’ve learned into real-world applications and your final project. You will:
Apply Excel and R techniques to a real-world dataset.
Combine quick insights from Excel with deeper analysis in R.
Plan your final project, including selecting a dataset, cleaning data, and outlining your analysis approach.
By the end of this week, you will have a clear roadmap for your final project and be ready to conduct comprehensive data analysis.
Assignment 11:
Choose a dataset relevant to your interest or field.
Perform initial cleaning in Excel.
Apply exploratory and summary analyses in Excel and R.
Draft a detailed plan for your final project, including:
Dataset description
Cleaning steps
Analysis methods
Visualizations to include
Submit your project plan and preliminary analysis for feedback.
Final Project Presentations
Lesson Summary:
This week is dedicated to completing and presenting your final project. You will:
Finalize dataset cleaning and full analysis using both Excel and R.
Develop comprehensive visuals, summary statistics, and interpretations.
Present your findings clearly, including insights, recommendations, and actionable conclusions.
By the end of this week, you will demonstrate mastery of data analysis concepts and practical skills, showing your ability to solve real-world problems using data.
Assignment 12:
Complete all data cleaning and analysis in Excel and R.
Create visuals and statistical summaries for your project.
Compile your results into a professional report or presentation.
Present your findings to the class, highlighting key insights, trends, and recommendations.
Submit your final project for evaluation.
Course Notice: Introduction to Data Analysis with R
Dear Participants,
Welcome to Introduction to Data Analysis with Excel & R, a practical 12-week course designed to build your foundational skills in data analysis using both Excel and R. Whether you are new to data or looking to expand your capabilities, this course equips you to explore, clean, analyze, and visualize data effectively using these complementary tools.
Course Highlights:
Start Date: August 8, 2025, at 5:30 PM EAT
Delivery Mode: Virtual via Zoom (live and pre-recorded lectures)
Core Topics:
Data Import and Wrangling in Excel and R
Exploratory Data Analysis (EDA)
Data Visualization with Excel charts and R’s ggplot2
Statistical Analysis Techniques in Excel (ToolPak) and R
Advanced Methods including ANOVA, Time Series, and Model Evaluation
Real-World Case Studies and a Final Capstone Project
What to Expect:
Hands-On Learning: Interactive exercises using both Excel and R, real-world case studies, and a final project to apply your skills.
Comprehensive Support: Access live sessions and recordings, plus instructor guidance for assignments and Q&A.
Flexible & Practical: Learn data analysis from scratch with Excel and R—no prior experience required.
We look forward to an engaging and productive learning journey. Prepare to unlock the full potential of your data skills with Excel and R!
Best regards,
Chris Benjamin Asianzu
Course Instructor – Introduction to Data Analysis with R
Graduate Studies Academy, Muni Labs