Data Analytics & Machine Learning with R Programming
About this Course:
R is one of the most trusted programming languages in the field of Artificial Intelligence and Machine Learning. Its extensive ecosystem of packages, simplicity in data visualization and built-in statistical functions make it a professional choice for Analysts, Researchers and Data Scientists working across different domains. Therefore, this hands-on beginner-friendly course is designed to provide the learners with a comprehensive foundation in R programming and its applications in Data Analysis, Visualization and Machine Learning. This course will equip the learners with the important skills of R-programming and make them competent to be able to analyze complex datasets, communicate data insights and develop predictive models —laying a solid foundation for advanced analytics roles and in the fields of AI and Machine Learning.
What you will Learn:
-
Fundamentals of R Programming: The course starts with the fundamentals of R, focusing on its core data structure, conditional & logical operators, statements and functions etc.
-
Data Analytics with R Programming: The participants will gain hands-on expertise on data manipulation using modern R packages such as dplyr, tidyr, and readr etc. and data visualization using ggplot2. Statistical concepts are thoroughly covered to help participants understand distributions, variability and inference using real-world examples.
-
Machine Learning with R Programming: The second half of the course introduces Machine Learning workflows using R’s powerful packages such as caret, mlr3 and randomForest etc. Both supervised and unsupervised learning techniques will be thoroughly explored with real-world examples.
Course Objectives:
● To make the participants understand the fundamentals of R programming and its applications in the fields of Data Analytics & Machine Learning.
● To develop proficiency in using powerful R packages like dplyr, tidy, readr for Data manipulation and ggplot2 for Data Visualization.
● To introduce participants to essential Statistical and Machine Learning techniques using R programming.
● To equip learners with the ability to build, evaluate, and interpret predictive models using R programming.
● To provide hands-on experience through projects using real-world datasets.
Batch Details:
Class Timings: 7:00 pm – 9:00 pm (Wednesday & Friday) Start Date: 18th July 2025
Duration: 70 Hours End Date: 31st Oct 2025
Mode: Online Last Date to Register: 17th July 2025
Course Fee: Students/PhD Scholars/RA/JRF/SRF/Postdoc fellows: Rs. 11,000/-
Faculty/Working Professional: Rs. 13,000/-
(Amounts inclusive of GST)
Course Highlights:
• Industry-Relevant Skills.
• Hands-on based learning experience through practical projects.
• Full-time access to recorded lectures/PPTs/PDFs/Study Materials.
• Session on Resume Preparation/Interview Preparation.
Course Overview:
Module 1: Fundamentals of R Programming
- Basics of R language: Syntax, Data Types, Operators
- R Objects: Vectors, Lists, Matrices, Arrays, Factors, Data Frames
- Variable Assignment, Indexing & Slicing
- Conditional & Logical Operators
- Statements in R: if-else, while, for
- Writing Functions in R, Anonymous Functions (function ())
Module 2: Data Analytics with R Packages
- Introduction to Data Science in R
- Built-in R Features
- Apply and Math Functions with R
- Regular Expressions, Dates and Timestamps
- Introduction to dplyr and Tidyverse
- Data manipulation: select(), filter(), mutate(), arrange(), summarise()
- Handling Missing Values & Outliers
- Data Cleaning Techniques
Module 3: Data Visualization in R
- Base R Plotting: plot(), hist(), boxplot(), etc.
- Introduction to ggplot2
- Customizing plots: titles, labels, themes
- Advanced visualizations: Violin Plots, Pair Plots
- Combining plots: patchwork, gridExtra, cowplot
- Hands-on project with real-world data visualization
Module 4: ML Basics & Supervised Learning
- Introduction to Machine Learning in R
- Using caret and mlr3 packages
- Simple & Multiple Linear Regression
- Model Evaluation: RMSE, MAE, R²
- Logistic Regression & Classification Tasks
- Confusion Matrix, Accuracy, Precision, Recall, F1 Score
Module 5: Supervised Learning & Natural Language Processing (NLP)
- Decision Trees using rpart, C50
- k-Nearest Neighbors with class and caret
- Random Forest with randomForest package
- Naive Bayes Classifier
- Basics of Natural Language Processing (NLP) in R
- Text Mining with tm, tidytext, quanteda
- Creating Document Term Matrices and using TF-IDF
Module 6: Unsupervised Learning with R
- Introduction to Clustering in R
- K-Means Clustering
- Hierarchical Clustering with Dendrograms
- DBSCAN using dbscan package
- Dimensionality Reduction with PCA
- Hands-on Unsupervised Learning Project with R
Prerequisites and eligibility:
-
No coding experience in any programming language required. We’ll start from scratch.
-
This course can be taken up by any undergraduate/postgraduate student of Basic & Applied Sciences, Engineering, Management and Computer Applications and also by Research Scholars/Faculties/Working Professionals who want to upskill themselves.
-
Participants need to have a laptop/PC (with a minimum of 4 GB RAM, 100 GB HDD, Intel i3 processor) and proper internet/Wi-Fi connection.
Contact Person: Dr. Subrat Kotoky
Email: [email protected] / [email protected]
Phone: 9085317465 / 8473874389
Expert Profile: Mr. Shreyas Shukla
Professional Corporate Trainer & Microsoft Azure Certified Data Engineer
M.Tech-IIT Kharagpur & BE- The Aeronautical Society of India, New Delhi
Has successfully conducted 25+ courses and trained 1500+ learners in the fields of Python Programming, Data Analytics, Machine Learning, Deep Learning, Computer Vision etc. till now.
(Total Experience in conducting Professional Courses: 4+ Years)
Certifications:
-
DP-203: Microsoft Certified: Azure Data Engineer Associate
-
DP-900: Microsoft Certified: Azure Data Fundamentals
-
AZ-900: Microsoft Certified: Azure Fundamentals
Register Now
Share this Page!