on request
6 weeks
English
Online
TU Berlin Certificate of Professional Education (6 ETCS)
Online
Prof. Dr. Rand Kouatly
3980 €
Recognized as Bildungszeit
INTRODUCTION TO STATISTICS AND PROGRAMMING FOR DATA SCIENCE
The certificate course provides intensive instruction in the fundamentals of statistics and programming for data science. It covers the basics of descriptive and inferential statistics as well as the fundamentals of programming for data analysis. Participants learn how to use the R/Python language to analyse data and create data visualizations.
Learning goals
Upon completion of the course, participants will know how to import, export and manipulate data using R/Python packages and understand basic statistical concepts and techniques and how to calculate them in R/Python. They will be able to create and customize different types of plots and graphs using R visualization packages. They will be proficient in creating and customizing different types of plots and graphs using Excel as complementary tools for statistics and be able to use R/Python to perform simple statistical analysis, hypothesis testing and data exploration. They can independently perform data cleaning, data transformation and data preparation using R/Python and know techniques for data pre-processing, cleaning and preparation.Content
Week 1: Preparation for the course - Software installation- What are R and RStudio? What are Python and Jupyter (Notebook)?
- Installing R and RStudio / Python and Jupyter (Notebook)? on your personal computer
- Errors, warnings and messages
- Tips on learning to code
- Package installation
- Package loading
- Testing and Hello World Program
- Topic 1: Data Types
- Topic 2: Basics Operations
- Topic 3: Data Structure
- Programming practice 1
- Topic 4: Types of Data
- Topic 5: Exploratory Data Analysis
- Topic 6: Statistics Analytics Using Excel
- Programming practice 2
- Topic 7: Data Frame operations
- Topic 8: Input and Output using R/Python
- Topic 9: Data Reshaping
- Programming practice 3
- Topic 10: Missing data handling
- Topic 11: Exploring and visualizing techniques
- Topic 12: Visualizing data using R/Python
- Programming practice 4
- Applying the skills and knowledge learned throughout the course to a real-world data
- Exam Project Requirement and specification
- 4 hours open consultation (Groups) – Online only
Target group
This course is designed for professionals who want to take an in-depth look at data science and to learn more about fundamentals and concepts in this field.This course is recognized as Bildungszeit according to paragraph § 10 (5) of the Berliner Bildungszeitgesetz (BiZeitG).
Prerequisites
- English level B1 (according to the European Framework)
- Prior knowledge of programming and statistics
- Basics in mathematics
- Understanding of standard Microsoft Office applications
- Laptop/PC + headset with microphone
Dates
Currently no dates. If you are interested, please contact us.Prof. Dr. Rand Kouatly