-
Certificates: Certificate "Data Scientist"
-
Additional Certificates: Data Engineer" certificate
Data Analytics" certificate
Machine Learning" certificate
Deep Learning" certificate -
Examination: Practical project work with final presentations
-
Teaching Times: Full-timeMonday to Friday from 8:30 a.m. to 3:35 p.m. (in weeks with public holidays from 8:30 a.m. to 5:10 p.m.)
-
Language of Instruction: German
-
Duration: 16 Weeks
Data Engineer
Basics of Business Intelligence (approx. 2 days)
Fields of application, dimensions of a BI architecture
Basics of business intelligence, OLAP, OLTP, tasks of data engineers
Data Warehousing (DWH): handling and processing of structured, semi-structured and unstructured data
Requirements management (approx. 2 days)
Tasks, objectives and procedures in requirements analysis
Data modeling, introduction/modeling with ERM
Introduction/modeling in UML
- Class diagrams
- Use case analysis
- Activity diagrams
Artificial intelligence (AI) in the work process
Presentation of specific AI technologies
and possible applications in the professional environment
Databases (approx. 3 days)
Basics of database systems
Architecture of database management systems
Application of RDBMS
Implementation of data model in RDBMS, normal forms
Practical and theoretical introduction to SQL
Limits of relational databases, csv, json
Data Warehouse (approx. 4 days)
Star Schema
Data modeling
Creation of Star Schema in RDBMS
Snowflake Schema, basics, data modeling
Creation of Snowflake Schema in RDBMS
Galaxy Schema: Basics, data modeling
Slowly Changing Dimension Tables Type 1 to 5 - Restating, Stacking, Reorganizing, mini Dimension and Type 5
Introduction to normal, causal, mini and monster, heterogeneous and sub dimensions
Comparison of state and transaction oriented
Fact tables, density and storage from DWH
ETL (approx. 4 days)
Data Cleansing
- Null Values
- Preparation of data
- Harmonization of data
- Application of regular expressions
Data Understanding
- Data validation
- Statistical data analysis
Data protection, data security
Practical structure of ETL routes
Data Vault 2.0, basics, hubs, links, satellites, hash key, hash diff.
Data Vault data modeling
Practical structure of a Data Vault model - Raw Vault, practical implementation of hash procedures
Project work (approx. 5 days)
To consolidate the content learned
Presentation of the project results
Data analytics
Introduction to data analysis (approx. 1 day)
CRISP-DM reference model
Data analytics workflows
Definition of artificial intelligence, machine learning, deep learning
Requirements and role in the company of data engineers, data scientists and data analysts
Review of Python basics (approx. 1 day)
data types
Functions
Data analysis (approx. 3 days)
Central Python modules in the context of data analytics (NumPy, Pandas)
Process of data preparation
Data mining algorithms in Python
Artificial intelligence (AI) in the work process
Presentation of specific AI technologies
and possible applications in the professional environment
Data visualization (approx. 3 days)
Explorative data analysis
insights
Data quality
Benefit analysis
Visualization with Python: Matplotlib, Seaborn, Plotly Express
Data storytelling
Data management (approx. 2 days)
Big data architectures
Relational databases with SQL
Comparison of SQL and NoSQL databases
Business Intelligence
Data protection in the context of data analysis
Data analysis in a big data context (approx. 1 day)
MapReduce approach
Spark
NoSQL
Dashboards (approx. 3 days)
Library: Dash
Structure of dashboards - Dash components
Customizing dashboards
Callbacks
Text Mining (approx. 1 day)
Data preprocessing
Visualization
Library: SpaCy
Project work (approx. 5 days)
To consolidate the content learned
Presentation of the project results
Machine Learning
Introduction to Machine Learning (approx. 5 days)
Why machine learning?
Application examples
Supervised learning, unsupervised learning, partially supervised learning, reinforcement learning
Examples of data sets
Getting to know data
Training, validation and test data
Viewing data
Making predictions
Supervised learning (approx. 5 days)
Classification and regression
Generalization, overfitting and underfitting
Size of the data set
Algorithms for supervised learning
Linear models
Bayes classifiers
Decision trees
Random Forest
Gradient Boosting
k-nearest neighbors
Support Vector Machines
Conditional Random Field
Neural Networks and Deep Learning
Probabilities
Unsupervised learning (approx. 5 days)
Types of unsupervised learning
Preprocessing and scaling
Data transformations
Scaling training and test data
Dimension reduction
Feature engineering
Manifold learning
Principal component decomposition (PCA)
Non-negative matrix factorization (NMF)
Manifold learning with t-SNE
Cluster analysis
k-Means clustering
Agglomerative clustering
Hierarchical cluster analysis
DBSCAN
Cluster algorithms
Evaluation and improvement (approx. 2 days)
Model selection and model evaluation
Tuning the hyperparameters of an estimator
Cross-validation
Grid search
Evaluation metrics
Classification
Project work (approx. 3 days)
To consolidate the content learned
Presentation of the project results
Deep learning
Introduction to Deep Learning (approx. 1 day)
Deep learning as a type of machine learning
Fundamentals of neural networks (approx. 4 days)
Perceptron
Calculation of neural networks
Optimization of model parameters, backpropagation
Deep learning libraries
Regression vs. classification
Learning curves, overfitting and regularization
Hyperparameter optimization
Stochastic gradient descent (SGD)
Momentum, Adam Optimizer
Learning rate
Convolutional Neural Network (CNN) (approx. 2 days)
Image classification
Convolutional layers, pooling layers
Reshaping layers, flattening, global-average pooling
CNN architectures ImageNet-Competition
Deep neural networks, vanishing gradients, skip connections, batch normalization
Transfer Learning (approx. 1 day)
Adaptation of models
Unsupervised pre-training
Image data augmentation, explainable AI
Regional CNN (approx. 1 day)
Object localization
Regression problems
Branched neural networks
Methods of creative image generation (approx. 1 day)
Generative Adversarial Networks (GAN)
Deepfakes
Diffusion models
Recurrent neural networks (approx. 2 days)
Sequence analysis
Recurrent layers
Backpropagation through time (BPTT)
Analysis of time series
Exploding and vanishing gradient problems
LSTM (Long Short-Term Memory)
GRU (Gated Recurrent Unit)
Deep RNN
Deep LSTM
Text processing using neural networks (approx. 2 days)
Text preprocessing
Embedding layers
Text classification
Sentiment analysis
Transfer learning in NLP
Translations
Sequence-to-sequence method, encoder-decoder architecture
Language models (approx. 1 day)
BERT, GPT
Attention layers, Transformers
Text generation pipelines
Summarization
chatbots
Deep reinforcement learning (approx. 1 day)
Control of dynamic systems
Agent systems
Training through rewards
Policy Gradients
Deep Q-learning
Bayesian neural networks (approx. 1 day)
Uncertainties in neural networks
Statistical evaluation of forecasts
Confidence, standard deviation
Unbalanced data
Sampling methods
Project work (approx. 3 days)
To consolidate the content learned
Presentation of the project results
Changes are possible. The course content is updated regularly.
You are familiar with processes relating to the consolidation, preparation, enrichment and transfer of data as well as the application of machine learning. You are also familiar with the areas of application of deep learning and the functioning of neural networks.
Data scientists are used in companies that want to optimize their business processes based on the analysis and modelling of large amounts of data, such as in logistics, online retail and marketing, in the energy industry and also in the healthcare sector.
Your meaningful certificate provides a detailed insight into the qualifications you have acquired and improves your career prospects.
Didactic concept
Your lecturers are highly qualified both professionally and didactically and will teach you from the first to the last day (no self-study system).
You will learn in effective small groups. The courses usually consist of 6 to 25 participants. The general lessons are supplemented by numerous practical exercises in all course modules. The practice phase is an important part of the course, as it is during this time that you process what you have just learned and gain confidence and routine in its application. The final section of the course involves a project, a case study or a final exam.
Virtual classroom alfaview®
Lessons take place using modern alfaview® video technology - either from the comfort of your own home or at our premises at Bildungszentrum. The entire course can see each other face-to-face via alfaview®, communicate with each other in lip-sync voice quality and work on joint projects. Of course, you can also see and talk to your connected trainers live at any time and you will be taught by your lecturers in real time for the entire duration of the course. The lessons are not e-learning, but real live face-to-face lessons via video technology.
The courses at alfatraining are funded by Agentur für Arbeit and are certified in accordance with the AZAV approval regulation. When submitting a Bildungsgutscheinor Aktivierungs- und Vermittlungsgutschein, the entire course costs are usually covered by your funding body.
Funding is also possible via Europäischen Sozialfonds (ESF), Deutsche Rentenversicherung (DRV) or regional funding programs. As a regular soldier, you have the option of attending further training courses via Berufsförderungsdienst (BFD). Companies can also have their employees qualified via funding from Agentur für Arbeit (Qualifizierungschancengesetz).