Big Data Specialist

The course teaches the basics of big data and shows how modern technologies and frameworks are used to store, process and analyze large amounts of data. Participants will learn about cloud-based big data solutions, Apache Spark, data pipelines, data lakes and NoSQL databases. In addition, methods of data visualization, the basics of artificial intelligence and important aspects of data governance, data protection and data ethics are covered.

  • Certificates: Certificate "Big Data Specialist"
  • Examination: Praxisbezogene Projektarbeit mit Abschlusspräsentation
  • Teaching Times: Full-time
    Monday to Friday from 8:30 a.m. to 3:35 p.m. (in weeks with public holidays from 8:30 a.m. to 5:10 p.m.)
  • Language of Instruction: German
  • Duration: 4 Weeks

What is Big Data? (approx. 1 day)

Volume, Velocity, Variety, Value, Veracity

Opportunities and risks of large amounts of data

Differentiation: business intelligence, data analytics, data science

Introduction to data mining

Role of AI and data-driven systems in the big data environment


Introduction to big data frameworks (approx. 2 days)

Big data solutions in the cloud (overview of AWS, Azure, GCP)

Data access patterns

Data storage

Introduction to data lakes and data warehouses

Overview of Apache Hadoop and Spark


Distributed data processing with Spark (approx. 3 days)

Basics of distributed systems

Apache Spark (Core and SQL)

Comparison of different approaches to data processing

Processing large amounts of data

Introduction to simple ML workflows with Spark


Data pipelines and data integration (approx. 2 days)

ETL and ELT processes

Batch vs. streaming processing

Basics of data pipelines

Introduction to orchestration (e.g. Airflow overview)

Data quality and preparation


Components (approx. 2 days)

Brief presentation of various tools

Data transfer

Overview of resource management in big data systems

Hadoop ecosystem

Apache Spark deepening

Introduction to streaming technologies


NoSQL and data storage (approx. 2 days)

CAP theorem

ACID and BASE

Types of databases

HBase

Introduction to document-oriented databases

Introduction to storage formats

Overview of data lakehouse approaches


Big Data Visualization (approx. 2 days)

Theories of visualization

Diagram selection

New types of diagrams

Tools for data visualization

Introduction to BI tools (e.g. Power BI, Tableau)

Basics of data-driven decision making


Data governance and data protection (approx. 1 day)

Basics of the GDPR in the data context

Data ethics and responsible handling of data

Data quality and governance concepts

Access controls and security

Fundamentals of responsible AI use


Project work (approx. 5 days)

To consolidate the content learned

Presentation of the project results



Changes are possible, the course content is updated regularly.

Programming skills (ideally Python) and experience with databases (SQL) are required.

After the course, you will be able to process large, unstructured amounts of data with the help of tools and cloud technologies. You will have knowledge of current big data frameworks, be able to analyse data, classify AI-supported processes and visualize results in an appealing way.

The course is aimed at people with a degree in computer science, business informatics, mathematics or a comparable qualification.

As companies have to manage and structure ever-increasing volumes of data to evaluate and set objectives for their business processes, data processing skills are in demand in all sectors.

Your meaningful certificate provides a detailed insight into the qualifications you have acquired and improves your career prospects.

Didactic concept

Your lecturers are highly qualified both professionally and didactically and will teach you from the first to the last day (no self-study system).

You will learn in effective small groups. The courses usually consist of 6 to 25 participants. The general lessons are supplemented by numerous practical exercises in all course modules. The practice phase is an important part of the course, as it is during this time that you process what you have just learned and gain confidence and routine in its application. The final section of the course involves a project, a case study or a final exam.

 

Virtual classroom alfaview®

Lessons take place using modern alfaview® video technology - either from the comfort of your own home or at our premises at Bildungszentrum. The entire course can see each other face-to-face via alfaview®, communicate with each other in lip-sync voice quality and work on joint projects. Of course, you can also see and talk to your connected trainers live at any time and you will be taught by your lecturers in real time for the entire duration of the course. The lessons are not e-learning, but real live face-to-face lessons via video technology.

 

The courses at alfatraining are funded by Agentur für Arbeit and are certified in accordance with the AZAV approval regulation. When submitting a Bildungsgutscheinor Aktivierungs- und Vermittlungsgutschein, the entire course costs are usually covered by your funding body.
Funding is also possible via Europäischen Sozialfonds (ESF), Deutsche Rentenversicherung (DRV) or regional funding programs. As a regular soldier, you have the option of attending further training courses via Berufsförderungsdienst (BFD). Companies can also have their employees qualified via funding from Agentur für Arbeit (Qualifizierungschancengesetz).

We will gladly advise you free of charge.

0800 3456-500 Mon. - Fri. from 8 am to 5 pm
free of charge from all German networks.

Contact

We will gladly advise you free of charge. 0800 3456-500 Mon. - Fri. from 8 am to 5 pm free of charge from all German networks.