If you have strong Python programming skills and want to start a career in the Data Software Engineering (prev. Big Data) sphere, then this program is what you need.

Our learning path includes two stages, offering you a step-by-step approach to gaining in-depth knowledge in this direction. By participating, you will have the opportunity to:

  • Gain expertise in software development technologies and Git
  • Dive into the capabilities of the main public cloud platforms
  • Explore the design, development and maintenance of large data-volume software systems
  • Learn large-scale data handling technologies (Hadoop, Apache Spark, Kafka, etc.)

Training process

The program consists of two stages:

In the initial stage, you'll have the flexibility to explore self-study materials at your own pace. Through completing assigned tasks, you'll receive immediate automated feedback. Typically, this stage takes about 3 months to finish, requiring around 10-12 hours of weekly involvement.

If you demonstrate promising results and pass a technical interview successfully, you'll advance to the next phase. This stage usually spans about 3 months and entails approximately 20 hours of weekly engagement. As part of this stage, you'll have the opportunity to join our mentoring program, featuring personalized one-on-one sessions with our production experts.

Upon successfully completing the program, you'll have the chance to align your career path with your demonstrated skills and explore available opportunities within EPAM.

How to get started?

  • Register on this page. Once you fill in all the required fields, you will find the confirmation with more details in your mailbox or notifications tab.
  • Take an English test available in your profile. You should complete it within two weeks after your registration and reach the B1 (Intermediate) level result to proceed.
  • Pass a preliminary technical test on Python. You should take it within two weeks by accessing the “My enrollments” section after logging in on our website.
  • Receive a link to the Learn platform and start studying. We will send it via your profile's email right after the successful tests' completion and your application review.
  • Please check your profile's mailbox regularly to make sure you get all important notifications.

What will you learn?

The first stage of the program will take ~3 months to complete (recommended time), during which you will dive into two training modules.

Module 1. Basic technical self-paced courses:

  • Git
  • Version Controlling Concepts
  • Version Control Types
  • Branching, Merging and Conflict Solving
  • Software Development Methodologies
  • Software Development Model
  • Waterfall
  • Agile
  • Scrum
  • Kanban
  • Test-Driven Development
  • Cloud Overview
  • Cloud Core Concepts
  • Cloud Service Models
  • Cloud Ownership
  • Public Cloud Platforms Overview
  • Introduction to DevOps
  • Docker Image / Docker Container / Docker Hub
  • Kubernetes and Basic Concept
  • Continuous Integration, Continuous Delivery, Continuous Deployment
  • Virtual Machine / Container
  • Python Core
  • Data Types
  • Functions
  • Modules and Packages
  • Lists and Tuples
  • Queues and Stacks
  • Sets
  • Dictionaries
  • Conditional Execution
  • Loops
  • Error Handling Core

The completion of this module requires a score of at least 75%.

Module 2. Data Engineering introductory self-paced courses:

  • DB and SQL Fundamentals
  • DB Basics
  • TCL Statements (Transaction Control Language)
  • DML Statements (Data Manipulation Language)
  • DDL Statements (Data Definition Language)
  • DCL (Data Control Language)
  • SQL for Analysis
  • Data Modelling and Data Architecture
  • OLAP vs OLTP
  • Kimball and Inmon Approaches DWH Layers
  • Dimension. Types. SCD
  • Fact Table. Types
  • ETL and ELT
  • NoSQL
  • NoSQL vs RDBMS
  • CAP Theorem: No SQL Types
  • Spark Cassandra
  • MongoDB
  • HBase
  • Big Data / Data Lake Concept
  • Data Lake vs Data Warehouse: Key Differences

The completion of this module requires a score of at least 85%.

Technical Test

Based on the final test result and technical interview, we will invite you to the second stage, lasting ~3 months and embracing the following topics:

  • Cloud (Azure or AWS)
  • Spark
  • Databricks
  • Kafka
  • Hadoop
  • Airflow
  • Elastic
  • Data Flow & Pipelining
  • Orchestration & Scheduling

  • Education within an IT company. As a leading software engineering company, we will help you explore emerging technologies and best practices, ensuring you are equipped with the knowledge that the current market seeks.
  • Top-notch learning materials. Our curriculum is designed by Data specialists with extensive project experience and validated in hundreds of training runs.
  • Practice-oriented approach. This comprehensive program focuses on providing you with hands-on experience and practical application of the concepts learned.
  • Deep dive into the specialization. Our graduates become highly skilled specialists ready to face complex technical challenges and work with the world's leading customers.
  • Support from experienced mentors. We will guide you at advanced training stages, covering your questions and sharing feedback on assigned tasks.

What is required for training:

  • English fluency at the Upper-Intermediate level (B2) or higher
  • Strong Python programming skills
  • Practical experience in one of the additional languages (Java or Scala)
  • Solid knowledge of Relational Database Management System (DBMS) theory
  • Good command of Structured Query Language (DML/DDL/DCL/TCL)
  • Theoretical knowledge of DWH/Data Lake and one of the visualization tools
  • Basic understanding of Linux, Docker and Kubernetes

Please read this info before registration:

This training is for citizens of Poland and specialists relocated to this country for a permanent stay.

Please take into account that the training format is ongoing, however registration may close within a year.

Active participants of trainings and EPAM Systems Company Employees are not allowed to register for the training. Please, contact your manager regarding the positions available.

Please note that each stage typically takes around 3 months to complete. If, during this time, we don't observe progress, your application may be automatically withdrawn at any stage.

EPAM Systems Inc. (EPAM) is a leading digital transformation services and product engineering company. Since 1993, we have used our software engineering expertise to become a leading global provider of digital engineering, cloud and AI-enabled transformation services, as well as a leading business and experience consulting partner for global enterprises and ambitious startups. We address our clients’ transformation challenges by fusing EPAM Continuum’s integrated strategy, experience and technology consulting with our 30+ years of engineering execution to speed our clients’ time to market and drive greater value from their innovations and digital investments. Learn more at www.epam.com.