Data Engineer

Springer Nature India Pvt Ltd

Pune

Not disclosed

Work from Office

Full Time

Min. 3 years

Job Details

Job Description

Job Title:  Data Engineer

Location: Pune

  

About Springer Nature Group  

Springer Nature opens the doors to discovery for researchers, educators, clinicians and other professionals. Every day, around the globe, our imprints, books, journals, platforms and technology solutions reach millions of people. For over 180 years our brands and imprints have been a trusted source of knowledge to these communities and today, more than ever, we see it as our responsibility to ensure that fundamental knowledge can be found, verified, understood and used by our communities – enabling them to improve outcomes, make progress, and benefit the generations that follow. Visit group.springernature.com and follow @SpringerNature / @SpringerNatureGroup 

  

About the Role:

Are you ready to help us build core data products that shape the future of research publishing?

Springer Nature is seeking a Data Engineer to join the Analytics Center of Excellence team, within SN Data - Data Competence Center. You will work closely with other engineers, data scientists and analysts to build and scale the data infrastructure behind our analytics products, which enable users to make data-driven decisions to improve our products, services and platforms. We’re looking for a blend of skills and attributes that make you a great fit for this role. If you don’t tick every box, don’t worry - we provide tailored learning and development programs to help you grow and succeed with us.

Role Responsibilities 

  • Design, implement, and optimize production data solutions, such as scalable data pipelines that create data products to meet business use cases. Existing pipelines handle both batch and streaming data using Apache Beam (Dataflow), enabling efficient ETL/ELT of structured and unstructured data.

  • Architect and maintain end-to-end data infrastructure on Google Cloud Platform (GCP), ensuring quality checks, performance, scalability, and security. 

  • Develop and manage data orchestration workflows using Apache Airflow, automating pipeline scheduling and monitoring. 

  • Build and deploy containerized, Python-based backend APIs backed by databases.

  • Work collaboratively with other engineers, using techniques like pair and ensemble programming, to foster collective code ownership.

  • Collaborate with Data Scientists, Analysts, and other team members as relevant to translate business requirements into scalable data solutions. 

  • Maintain and optimize CI/CD pipelines, ensuring secure and automated deployments. 

  • Support and enhance ML/AI solutions at scale, including deployment and monitoring of models in production environments. 

Experience, Skills & Qualifications

Essential

  • Bachelor’s degree in Engineering, Computer Science, or a related quantitative field. 

  • 3+ years of experience in data engineering with a strong focus on a cloud platform. GCP experience is preferred (including BigQuery, Dataflow, Dataform, Cloud Functions, Cloud Run, Pub/Sub, and Cloud Composer), but AWS/Azure experience is also welcome.

  • Strong SQL skills and proficiency in programming languages such as Python. 

  • Experience managing data pipelines using tools like Apache Beam, Airflow, and Docker/Kubernetes.

  • Experience with CI/CD tools, Terraform, and GitHub Actions. 

  • Excellent problem-solving skills and ability to work independently in a fast-paced environment. 

Desirable

  • An understanding of decentralized Data Mesh and Data Product architecture principles. 

  • Experience building and testing data pipelines with dbt.

  • Exposure to Vertex AI, Dash, and other ML/AI tools. 

  • Experience with DevOps practices, infrastructure as code, and secure deployment workflows. 

  • Strong communication and collaboration skills. 

At Springer Nature, we value the diversity of our teams and work to build an inclusive culture, where people are treated fairly and can bring their differences to work and thrive. We empower our colleagues and value their diverse perspectives as we strive to attract, nurture and develop the very best talent. Springer Nature was awarded Diversity Team of the Year at the 2022 British Diversity Awards. Find out more about our DEI work here: https://group.springernature.com/gp/group/taking-responsibility/diversity-equity-inclusion

If you have any access needs related to disability, neurodivergence or a chronic condition, please contact us so we can make all necessary accommodations.

For more information about career opportunities in Springer Nature, please visit https://springernature.wd3.myworkdayjobs.com/SpringerNatureCareers

Job Posting End Date: 15-01-2026

Job role

Work location: Pune [SNTPS Kharadi], India

Department: Data Science & Analytics

Role / Category: Data Science & Machine Learning

Employment type: Full Time

Shift: Day Shift

Job requirements

Experience: Min. 3 years

About company

Name: Springer Nature India Pvt Ltd

Job posted by Springer Nature India Pvt Ltd
