Data Engineer



Not disclosed

Work from Office

Full Time

Min. 1 year

Job Details

Job Description

Senior Principal Consultant- Lead Data Engineer

Genpact (NYSE: G) is a global professional services and solutions firm delivering outcomes that shape the future. Our 125,000+ people across 30+ countries are driven by our innate curiosity, entrepreneurial agility, and desire to create lasting value for clients. Powered by our purpose – the relentless pursuit of a world that works better for people – we serve and transform leading enterprises, including the Fortune Global 500, with our deep business and industry knowledge, digital operations services, and expertise in data, technology, and AI.

Inviting applications for the role of Senior Principal Consultant- Lead Data Engineer! Responsibilities • (The primary tasks, functions and deliverables of the role) • Design and build reusable components, frameworks and libraries at scale to support analytics products • Design and implement product features in collaboration with business and Technology stakeholders • Identify and solve issues concerning data management to improve data quality • Clean, prepare and optimize data for ingestion and consumption • Collaborate on the implementation of new data management projects and re-structure of the current data architecture • Implement automated workflows and routines using workflow scheduling tools • Build continuous integration, test-driven development and production deployment frameworks • Analyze and profile data for designing scalable solutions • Troubleshoot data issues and perform root cause analysis to proactively resolve product and operational issues

Qualifications we seek in you! Minimum Qualifications

Experience: • Strong understanding of data structures and algorithms • Strong understanding of solution and technical design • Has a strong problem solving and analytical mindset. • Able to influence and communicate effectively, both verbally and written, with team members and business stakeholders • Able to quickly pick up new programming languages, technologies, and frameworks • Experience building cloud scalable, real time and high-performance data lake solutions • Fair understanding of developing complex data solutions • Experience working on end-to-end solution design • Willing to learn new skills and technologies • Has a passion for data solutions

Required skill 1. Hands on experience in Databricks and AWS - EMR [Hive, Pyspark], S3, Athena. 2. Familiarity with Spark Structured Streaming 3. experience working experience with Hadoop stack dealing huge volumes of data in a scalable fashion 4. hands-on experience with SQL, ETL, data transformation and analytics functions 5. hands-on Python experience including Batch scripting, data manipulation, distributable packages 6. experience working with batch orchestration tools such as Apache Airflow or equivalent, preferable Airflow 7. working with code versioning tools such as GitHub or BitBucket; expert level understanding of repo design and best practices 8. Familiarity with deployment automation tools such as Jenkins 9. hands-on experience designing and building ETL pipelines; expert with data ingest, change data capture, data quality; hand on experience with API development 10. designing and developing relational database objects; knowledgeable on logical and physical data modelling concepts; some experience with Snowflake 11. Familiarity with Tableau or Cognos use cases

Preferred Qualifications 12. Familiarity with Agile; working experience preferred ⁃ ⁃ Genpact is an Equal Opportunity Employer and considers applicants for all positions without regard to race, color, religion or belief, sex, age, national origin, citizenship status, marital status, military/veteran status, genetic information, sexual orientation, gender identity, physical or mental disability or any other characteristic protected by applicable laws. Genpact is committed to creating a dynamic work environment that values diversity and inclusion, respect and integrity, customer focus, and innovation. Get to know us at and on LinkedIn, X, YouTube, and Facebook. Furthermore, please do note that Genpact does not charge fees to process job applications and applicants are not required to pay to participate in our hiring process in any other way. Examples of such scams include purchasing a 'starter kit,' paying to apply, or purchasing equipment or training

Job role

Work location



Data Science & Analytics

Role / Category

Data Science & Machine Learning

Employment type

Full Time


Day Shift

Job requirements


Min. 1 year

About company



Job posted by Genpact

Apply on company website

Follow us on social media

© 2024 Apna | All rights reserved Privacy Policy Terms & Conditions