Hadoop Data Infrastructure Engineer
Synechron Technologies
Chennai
Not disclosed
Job Details
Job Description
Hadoop Data Infrastructure Engineer | Cloud Migration, Cluster Support, Automation, Performance Optimization, Security & High Availability
Job Summary
Synechron is seeking a highly skilled Hadoop Data Infrastructure Engineer to lead the design, implementation, and management of enterprise-grade big data infrastructure. This role involves supporting high-availability Hadoop clusters, optimizing performance, automating deployment workflows, and integrating new technologies into large-scale data environments. You will work closely with platform, security, and data science teams to ensure scalable, reliable, and secure systems that support advanced analytics and data-driven initiatives.
Software Requirements
Required: Cloudera platform (CDH/HDP) or Hadoop 2.x/3.x, Terraform, Ansible, Git, Jenkins, Shell and Python scripting, monitoring tools (Splunk, CloudWatch, New Relic, ELK Stack), Linux OS, network and security tools (firewalls, VPNs, encryption APIs)
Preferred: Spark, Hive, Presto, Kafka, HDFS, Redshift, AWS EMR, Azure HDInsight, Kubernetes, GitHub Actions, Prometheus, Grafana
Experience level: 5+ years supporting large-scale Hadoop clusters, migration projects, and automation in enterprise environments
Overall Responsibilities
Lead the deployment, support, and optimization of Hadoop clusters supporting data analytics platforms
Automate provisioning, patching, and configuration management of Hadoop data environments using Terraform, Ansible, and scripting
Support high-availability architectures, monitor system health, and perform capacity planning and tuning
Collaborate with data and analytics teams to enable data ingestion, processing, and storage workflows
Conduct root cause analysis of system incidents, optimize performance, and implement preventive measures
Drive automation initiatives to reduce manual operations and improve system resilience
Maintain operational documentation, runbooks, and compliance records for data infrastructure
Support migration activities and cloud integration (AWS, Azure) for hybrid data environments
Ensure compliance with security, data governance, and enterprise standards
Technical Skills (By Category)
Programming Languages:
Essential: Shell scripting, Python, SQL (query optimization, data validation)
Preferred: Java, Scala, Spark APIs for data processing tasks
Databases/Data Management:
Enterprise Hadoop-related data storage (HDFS, HBase), Redshift, cloud data lakes (ADLS, S3)
Cloud Technologies:
Basic knowledge of AWS, Azure, or GCP for cloud migration, automation, and managed data services (preferred)
Frameworks and Libraries:
Spark, Hive, Presto, Kafka, Kafka Connect, Apache Ranger, Knox security gateways
Development Tools & Methodologies:
Terraform, Ansible, Jenkins, Git, CI/CD pipelines, Agile/Scrum, DevSecOps principles
Security & Compliance:
Encryption, Kerberos, LDAP integration, IAM policies, data masking, audit logging
Experience Requirements
5+ years of experience supporting enterprise Hadoop clusters, data lakes, or big data ecosystems
Proven success in automating deployment, patching, and scaling of Hadoop environments
Demonstrable experience supporting high-availability clusters and performance tuning at scale
Previous involvement in cloud data migration or hybrid cloud integration projects preferred
Industry experience in banking, finance, telecom, or healthcare sectors supporting data analytics is advantageous; extensive enterprise support experience is necessary
Day-to-Day Activities
Manage, support, and optimize Hadoop clusters supporting enterprise analytics workflows
Develop and maintain automation scripts and IaC to support provisioning and configuration management
Monitor system health, optimize performance, and troubleshoot outages or security incidents
Lead capacity planning, upgrade, and patch management activities supporting high availability
Support migration of data and operational workflows into cloud or hybrid environments
Collaborate with data science, security, and platform teams to implement best practices
Maintain documentation, incident reports, and operational runbooks
Conduct root cause analysis, performance tuning, and proactive system enhancements
Qualifications
Bachelor’s or Master’s degree in Computer Science, Data Science, or a related field
5+ years of experience supporting large-scale enterprise Hadoop clusters or big data platforms
Certifications such as Cloudera Certified Administrator (CCA) or AWS/Azure Data Engineer are a plus
Strong scripting and automation skills for deployment and operational management
Proven experience supporting large-scale, high-availability data environments
Excellent troubleshooting, communication, and documentation skills
Professional Competencies
Critical thinking and analytical problem-solving in complex data environments
Leadership and mentoring skills to guide support teams and foster best practices
Strong stakeholder management for coordinating cross-team activities
Adaptability to evolving tech environments and cloud services support
Ownership mentality for system reliability, security, and continuous improvement
Time management and prioritization skills in a fast-paced, high-impact setting
SYNECHRON’S DIVERSITY & INCLUSION STATEMENT
Diversity & Inclusion are fundamental to our culture, and Synechron is proud to be an equal opportunity workplace and an affirmative action employer. Our Diversity, Equity, and Inclusion (DEI) initiative ‘Same Difference’ is committed to fostering an inclusive culture that promotes equality, diversity, and an environment that is respectful to all. As a global company, we strongly believe that a diverse workforce helps build stronger, more successful businesses. We encourage applicants of all backgrounds, races, ethnicities, religions, ages, marital statuses, genders, sexual orientations, and disabilities to apply. We empower our global workforce by offering flexible workplace arrangements, mentoring, internal mobility, learning and development programs, and more.
All employment decisions at Synechron are based on business needs, job requirements and individual qualifications, without regard to the applicant’s gender, gender identity, sexual orientation, race, ethnicity, disabled or veteran status, or any other characteristic protected by law.
Experience Level
Senior Level
Job role
Work location
Chennai, India
Department
Software Engineering
Role / Category
Software Development
Employment type
Full Time
Shift
Day Shift
Job requirements
Experience
Min. 5 years
About company
Name
Synechron Technologies
Job posted by Synechron Technologies