AI Software Stack Deployment Architect

SanDisk India Device Design Centre Pvt.Ltd
Bengaluru/Bangalore
Not disclosed
Work from OfficeWork from Office
Full TimeFull Time
Min. 10 yearsMin. 10 years

Job Description

AI SW Stack Deployment Architect

Company Description

Sandisk understands how people and businesses consume data and we relentlessly innovate to deliver solutions that enable today’s needs and tomorrow’s next big ideas. With a rich history of groundbreaking innovations in Flash and advanced memory technologies, our solutions have become the beating heart of the digital world we’re living in and that we have the power to shape.

Sandisk meets people and businesses at the intersection of their aspirations and the moment, enabling them to keep moving and pushing possibility forward. We do this through the balance of our powerhouse manufacturing capabilities and our industry-leading portfolio of products that are recognized globally for innovation, performance and quality.

Sandisk has two facilities recognized by the World Economic Forum as part of the Global Lighthouse Network for advanced 4IR innovations. These facilities were also recognized as Sustainability Lighthouses for breakthroughs in efficient operations. With our global reach, we ensure the global supply chain has access to the Flash memory it needs to keep our world moving forward.

Job Description

Role Overview

We are looking for a Software Architect (12+ years experience) to lead the application/framework layer and deployment stack for the Next Generation Accelerator AI platform. This role owns how models run on Next Generation Accelerator—from vLLM / PyTorch / TensotFlow/XLA to production deployment—ensuring correctness, performance, and scalability.

Key Responsibilities

  • Architect integration of vLLM, PyTorch, and TensorFlow, JAX/XLA into Next Generation Accelerator stack
  • Define framework → compiler → runtime APIs and contracts
  • Own LLM execution behavior (batching, KV cache, streaming inference)
  • Design and implement end-to-end deployment workflows (packaging, versioning, reproducibility)
  • Drive performance optimization across model → framework → runtime
  • Work cross-functionally with compiler, runtime, and low-level SW teams
  • Support customer workloads, model onboarding, and debugging

Impact

Own customer-visible AI execution and deployment on Next Generation Accelerator, closing the gap between models and system performance, and enabling enterprise-grade AI solutions

Qualifications

Required Qualifications

  • 10+ years in AI/ML systems or software architecture
  • Strong experience with PyTorch / Transformers / LLMs
  • Hands-on experience with LLM deployment and scalable inference engine systems e.g. vLLM, Triton, SGLang etc.
  • Experience building scalable AI platforms (cloud/edge)
  • Expertise in system design, APIs, and cross-layer integration

Preferred Qualifications

  • Experience with vLLM or similar LLM serving systems
  • Familiarity with XLA / MLIR / compiler frameworks
  • Exposure to AI accelerators (GPU/NPU) and runtime systems

Experience in distributed or multi-agent AI systems

Additional Information

Sandisk thrives on the power and potential of diversity. As a global company, we believe the most effective way to embrace the diversity of our customers and communities is to mirror it from within. We believe the fusion of various perspectives results in the best outcomes for our employees, our company, our customers, and the world around us. We are committed to an inclusive environment where every individual can thrive through a sense of belonging, respect and contribution.

Sandisk is committed to offering opportunities to applicants with disabilities and ensuring all candidates can successfully navigate our careers website and our hiring process. Please contact us at jobs.accommodations@sandisk.com to advise us of your accommodation request. In your email, please include a description of the specific accommodation you are requesting as well as the job title and requisition number of the position for which you are applying.

Experience Level

Senior Level

Job role

Work location
Work locationBengaluru, KA, India
Department
DepartmentSoftware Engineering
Role / Category
Role / CategorySoftware Backend Development
Employment type
Employment typeFull Time
Shift
ShiftDay Shift

Job requirements

Experience
ExperienceMin. 10 years

About company

Name
NameSanDisk India Device Design Centre Pvt.Ltd
Job posted by SanDisk India Device Design Centre Pvt.Ltd

Similar jobs you can apply for

Accounts / Finance
Tranquil HR Solutions

Accountant/ Accounts Executive

Tranquil HR Solutions
Kalyan Nagar, Bengaluru/Bangalore
₹20,000 - ₹30,000
Work from Office
Full Time
Min. 3 years
Good (Intermediate / Advanced) English
TalentRouters

Technical Sales Engineer - Spindles , CNC, VMC, Rotary & Grinding Tables , Angle Heads

TalentRouters
Bengaluru/Bangalore
₹30,000 - ₹45,000
Field Job
Full Time
Min. 3 years
Good (Intermediate / Advanced) English
Icici Prudential Life Insurance Company Limited

Relationship Manager

Icici Prudential Life Insurance Company Limited
Ashok Nagar, Bengaluru/Bangalore
₹20,500 - ₹25,000
Work from Office
Full Time
Min. 6 months
Basic English
Erode Shree Amman Mess

Human Resource Executive

Erode Shree Amman Mess
Brookefield, Bengaluru/Bangalore
₹18,000 - ₹20,000
Work from Office
Full Time
Any experience
Basic English
Krazybee Services Private Limited

Tele Collections Executive – Personal Loan

Krazybee Services Private Limited
Brookefield, Bengaluru/Bangalore
₹18,000 - ₹25,000
Work from Office
Full Time
Min. 6 months
Good (Intermediate / Advanced) English
Niva Bupa Health Insurance Company

Relationship Manager

Niva Bupa Health Insurance Company
Mangammanapalya, Bengaluru/Bangalore
₹25,000 - ₹35,000
Work from Office
Full Time
Min. 1 year
Basic English

You can expect a minimum salary of 0 INR. The salary offered will depend on your skills, experience and performance in the interview.

The candidate should have completed the required education and people who have 10 to 31 years are eligible to apply for this job. You can apply for more jobs in Bengaluru/Bangalore to get hired quickly.

The candidate should have sound communication skills and sound communication skills for this job.

Both Male and Female candidates can apply for this job.

No, it's not a work from home job and can't be done online. You can explore and apply for other work from home jobs in Bengaluru/Bangalore at apna.

No work-related deposit needs to be made during your employment with the company.

Go to the apna app and apply for this job. Click on the apply button and call HR directly to schedule your interview.

The last date to apply for this job is . For more details, download apna app and find Full Time jobs in Bengaluru/Bangalore . Through apna, you can find jobs in 64 cities across India. Join NOW!