We’re seeking a Senior AI/ML Engineer who will design, fine-tune, and deploy machine learning models, including LLMs, within our serverless AWS infrastructure. Collaborating with backend engineers and product teams, you’ll integrate AI solutions focused on scalability, performance, and seamless deployment.
We’re looking for expertise in Python, AI/ML development, LLM fine-tuning, API design, and AWS serverless architecture to drive innovation and deliver impactful AI-powered solutions.
Title: Senior AI/ML Engineer (Backend)
Department: Product Development
Location: Bangalore (in office)
The Product Development Team transforms ideas into innovative, AI-driven products that enhance customer lives and drive organizational growth. With a focus on creating solutions that exceed expectations and become key revenue drivers, we redefine possibilities through cutting-edge technology, ensuring progress, impact, and success at every step.
Deliver High-Performance AI Models: Design, fine-tune, and deploy advanced ML models, particularly LLMs, to improve scalability, accuracy, and user outcomes.
Implement Scalable Solutions: Build serverless AI applications using AWS (Lambda, API Gateway, DynamoDB) to ensure cost-efficiency and reliability.
Enable Seamless Integration: Develop robust GraphQL and RESTful APIs to integrate AI models into real-time workflows and product features.
Ensure Operational Excellence: Set up and maintain CI/CD pipelines, monitor model performance, and optimize infrastructure for cost and efficiency.
Drive Collaboration and Innovation: Work with cross-functional teams to embed AI/ML into products and stay ahead of emerging trends in AI technology.
Optimize Data Utilization: Streamline data pipelines and ETL processes to support efficient model training and deployment.
Programming Languages:
Expertise in Python for developing and deploying machine learning models with a focus on efficient and maintainable code.
Machine Learning Frameworks:
Proficiency in TensorFlow, PyTorch, and Hugging Face Transformers for developing and fine-tuning advanced AI models, including LLMs.
AWS Serverless Architecture:
Experience in building scalable and cost-efficient serverless solutions using AWS services like Lambda, API Gateway, S3, and DynamoDB.
LLMs and Model Fine-Tuning:
Skilled in fine-tuning pre-trained models like GPT and LLaMA for domain-specific tasks, including prompt engineering and performance optimization.
Vector Search and RAG:
Familiarity with vector databases (e.g., Faiss, Pinecone) and Retrieval-Augmented Generation (RAG) techniques to enhance model accuracy and relevance.
API Development:
Capable of creating GraphQL and RESTful APIs using Flask, FastAPI, or AWS AppSync to enable real-time model inference.
Model Deployment and Monitoring:
Experience with CI/CD pipelines and AWS CloudWatch to ensure smooth deployment, scaling, and performance monitoring of AI/ML models.
MLOps:
Knowledge of operational best practices for deploying, optimizing, and monitoring machine learning models in production.
Data Engineering:
Proficient in data handling and ETL workflows using tools like SQL, Spark, and AWS S3 for effective data processing and integration.
At Softway, we redefine possibilities through technology. By joining us, you’ll:
Be part of a flat hierarchy where your ideas are valued.
Work on cutting-edge AI solutions across diverse domains.
Thrive in an ego-free culture that emphasizes innovation, collaboration, and growth.
Enjoy a balanced, fulfilling work environment with a competitive salary and exciting perks.
Join us to lead, innovate, and grow with a supportive and dynamic team.