About Us
At Servable, we’re not just building another AI platform—we’re pioneering how enterprises securely and efficiently adopt AI. Our focus is on fine-tuning, deploying, and serving large language models (LLMs) while optimizing scalability, performance, and enterprise-grade security.
We are seeking an experienced and innovative AI/ML Engineer to join our team. In this role, you will work at the intersection of machine learning and engineering to build, optimize, and deploy state-of-the-art AI solutions that drive the functionality and performance of our platform.
Responsibilities
- Build, fine-tune, and deploy large language models (LLMs) using techniques like LoRA and distillation.
- Optimize AI inference for low-latency, high-throughput performance with tools like vLLM and TensorRT.
- Design robust pipelines for training, fine-tuning, and evaluation.
- Design and implement AI models tailored to enterprise use cases.
- Work with engineers and product teams to deliver customer-driven AI solutions.
- Research and apply the latest advancements in AI/ML to improve our platform.
Qualifications
Must-Have Skills:
- 2+ years of experience with AI/ML, particularly in NLP and LLMs.
- Strong Python skills and proficiency with frameworks like PyTorch or TensorFlow.
- Experience with inference optimization tools (vLLM, TensorRT-LLM, llama.cpp).
- Familiarity with cloud platforms (AWS, Azure, GCP) and distributed systems.
- Expertise in fine-tuning methods like LoRA.
- Knowledge of containerization (Docker) and orchestration (Kubernetes).
Nice-to-Have Skills:
- Experience with synthetic data generation and data pipelines.
- Understanding of security and compliance in enterprise AI systems in regulated industries (e.g. finance, healthcare).
- Hands-on experience with hybrid or multi-cloud AI workloads.
- Exposure to designing AI solutions for client-specific use cases.
Why Join Us?
- Impactful Work: Play a critical role in building AI solutions that transform industries.
- Cutting-Edge Technology: Work on state-of-the-art AI/ML projects with the latest tools and frameworks.
- Dynamic Startup Environment: Be part of a fast-paced, collaborative team tackling real-world challenges.
- Growth Opportunities: Take ownership of your work and grow alongside the company.
- Competitive Compensation: We offer a competitive salary, equity options, and room for growth.