Deploy LLMs in Minutes with the New LLM Job Type
proxiML now offers a dedicated LLM job type that lets you deploy pre-configured large language models as managed inference endpoints in minutes. Select a model family and size from the platform and get an OpenAI-compatible endpoint with no custom serving commands, Docker images, or manual checkpoint setup required. Currently supported families include Gemma 4, Qwen 3.5, and Qwen 3.6.