Outpost Services

Deploy and scale applications and machine learning workloads using Outpost Services.

Introduction

Outpost Services is a powerful platform designed to simplify the deployment and scaling of applications and machine learning workloads. With Outpost Services, you can efficiently serve models and applications that automatically adapt to traffic demands, ensuring optimal performance and maintaining high availability. Let's explore the key features and benefits that Outpost Services offers.

Key features

  • Custom, Reproducible Environments: Define your environment, including dependencies, using a straightforward YAML file. This ensures consistency and reproducibility across deployments, making it easy to manage and deploy your applications.
  • Flexible Model Serving: Outpost Services supports a wide range of serving frameworks, including vLLM, TGI, TensorRT-LLM, Triton, and more. Easily scale your models across multiple regions and cloud providers, ensuring efficient resource utilization.
  • Secure HTTPS Endpoint: Protect your services with HTTPS endpoints, custom domain support, and free SSL certificates. This ensures secure data transmission and enhances the credibility of your applications.
  • Built-in Autoscaling: Outpost Services automatically handles load-balancing and dynamically adjusts service replicas based on traffic demands. This ensures optimal resource utilization and helps maintain high performance.
  • Scale-To-Zero for Cost Optimization: With the scale-to-zero feature, your services can scale down to zero replicas during periods of inactivity, optimizing your costs by paying only for what you use.
  • Seamless Automatic Deployment: Integrate with Outpost Hub to enable automatic deployment from the latest commits. Streamline your development process and ensure your applications are always up-to-date.
  • Multi-Cloud and Multi-Region Scaling: Manage large-scale deployments across multiple clouds and regions to maximize GPU availability and performance. Leverage diverse locations and hardware options to optimize costs and ensure high availability.
  • Privacy and Control: Deploy within your own cloud VPCs to maintain control over your data and deployments, allowing you to meet specific security and compliance requirements.

Under the Hood

Here's a closer look at how Outpost Services works:

  • Process Group Management: Each service is backed by a process group that efficiently manages autoscaling, monitoring, recovery, and load balancing for its replicas, ensuring high availability and performance.
  • HTTP/HTTPS Endpoint Routing: Services are assigned an HTTP/HTTPS endpoint, which seamlessly routes incoming traffic to the appropriate replicas, regardless of their location.
  • Global Distribution of Replicas: Replicas can be distributed across different regions and cloud providers, reducing costs by leveraging varying pricing and availability while enhancing overall service availability.

Getting started

To begin your journey with Outpost Services, we've provided a step-by-step guide to help you create your first service. In this guide, you'll learn how to deploy a machine learning model-serving application and configure it for autoscaling:

Create Your First Service

Guide to deploy and scale your applications and machine learning workloads on Outpost.

With Outpost Services, you can focus on building innovative applications and models, leveraging a robust, scalable, and flexible infrastructure. Let Outpost handle the complexities of deployment and scaling, ensuring your users always have a smooth and reliable experience.