Platform for AI/ML and Data teams

Train, fine-tune, and deploy generative AI models with managed infrastructure, tools, and workflows.

Hero

Choose from leading open-source models

Llama-3 8B Instruct

Llama 3 is an auto-regressive language model that uses an optimized transformer architecture.

Mixtral-8x7B-Instruct

The Mixtral-8x7B Large Language Model (LLM) is a pretrained generative Sparse Mixture of Experts.

DBRX Instruct

DBRX Instruct is a mixture-of-experts (MoE) large language model trained from scratch by Databricks.

Inference Endpoints

Fast Inference with an easy-to-use API that scales as you grow

Deploy your LLM

4x Faster

inferencing compared to the other inference APIs.

11x Cheaper

model serving on any cloud or region with high availability.

Highly Scalable

inference to meet your API request volume.

Train/Fine-Tune

Train, Evaluate and Fine-Tune state-of-the-art ML models

Train

Train your own LLMs and other generative AI models on a robust and flexible infrastructure.

LLM (Large Language Model for Text Generation)

LDM (Latent Diffusion Model for Image Generation)

ECD (Encoder-Combiner-Decoder Neural Network Model)

GBM (Gradient Boosting Machine Tree-Based Model)

Fine-Tune

Improve your task accuracy by fine-tuning leading open-source models with your data.

LLM Fine-tuning
DreamBooth
Tabular
Text Classification
Seq2Seq
Image Classification
Image Classification

Summarization

Workflow Automation

Copilots

Analyzing Structured Data

Reinforcement Learning

Anomaly & Fraud Detection

Audio Classification

Medical Diagnosis Support

Personal Assistant

Medical Imaging

Customer Sentiment Analysis

Bot Detection

Tabular Classification

Object Detection

Image Classification

Automatic Speech Recognition

Depth Estimation

Image Segmentation

Translation

Video Classification

Tabular Regression

Summarization

Workflow Automation

Copilots

Analyzing Structured Data

Reinforcement Learning

Anomaly & Fraud Detection

Audio Classification

Medical Diagnosis Support

Personal Assistant

Medical Imaging

Customer Sentiment Analysis

Bot Detection

Tabular Classification

Object Detection

Image Classification

Automatic Speech Recognition

Depth Estimation

Image Segmentation

Translation

Video Classification

Tabular Regression

Summarization

Workflow Automation

Copilots

Analyzing Structured Data

Reinforcement Learning

Anomaly & Fraud Detection

Audio Classification

Medical Diagnosis Support

Personal Assistant

Medical Imaging

Customer Sentiment Analysis

Bot Detection

Tabular Classification

Object Detection

Image Classification

Automatic Speech Recognition

Depth Estimation

Image Segmentation

Translation

Video Classification

Tabular Regression

Summarization

Workflow Automation

Copilots

Analyzing Structured Data

Reinforcement Learning

Anomaly & Fraud Detection

Audio Classification

Medical Diagnosis Support

Personal Assistant

Medical Imaging

Customer Sentiment Analysis

Bot Detection

Tabular Classification

Object Detection

Image Classification

Automatic Speech Recognition

Depth Estimation

Image Segmentation

Translation

Video Classification

Tabular Regression

Job Queues & Batch Processing

Everything you need to build an AI product

Environments

Connected

30 cores • A100 • 200 GiB

Offline

4 cores • T4 • 32Gi

app.py

Define container images and hardware specs in code, and serve any function as HTTPS endpoints.

Storage

Name

Type

App Name

model-weight

Persistent

Llama70B

config

Shared

fine-tune

config

Shared

fine-tune

config

Shared

fine-tune

app.py

Easily provision network volumes, key-value stores, and queues using powerful cloud primitives that feel like regular Python.

Scheduling

Replica #1

11 tasks

Replica #2

5 tasks

app.py

Transform functions into cron jobs with just one line of code. Execute compute-intensive tasks without blocking your backend.

Hub

Host your

Models

for simple storage, discovery and collaboration

Models. A place for sharing and discovering trained machine learning models ready-to-deploy and fine-tune.

sdxl-simpsons-charcters

Public
Text-to-Image
Diffusers
ONNX
StableDiffusionXLPipeline
Inference Endpoints
lora
I

niva

Update config.json (#101)

Scheduler

added support for abort and chat completion types

text_encoder

added support for abort and chat completion types

Datasets. Explore, analyze and share datasets for Audio, Computer Vision, and Natural Language Processing (NLP) tasks.

wikipedia

Public

Updated 3 days ago

34.4k

640

bloomspeech

Public

Updated 5 days ago

3.3k

111

openfeedback

Public

Updated 11 days ago

5k

11k

music-genbox

Public

Updated 7 days ago

500

551

ML Portfolio. Create a personalized ML profile, share your work globally, and collaborate with teams using Git.

500

36

Katara Murphy

kataramurphy

21k

followers

.

11k

following

https://kataramurphy.com

https://github.com/kataramurphy

Explore

Models
04

Host machine learning models.

Datasets
04

Host datasets for training.

Inference Endpoint
02

Deploy models in production.

Fine-Tuning
02

Train, fine-tune, or process data.

AutoML
02

Train, fine-tune, or process data.

Enterprise Ready

Deploy on your own infrastructure

On Premise

A100-80GB

outpost-cluster.onoutpost.com

AWS

123456789

Azure

123456789

GCP

123456789

Connect

Connect your AWS, GCP, or Azure account to automatically provision the resources required to manage your infrastructure.

Deploy

Deploy ML models to your cloud with the Outpost SDK, ensuring data privacy and access to an OpenAI-compatible API.

Scale

Outpost dynamically scales across multiple cloud providers based on traffic, ensuring fast cold boots.

Deploy to production in minutes.

Speak to an expert for your Enterprise needs.