AI Platforms Overview
Staque IO supports multiple AI platforms for model deployment and inference. Each platform offers unique advantages for different use cases.
Supported Platforms
☁️ AWS Bedrock
Fully managed foundation models with pay-per-use pricing.
- No infrastructure management
- Multiple model providers
- Token-based pricing
🚀 AWS SageMaker
Dedicated infrastructure for custom model deployments.
- Custom model support
- Predictable performance
- Full control
⚡ NVIDIA NIM
High-performance hosted inference microservices.
- Optimized for speed
- Cost-effective
- Easy integration
Platform Comparison
| Feature | AWS Bedrock | AWS SageMaker | NVIDIA NIM |
|---|---|---|---|
| Pricing Model | Pay-per-token | Hourly instance cost | Pay-per-use |
| Setup Time | Instant | 5-10 minutes | Instant |
| Infrastructure | Fully managed | Dedicated instances | Fully managed |
| Custom Models | Limited (fine-tuning) | Full support | Pre-selected models |
| Best For | Quick prototyping, variable workloads | Production, custom models | High-performance inference |
| Idle Costs | $0 (pay-per-use) | Hourly charge applies | $0 (pay-per-use) |
Choosing a Platform
Use AWS Bedrock When:
- You want instant access to foundation models without infrastructure setup
- Your workload is variable or unpredictable
- You prefer pay-per-token pricing with no idle costs
- You need access to multiple model providers (Anthropic, Amazon, Meta, etc.)
- You want to experiment with different models quickly
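The quickest way to try Bedrock is through its Converse API, which normalizes requests across model providers. Below is a minimal sketch using boto3; the region and model ID are placeholders, so substitute whichever model your AWS account has access to.

```python
# Minimal sketch: calling a Bedrock foundation model via the Converse API.
# Region and model ID below are example values, not requirements.
import boto3

client = boto3.client("bedrock-runtime", region_name="us-east-1")

response = client.converse(
    modelId="anthropic.claude-3-haiku-20240307-v1:0",  # example model ID
    messages=[{"role": "user", "content": [{"text": "Summarize our Q3 report in one sentence."}]}],
    inferenceConfig={"maxTokens": 256, "temperature": 0.2},
)

# The Converse API returns the same message structure regardless of provider,
# which makes it easy to swap models while prototyping.
print(response["output"]["message"]["content"][0]["text"])
```

Because pricing is per token, this kind of ad hoc experimentation incurs no cost when you are not sending requests.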
Use AWS SageMaker When:
- You have custom models or fine-tuned versions
- You need dedicated, predictable infrastructure
- Your workload is consistent and high-volume
- You require full control over the deployment environment
- You need to deploy models from HuggingFace or other sources
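Once a model is deployed to a SageMaker real-time endpoint, inference is a simple runtime call. The sketch below assumes a hypothetical endpoint name, and the request/response JSON shape depends entirely on the container serving your model.

```python
# Minimal sketch: invoking an already-deployed SageMaker real-time endpoint.
# "my-custom-model-endpoint" is a hypothetical name; replace it with yours.
import json
import boto3

runtime = boto3.client("sagemaker-runtime", region_name="us-east-1")

response = runtime.invoke_endpoint(
    EndpointName="my-custom-model-endpoint",
    ContentType="application/json",
    Body=json.dumps({"inputs": "Classify this support ticket: 'My invoice is wrong.'"}),
)

# The payload format is defined by your model container, not by SageMaker itself.
result = json.loads(response["Body"].read())
print(result)
```

Remember that the endpoint's instances bill hourly while they are running, even when idle, which is why SageMaker fits steady, high-volume workloads best.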
Use NVIDIA NIM When:
- You need high-performance inference with GPU optimization
- You want cost-effective access to popular open-source models
- You prefer managed infrastructure with fast response times
- You're building latency-sensitive applications
- You want to leverage NVIDIA's inference optimizations
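NIM endpoints expose an OpenAI-compatible API, so the standard openai client works by pointing it at the NIM service. The base URL and model name below are examples; check your NIM deployment or NVIDIA API catalog entry for the exact values.

```python
# Minimal sketch: calling a hosted NIM endpoint through its OpenAI-compatible API.
# Base URL, API key, and model name are assumptions for illustration.
from openai import OpenAI

client = OpenAI(
    base_url="https://integrate.api.nvidia.com/v1",  # example hosted NIM endpoint
    api_key="YOUR_NVIDIA_API_KEY",
)

completion = client.chat.completions.create(
    model="meta/llama-3.1-8b-instruct",  # example model
    messages=[{"role": "user", "content": "Give me three taglines for a coffee shop."}],
    max_tokens=128,
)

print(completion.choices[0].message.content)
```

Because the interface matches the OpenAI API, existing client code can usually be repointed at a NIM endpoint with only a base URL and model name change.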
💡 Pro Tip: Multi-Platform Strategy
Many organizations use multiple platforms simultaneously:
- Development: Bedrock for rapid prototyping
- Production: SageMaker for stable workloads
- Specialized: NVIDIA NIM for latency-critical endpoints
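One way to operationalize this split is a small routing helper that maps a workload profile to a platform. The sketch below is purely hypothetical; none of the names come from Staque IO, and the strings stand in for whichever client code you use for each platform.

```python
# Hypothetical sketch: route a workload profile to a platform per the strategy above.
def pick_platform(profile: str) -> str:
    routes = {
        "prototype": "bedrock",            # pay-per-token, no idle cost
        "steady-production": "sagemaker",  # dedicated, predictable instances
        "low-latency": "nim",              # GPU-optimized hosted inference
    }
    return routes.get(profile, "bedrock")  # default to the pay-per-use option

print(pick_platform("low-latency"))  # -> "nim"
```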
Getting Started
Click on any platform above to view detailed documentation, including:
- Platform overview and capabilities
- Available models and configurations
- Deployment instructions
- Pricing details
- API usage examples
- Best practices and troubleshooting