报告错误/功能要求

私有云转录

Name: STT.ai Private Cloud
Brand: STT.ai
Price: 299 USD
Availability: PreOrder

您的数据永远不会离开您的服务器。专为需要完全控制音频数据的组织提供的专用 GPU 转录服务。

Get Started 了解我们的安全性

工作原理

三个步骤即可启动运行。

1. 部署

我们在您首选的区域配置专用 GPU 服务器，或在您自己的硬件上部署我们的 Docker 镜像。设置时间不到24小时。

2. 转录

使用您熟悉的 STT.ai API 和网页界面。音频完全在您的专用服务器上处理，不会发送到共享基础设施。

3. 导出

转录文本保留在您的服务器上。可导出为 TXT、SRT、VTT、DOCX、JSON 或 PDF。通过 API 与您现有的系统集成。

选择您的部署方式

功能	共享云	私有云	自托管许可证
起始价格	$0 - $39/以单位	$499/以单位	$99/以单位
基础设施	共享 GPU	专用 GPU	您自己的 GPU
数据位置	我们的服务器	您选择的区域	您的场所
气隙部署支持
SLA
完全托管			您自行管理
无限分钟数

为受监管行业而建

当合规要求音频不得离开您的基础设施时。

医疗保健

符合 HIPAA 标准的患者录音、临床笔记和远程医疗会话转录。

法律

证词、法庭录音和特权通信留在您的律所内部。

政府

在气隙网络上转录机密或敏感简报。完全的数据主权。

金融

在本地处理财报电话会议、合规录音和交易大厅音频。

定价

私有云

$499/以单位

您自己的专用 GPU 服务器。音频永远不会离开您的基础设施。真正的端到端隐私。

专用 A100 GPU
隔离服务器——无共享基础设施
音频仅在您的硬件上处理
完整 API 访问 + SLA
无限分钟数

自托管许可证

$99/以单位

在您自己的硬件上运行 STT.ai。Docker 镜像，您的服务器，您做主。

Docker 镜像——可在任何 NVIDIA GPU 上运行
气隙部署支持——无需互联网
包含模型更新
完全控制您的数据
无限分钟数

准备好掌控您的转录基础设施了吗？

告诉我们您的需求。我们将帮助您选择合适的部署方案。

Get Started

常见问题

STT.ai Private Cloud and Self-Hosted transcription runs in your browser: paste a URL, upload a file, or record from your mic. STT.ai picks the AI model and returns the transcript in under 5 minutes. Export as TXT, SRT, VTT, DOCX, JSON, or PDF.

Yes — every visitor gets 600 free minutes/month on STT.ai, usable for STT.ai Private Cloud and Self-Hosted transcription the same as any other workflow. Paid plans starting at $5/month unlock longer files, private transcripts, and priority queueing.

STT.ai Private Cloud and Self-Hosted transcription runs on the same AI models as the rest of STT.ai — our best models reach 95-97% accuracy on clean speech (3-5% Word Error Rate on benchmarks). Switch models on the fly if the first pass is below your target.

STT.ai Private Cloud and Self-Hosted transcription can run on any of STT.ai's 10+ models — STT.ai Enhanced (most accurate), Whisper Large V3 (99 languages), NVIDIA Canary (#1 WER on supported langs), Whisper Turbo (fast), Moonshine (lightweight), and more.

Yes. Every transcript exports as SRT or VTT — works with YouTube, Vimeo, TikTok, VLC, and every major video player. The burn-subtitles tool overlays them onto video as hardsubs.

Yes. Speaker diarization automatically labels each voice (Speaker 1, Speaker 2, ...) and you can rename them in the built-in editor. Works across all models and languages.

Most STT.ai Private Cloud and Self-Hosted transcription jobs finish in under 5 minutes. A 1-hour audio file typically completes in 2-3 minutes with our fastest models. Speed depends on chosen model and current GPU load.

STT.ai Private Cloud and Self-Hosted transcription accepts 20+ formats — MP3, WAV, M4A, FLAC, OGG, MP4, MKV, MOV, WebM, AVI, and more. Output to TXT, SRT, VTT, DOCX, JSON, or PDF.

Yes. Audio files submitted to STT.ai Private Cloud and Self-Hosted transcription are processed and deleted by default. Pro plans add client-side encryption — even if STT.ai's database is breached, your transcripts are unreadable without your key. Data is never used for model training without explicit opt-in.

Yes. STT.ai offers a REST API with Python and Node.js SDKs, plus an MCP server for Claude and Cursor — all usable for STT.ai Private Cloud and Self-Hosted transcription workflows. Free API tier includes 100 minutes/month.

Yes. Every transcript opens in the built-in editor where you can correct words, rename speakers, adjust timestamps, and add notes. All changes save automatically.

Every transcript gets a unique shareable URL. Export to DOCX or PDF for email. Pro plans add password-protected and permanent links — useful for client work.

STT.ai handles 1,300+ platforms including YouTube, Vimeo, TikTok, SoundCloud, Zoom, Google Meet, podcast hosts, and more. URL transcription works with publicly-available content only — DRM-protected sources can't be transcribed.

您的体验如何？

私有云转录

工作原理

1. 部署

2. 转录

3. 导出

选择您的部署方式

为受监管行业而建

医疗保健

法律

政府

金融

定价

私有云

自托管许可证

准备好掌控您的转录基础设施了吗？

常见问题

How does STT.ai Private Cloud and Self-Hosted transcription work on STT.ai?

Is STT.ai Private Cloud and Self-Hosted transcription free?

How accurate is STT.ai Private Cloud and Self-Hosted transcription?

What AI models can I use for STT.ai Private Cloud and Self-Hosted transcription?

Can I get subtitles from STT.ai Private Cloud and Self-Hosted transcription?

Does STT.ai Private Cloud and Self-Hosted transcription detect different speakers?

How long does STT.ai Private Cloud and Self-Hosted transcription take?

What input formats does STT.ai Private Cloud and Self-Hosted transcription support?

Is my audio private when I use STT.ai Private Cloud and Self-Hosted transcription?

Is there a STT.ai Private Cloud and Self-Hosted transcription API?

Can I edit a STT.ai Private Cloud and Self-Hosted transcription transcript after?

How do I share what STT.ai Private Cloud and Self-Hosted transcription produces?

What other platforms work beyond STT.ai Private Cloud and Self-Hosted transcription?