バグ/機能要求を報告

どんなポッドキャストでも転写

ポッドキャストエピソードリンクまたはオーディオURLを貼り付けると、数秒で正確なAI転写を得る。スピーカーラベル、タイムスタンプ、SRT、TXT、DOCXにエクスポート。

公開されているオーディオとビデオで動作します。DRM 保護されたコンテンツはサポートされていません。

アップグレード

プライベート・トランスクリプト

転写付きチャット

プロでロック解除 →

ファイルをここにドラッグまたはクリックしてブラウズ

MP3, WAV, M4A, FLAC, MP4, MKV, MOV, WebM — 最大2GB

複数のファイルを一括アップロードプロと一緒に

アップグレード

プライベート・トランスクリプト

転写付きチャット

プロでロック解除 →

アップグレード

リアルタイムの音声からテキストに変換。AI は話すときに自動的に訂正します。長い話をすると正確さが向上します。

まずマイクをテストしてください

10分フリー/日 600分無料クレジットカードなし暗号化

無料登録 →

Apple Podcasts、Spotify、YouTube、直接MP3、RSSフィードからのエピソードリンクと共に動作する。合計1,300以上のプラットフォーム。

公開されているオーディオと動作します。DRM 保護されたコンテンツはサポートされていません。

1. URL を貼り付け

ポッドキャストエピソードのリンクまたは直接のオーディオ/RSS URLをコピーして上に貼り付けます。

２．人工知能が転写する

音声を取得し，話者検出と単語レベルタイムスタンプを用いて転写する。

3. 読み込みとエクスポート

転写を検索し、スピーカーを編集し、SRT、VTT、TXT、DOCX、JSONにエクスポートします。

ポッドキャスト転写機能

話者検出

人工知能はホストとゲストを自動的に識別し、読者が会話を追うために各スピーカーにラベルを付けます。

SEOに優れた翻訳

発表された抄録は検索エンジンがポッドキャストのコンテンツをインデックス化し、より多くの有機的なトラフィックをあなたの番組に駆り立てる。

メモ生成を表示

人工知能を用いてエピソードの要約，キーポイント，タイムスタンプを生成する。

サイトに埋め込む

検索可能なインタラクティブなトランスクリプトを、コードの一行で、ポッドキャストのウェブサイトに直接埋め込む。

なぜ、ポッドキャストを作成するのか?

検索エンジン最適化

Googleは音声ではなくテキストをインデックス化する

アクセシビリティ

聴覚障害者と高度聴覚障害者の聴衆に届く

再利用

ブログ・ポスト、ソーシャル・クリップ

検索可能

瞬間を即座に見つける

エンガメント

読者はサイトに長く滞在する

AI 総括字幕生成メモ生成器を表示

よくある質問

podcast transcription runs in your browser: paste a URL, upload a file, or record from your mic. STT.ai picks the AI model and returns the transcript in under 5 minutes. Export as TXT, SRT, VTT, DOCX, JSON, or PDF.

Yes — every visitor gets 600 free minutes/month on STT.ai, usable for podcast transcription the same as any other workflow. Paid plans starting at $5/month unlock longer files, private transcripts, and priority queueing.

podcast transcription runs on the same AI models as the rest of STT.ai — our best models reach 95-97% accuracy on clean speech (3-5% Word Error Rate on benchmarks). Switch models on the fly if the first pass is below your target.

podcast transcription can run on any of STT.ai's 10+ models — STT.ai Enhanced (most accurate), Whisper Large V3 (99 languages), NVIDIA Canary (#1 WER on supported langs), Whisper Turbo (fast), Moonshine (lightweight), and more.

Yes. Every transcript exports as SRT or VTT — works with YouTube, Vimeo, TikTok, VLC, and every major video player. The burn-subtitles tool overlays them onto video as hardsubs.

Yes. Speaker diarization automatically labels each voice (Speaker 1, Speaker 2, ...) and you can rename them in the built-in editor. Works across all models and languages.

Most podcast transcription jobs finish in under 5 minutes. A 1-hour audio file typically completes in 2-3 minutes with our fastest models. Speed depends on chosen model and current GPU load.

podcast transcription accepts 20+ formats — MP3, WAV, M4A, FLAC, OGG, MP4, MKV, MOV, WebM, AVI, and more. Output to TXT, SRT, VTT, DOCX, JSON, or PDF.

Yes. Audio files submitted to podcast transcription are processed and deleted by default. Pro plans add client-side encryption — even if STT.ai's database is breached, your transcripts are unreadable without your key. Data is never used for model training without explicit opt-in.

Yes. STT.ai offers a REST API with Python and Node.js SDKs, plus an MCP server for Claude and Cursor — all usable for podcast transcription workflows. Free API tier includes 100 minutes/month.

Yes. Every transcript opens in the built-in editor where you can correct words, rename speakers, adjust timestamps, and add notes. All changes save automatically.

Every transcript gets a unique shareable URL. Export to DOCX or PDF for email. Pro plans add password-protected and permanent links — useful for client work.

STT.ai handles 1,300+ platforms including YouTube, Vimeo, TikTok, SoundCloud, Zoom, Google Meet, podcast hosts, and more. URL transcription works with publicly-available content only — DRM-protected sources can't be transcribed.

どんなポッドキャストでも転写

1. URL を貼り付け

２．人工知能が転写する

3. 読み込みとエクスポート

ポッドキャスト転写機能

話者検出

SEOに優れた翻訳

メモ生成を表示

サイトに埋め込む

なぜ、ポッドキャストを作成するのか?

よくある質問

How does podcast transcription work on STT.ai?

Is podcast transcription free?

How accurate is podcast transcription?

What AI models can I use for podcast transcription?

Can I get subtitles from podcast transcription?

Does podcast transcription detect different speakers?

How long does podcast transcription take?

What input formats does podcast transcription support?

Is my audio private when I use podcast transcription?

Is there a podcast transcription API?

Can I edit a podcast transcription transcript after?

How do I share what podcast transcription produces?

What other platforms work beyond podcast transcription?