Security & Privacy

Exactly what happens to your audio and transcripts at every step. No marketing fluff — just how it actually works.

Three Levels of Privacy

Standard

Every user, every plan — including free
  • HTTPS (TLS 1.3) for all data in transit
  • Audio deleted immediately after processing
  • Transcripts stored in our database
  • We can read stored transcripts
  • Data never sold or used for training
  • Delete your data anytime

Private Transcript

Pro+ (available on Pro and Business plans)
  • Everything in Standard, plus:
  • Transcript encrypted in your browser (AES-256-GCM) before saving
  • We store only encrypted data — we cannot read it
  • Key derived from your password, never sent to us
  • ⚠ Audio is still processed on our servers during transcription

Private Cloud / Self-Hosted

Full isolation — from $99/mo
  • Audio never leaves your infrastructure
  • Transcription runs on your GPU
  • No data sent to STT.ai servers
  • Air-gapped support available
  • True end-to-end privacy

What Actually Happens to Your Data

A transparent, step-by-step breakdown of how your audio and transcript are handled.

Standard (all users)
1. You upload audio or record live
   Your file is sent over HTTPS (TLS 1.3) to our GPU server for transcription.
2. Audio is processed in memory
   Our AI models transcribe your audio on the GPU. The audio is held in memory during processing, is never written to disk, and is deleted from memory immediately afterward.
3. Transcript is stored in our database
   The text transcript, timestamps, and speaker labels are saved so you can access them later. We can read this data (this is how search, AI summaries, and sharing work).
4. You can delete everything anytime
   Delete individual transcripts or your entire account from Privacy Settings. Deletion is permanent and immediate.

With Private Transcript enabled

Steps 1-2 are the same — your audio must be processed on our servers to generate the transcript. The difference is what happens next:

3. Transcript is encrypted in your browser before saving
   After transcription, the result is returned to your browser. Your browser encrypts it with AES-256-GCM using a key derived from your password (PBKDF2 with SHA-256, 100,000 iterations). The encrypted blob is then sent to our servers for storage. We never see or store the encryption key.
4. We store only encrypted data
   Our database contains only the encrypted blob. We cannot decrypt it. If our database were breached, your transcripts would be unreadable.

Important: Private Transcript protects the stored transcript. During transcription itself, your audio is processed on our servers in order to generate the text. If your threat model requires that audio never touches third-party servers, consider Private Cloud or Self-Hosted.
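
The key-derivation step described above can be sketched in a few lines. This is an illustration only, not the production code, which runs in the browser on the Web Crypto API. It uses Python's standard library to show the stated parameters (PBKDF2 with SHA-256, 100,000 iterations, a 256-bit key, a random 12-byte nonce). The 16-byte salt and the function names are assumptions, since this page does not describe them. The AES-256-GCM step itself is left out because Python's standard library does not include it.

```python
import hashlib
import secrets

PBKDF2_ITERATIONS = 100_000  # iteration count stated on this page
KEY_BYTES = 32               # 256-bit key for AES-256-GCM
NONCE_BYTES = 12             # 96-bit IV, fresh for every encryption
SALT_BYTES = 16              # ASSUMPTION: salt size is not stated on this page


def derive_key(password: str, salt: bytes) -> bytes:
    """Derive a 256-bit key from a password with PBKDF2-HMAC-SHA256."""
    return hashlib.pbkdf2_hmac(
        "sha256", password.encode("utf-8"), salt, PBKDF2_ITERATIONS, dklen=KEY_BYTES
    )


def new_nonce() -> bytes:
    """Return a random 12-byte nonce; never reuse one with the same key."""
    return secrets.token_bytes(NONCE_BYTES)


salt = secrets.token_bytes(SALT_BYTES)
key = derive_key("correct horse battery staple", salt)
nonce = new_nonce()
print(len(key), len(nonce))  # prints "32 12"
```

Because the key is recomputed from the password each session, only the salt and nonce (neither of which is secret) would need to be stored next to the encrypted blob.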

What We Can and Can't See

We CANNOT see (with Private Transcript)
  • Your stored transcript text
  • Speaker names or labels (stored)
  • Timestamps or word-level data (stored)
  • Your encryption key or password
We CAN see (even with Private Transcript)
  • Your audio during processing (deleted after)
  • File name, size, duration (metadata)
  • Language detected, model used
  • Timestamp of transcription
  • Your account info and billing

Technical Details

Encryption algorithm: AES-256-GCM (authenticated encryption)
Key derivation: PBKDF2 with SHA-256, 100,000 iterations
IV (nonce): Random 12 bytes per encryption (never reused)
Key storage: Never stored; derived from password on each session
Transport encryption: TLS 1.3 (HTTPS) + HSTS (1 year, preload)
Audio retention: Processed in memory, never written to disk, deleted immediately
Implementation: Web Crypto API (browser-native, no external libraries)
Source code: github.com/sttaigit/stt-encryption (MIT license)
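
The transport settings listed above can be checked from the client side. The sketch below, using only Python's standard library, builds a client context that refuses anything older than TLS 1.3 and parses a Strict-Transport-Security header to confirm the "1 year, preload" policy. The example header value is an assumption; the exact header the service sends is not shown on this page, and the function names are illustrative.

```python
import ssl

ONE_YEAR_SECONDS = 365 * 24 * 60 * 60  # 31,536,000


def tls13_only_context() -> ssl.SSLContext:
    """Client context that verifies certificates and refuses anything below TLS 1.3."""
    context = ssl.create_default_context()
    context.minimum_version = ssl.TLSVersion.TLSv1_3
    return context


def parse_hsts(header_value: str) -> dict:
    """Parse a Strict-Transport-Security header value into its directives."""
    policy = {"max_age": None, "include_subdomains": False, "preload": False}
    for directive in header_value.split(";"):
        name, _, value = directive.strip().partition("=")
        name = name.lower()
        if name == "max-age":
            policy["max_age"] = int(value.strip('"'))
        elif name == "includesubdomains":
            policy["include_subdomains"] = True
        elif name == "preload":
            policy["preload"] = True
    return policy


# ASSUMPTION: example header value, not captured from the live service.
example = "max-age=31536000; includeSubDomains; preload"
policy = parse_hsts(example)
print(policy["max_age"] >= ONE_YEAR_SECONDS, policy["preload"])  # prints "True True"
```

Passing the context to an HTTPS client makes the connection fail outright if the server cannot negotiate TLS 1.3, rather than silently falling back to an older version.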

Private Transcript Trade-offs

Private Transcript is opt-in because encrypting the stored transcript limits some features:

Works with encryption
  • Viewing your transcripts
  • Exporting (TXT, SRT, VTT, etc.)
  • Downloading
  • Editing (decrypted in browser)
Not available with encryption
  • Server-side search across transcripts
  • AI summaries and chat (server can't read data)
  • Public sharing via link
  • Team workspace collaboration

Need Audio to Never Leave Your Servers?

Private Transcript protects the transcript at rest, but audio still passes through our GPU during processing. If your compliance or security requirements demand that audio never touches third-party infrastructure, these are your options:

Private Cloud

$499/mo

Dedicated GPU server managed by us. Your audio never leaves your isolated environment.

  • Dedicated A100 GPU
  • Isolated — no shared infrastructure
  • Audio processed on your dedicated hardware only
  • Full API access + SLA

Self-Hosted

$99/mo

Docker image. Your servers. Your GPU. Nothing leaves your network.

  • Docker — runs on any NVIDIA GPU
  • Air-gapped support — no internet required
  • Model updates included
  • Full control, full privacy

Our Commitments (All Users, All Plans)

  • Audio files are never stored permanently. Processed in GPU memory, deleted immediately after transcription.
  • Your data is never used for AI training unless you explicitly opt in via Voice Lab.
  • We don't sell your data. Ever. To anyone.
  • All traffic encrypted in transit via TLS 1.3 with HSTS.
  • Delete your data anytime from Privacy Settings or by deleting your account.
  • Encryption code is open source: audit it yourself (MIT license).

Open-Source Encryption

Our encryption library is fully open source under the MIT license. Don't take our word for it; read the code and verify it yourself.


Ready to transcribe securely?

Upload your first file free. Private transcripts available on Pro and Business plans.


Frequently Asked Questions

Upload your audio or video file to STT.ai. Choose your options and preferred AI model, then click Transcribe. Your transcript will be ready in minutes. Export to TXT, SRT, VTT, DOCX, JSON, or PDF.

STT.ai offers 600 free minutes every month to all users. No sign-up is required for your first transcription. Paid plans with more minutes and features start at $5/month.

Accuracy depends on the AI model you choose and the quality of the audio. Our best models achieve a 5-7% Word Error Rate on benchmarks, which corresponds to 93-95% accuracy.
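
The relationship between Word Error Rate and accuracy quoted above (5-7% WER, 93-95% accuracy) is simply accuracy = 1 - WER. Here is a minimal sketch of how WER is conventionally computed, as word-level edit distance divided by the number of reference words. The function name and sample sentences are illustrative, and this is not necessarily the exact scoring used for the benchmarks mentioned.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / words in the reference."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # Classic edit-distance table, computed one row at a time.
    previous = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        current = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # match or substitution
            ))
        previous = current
    return previous[-1] / len(ref)


wer = word_error_rate("the cat sat on the mat", "the cat sat on a mat")
print(f"WER {wer:.1%}, accuracy {1 - wer:.1%}")  # prints "WER 16.7%, accuracy 83.3%"
```

One wrong word out of six gives a WER of about 16.7%; a 5% WER means roughly one error in every twenty words.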

STT.ai offers 10+ models, including Whisper Large V3, NVIDIA Canary, and more. You can compare results from different models on the same file.

Yes. After transcription, export your transcript as an SRT or VTT subtitle file. These work with YouTube, Vimeo, and all major video platforms.

Yes. STT.ai identifies and labels different speakers using AI speaker diarization.

Most files are transcribed in under 5 minutes. A 1-hour audio file takes 2-3 minutes with our fastest models.

STT.ai supports 20+ audio and video formats, including MP3, WAV, M4A, FLAC, OGG, MP4, MKV, MOV, WebM, and AVI. Export as TXT, SRT, VTT, DOCX, JSON, or PDF.

Yes. Audio files are processed and then deleted after transcription. Your data is never used for training. Client-side encryption is not available on every plan; where it is enabled, transcripts are stored encrypted with a key that only you hold. During processing, the server handles your audio in plaintext. Learn more about our security.

Yes. STT.ai offers a REST API with Python and Node.js SDKs. The free tier includes 100 minutes/month.

Yes. STT.ai includes a built-in transcript editor where you can correct errors, rename speakers, and adjust timestamps.

Every transcript gets a unique shareable link. Export to DOCX or PDF to send by email. Pro plans offer password-protected links and external links.

STT.ai supports 1,300+ platforms, including YouTube, Vimeo, TikTok, SoundCloud, and more. URL transcription works only with publicly available audio and video. DRM-protected content (such as Spotify premium episodes, Netflix, Disney+, etc.) cannot be fetched by URL. For DRM-protected content, upload the file directly instead.