Voice Interpretation
Real-time voice translation that speaks for you in meetings, 1:1 conversations, and live events.
Live interpretation for meetings and 1:1 conversations, side-by-side caption translation for presentations, and meeting transcription with translation and AI summaries. Everything runs in the browser, with no install required.
Speak with someone, listen to a talk, project captions on a screen, or upload a file. Pick the mode that fits.
Switch between speakers in a 1:1 talk and let LingoFlow voice both sides. Voice output: Japanese, English, Chinese.
Live interpretation that reads back the translated speech as the speaker continues. Voice output: Japanese, English, Chinese.
Live captions and translation shown side-by-side, ideal for screen-sharing and projecting on a meeting room display. Recognized input: Japanese, English, Vietnamese, Chinese. Translated text: Japanese, English, Chinese.
Upload an .m4a, .mp3, .wav, .mp4, .mpeg, .mpga, or .webm file and get a transcript, translation, summary, and Markdown export. Input and output are supported in Japanese, English, Vietnamese, and Chinese.
What we actually do with audio and text — written from the implementation, not as marketing.
We do not use your audio, transcripts, translations, or summaries to train LingoFlow-owned AI models.
An audio-processing consent screen is shown before any session. Please confirm with the people in the conversation before you start.
You can delete history entries, recorded transcription jobs, and related recording assets from the app where implemented.
See Security · Data Management · Subprocessors for details.
From speech to reusable notes in a few steps.
Use the microphone for a live conversation, or upload a recording from your device.
LingoFlow creates the original transcript and translated text with language settings you control.
Generate a summary, reopen results from history, and export Markdown for follow-up work.
Built for repeated communication work, not one-off demos.
Follow discussions across Japanese, English, Chinese, and Vietnamese, then keep a searchable record.
Record on your phone, upload later, and turn the recording into structured text and summaries.
Convert long recordings into text that is easier to review, translate, and share with teammates.
Tools that help global teams communicate without friction.
Real-time voice translation that speaks for you in meetings, 1:1 conversations, and live events.
Live captions and translation, side-by-side, for meetings and presentations.
Real-time translation for in-person international meetings on a single smartphone. Reduce misunderstandings and align faster on decisions.
Upload recordings and get transcripts, translations, summaries, and Markdown notes.
The same conversation often calls for a different mode depending on who is in the room.
Use Voice Interpretation in Conversation Mode to switch between speakers. Each side hears the other's words spoken in their own language, so people can keep eye contact instead of reading a screen.
Run Realtime Interpretation for the spoken translation, and project Realtime Caption Translation on a separate screen — covering both listeners and readers in the same session.
Upload phone-recorded audio to Audio Transcription for transcript, translation, summary, and Markdown export in one pass. History entries can be deleted at any time.
Simple monthly plans. You're charged immediately at checkout and your subscription auto-renews monthly.
¥0/month
Up to 10 minute-equivalent credits/month (10 min-equiv/day)
7-day retention
¥980/month (tax included)
Up to 120 minute-equivalent credits/month (no daily limit)
30-day retention
Ticket top-up: Basic plan only
Quotas are tracked in minute-equivalent credits. Realtime caption transcription uses 1 credit per minute, while voice interpretation and voice-output translation use 2 credits per minute. So the Basic plan's 120 minute-equivalent credits can be spent, for example, as 120 minutes of caption translation, 60 minutes of voice interpretation, or any mix in between. See /pricing for full plan details.
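The credit arithmetic above can be sketched in a few lines of Python. This is only an illustration of the stated rates (1 credit per caption minute, 2 credits per voice minute). The function and constant names are hypothetical. How LingoFlow rounds partial minutes is not stated here, so the sketch does no rounding.

```python
# Rates and plan quotas taken from the pricing text; all names are hypothetical.
CAPTION_CREDITS_PER_MIN = 1   # realtime caption transcription
VOICE_CREDITS_PER_MIN = 2     # voice interpretation and voice-output translation
FREE_MONTHLY_CREDITS = 10
BASIC_MONTHLY_CREDITS = 120


def credits_used(caption_minutes, voice_minutes):
    """Return the minute-equivalent credits consumed by a mix of usage."""
    if caption_minutes < 0 or voice_minutes < 0:
        raise ValueError("minutes must be non-negative")
    return (caption_minutes * CAPTION_CREDITS_PER_MIN
            + voice_minutes * VOICE_CREDITS_PER_MIN)


def credits_remaining(monthly_credits, caption_minutes, voice_minutes):
    """Return credits left after usage, clamped at zero (an assumption)."""
    return max(0, monthly_credits - credits_used(caption_minutes, voice_minutes))


if __name__ == "__main__":
    # Three ways to spend the Basic plan's 120 credits.
    print(credits_used(120, 0))   # captions only
    print(credits_used(0, 60))    # voice only
    print(credits_used(40, 40))   # a mix of both
```

Each of the three calls comes to 120 credits, matching the "any mix in between" example.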
Conversation Mode is two-way voice interpretation. Switch between speakers and the app reads each side's speech back in the other side's language. Use it for 1:1 talks and face-to-face meetings.
Realtime Interpretation is one-way voice interpretation. The app listens to a presenter or meeting and produces a translated voice in your target language while speech continues.
Realtime Caption Translation shows the original transcript and translation side-by-side as text — no voice output. It's intended for screen-sharing and projecting captions for an audience.
Voice output for interpretation: Japanese, English, Chinese.
Realtime caption translation: input Japanese, English, Vietnamese, Chinese / translated text Japanese, English, Chinese.
Audio transcription (upload): input and output in Japanese, English, Vietnamese, Chinese.
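For a quick check of whether a language pair is covered, the three lists above can be written as a small lookup table. This is a sketch only: the mode keys and function name are hypothetical labels, and recognized input languages for voice interpretation are left as unknown because they are not listed here.

```python
# Language support per mode, transcribed from the lists above.
# Mode keys are hypothetical; None means the text does not state the languages.
ALL_FOUR = {"Japanese", "English", "Vietnamese", "Chinese"}

SUPPORT = {
    "voice_interpretation": {
        "input": None,  # not listed for this mode
        "output": {"Japanese", "English", "Chinese"},
    },
    "caption_translation": {
        "input": set(ALL_FOUR),
        "output": {"Japanese", "English", "Chinese"},
    },
    "audio_transcription": {
        "input": set(ALL_FOUR),
        "output": set(ALL_FOUR),
    },
}


def is_supported(mode, direction, language):
    """Return True or False when documented, or None when not stated."""
    languages = SUPPORT[mode][direction]
    if languages is None:
        return None
    return language in languages
```

For example, `is_supported("caption_translation", "output", "Vietnamese")` is `False`, while the same query for `"audio_transcription"` is `True`.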
No. We do not use your audio, transcripts, translations, or summaries to train LingoFlow-owned AI models. See Security for details.
Yes. History entries, recorded transcription jobs, and related recording assets can be deleted from the app. See Data Management for the deletion scope.
Quotas are tracked in minute-equivalent credits. Realtime caption transcription uses 1 credit per minute, while voice interpretation and voice-output translation use 2 credits per minute. So 120 credits can become 120 minutes of caption translation, 60 minutes of voice interpretation, or any mix.
Yes. LingoFlow runs in a modern browser. You can also install it as a PWA for an app-like experience.
Audio transcription supports .m4a, .mp3, .wav, .mp4, .mpeg, .mpga, and .webm files. The Free plan accepts files up to 10 minutes or 25 MB each; the Basic plan accepts files up to 60 minutes or 100 MB each. Files larger than 25 MB are split automatically.
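If you want to check a file before uploading, the limits above can be expressed as a short pre-flight check. This is a sketch under stated assumptions, not LingoFlow's actual validation: it treats the duration and size limits as both applying, takes "MB" to mean 1024 × 1024 bytes, and uses hypothetical function and plan names.

```python
from pathlib import Path

ALLOWED_EXTENSIONS = {".m4a", ".mp3", ".wav", ".mp4", ".mpeg", ".mpga", ".webm"}
MB = 1024 * 1024  # assumption: "MB" means 2**20 bytes

# (max minutes, max bytes) per plan, from the text above; plan keys are hypothetical.
PLAN_LIMITS = {"free": (10, 25 * MB), "basic": (60, 100 * MB)}
SPLIT_THRESHOLD = 25 * MB  # files above this size are split automatically


def check_upload(filename, size_bytes, duration_minutes, plan):
    """Return a list of problems; an empty list means the file looks acceptable."""
    max_minutes, max_bytes = PLAN_LIMITS[plan]
    problems = []
    if Path(filename).suffix.lower() not in ALLOWED_EXTENSIONS:
        problems.append("unsupported file type")
    if duration_minutes > max_minutes:
        problems.append(f"longer than {max_minutes} minutes")
    if size_bytes > max_bytes:
        problems.append(f"larger than {max_bytes // MB} MB")
    return problems


def will_be_split(size_bytes):
    """Return True when the file exceeds the automatic split threshold."""
    return size_bytes > SPLIT_THRESHOLD
```

A 40 MB, 45-minute .wav file passes on the Basic plan and would be split, while the same file on the Free plan fails both the size and the duration check.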
Free to start. No credit card required.
10 minute-equivalent credits per month on the Free plan.