totonoeAI
Pricing

totonoeai

AI tools that tidy up your documents

Free Tools

  • PDF Converter
  • Word/Excel → Markdown
  • Image → Text (OCR)
  • Markdown Editor
  • Diagram Builder
  • Screenshots → PDF
  • PDF Editor
  • Image Editor
  • Data Converter
  • SQL Formatter
  • URL Shortener

AI Tools

  • AI Document Formatter
  • AI Meeting Notes
  • AI Table Extraction
  • AI Diagram Builder
  • Video & Audio Processor

Other

  • Pricing plans
  • Terms of Service
  • Privacy Policy
  • Commercial Disclosure

© 2026 totonoeai

Video & Audio Processor

Upload a video (mp4 / mov / webm / avi) or audio file (mp3 / wav / m4a / ogg) and get a full transcript, an AI summary or structured meeting minutes, plus screenshots (video only) all in one job. Long content runs without timing out.

Drag & drop a file

orclick to select

Accepted: .mp4, .mov, .webm, .avi, .mp3, .wav, .wav, .m4a, .ogg / up to 200MB

Output mode

Screenshot settings

AI Video Processor — Batch Extract Transcript, Summary, and Screenshots

Upload an MP4, MOV, WebM, or AVI video and AI transcribes the audio, generates a summary, and extracts scene-change screenshots as a ZIP — all at once. Ideal for turning Zoom / Teams recordings, YouTube videos, lecture recordings, and internal training videos into meeting notes or summaries.

Three outputs from one video

This tool automatically generates the following from a single video file:

1. **Full transcript**: Japanese audio transcribed with high accuracy by OpenAI Whisper 2. **Summary (Markdown)**: Claude Sonnet reads the transcript and extracts key points 3. **Screenshot ZIP**: screenshots captured on scene change or at fixed intervals

A one-hour meeting video is fully processed in 5–10 minutes.

Use cases

· **Meeting minutes from Zoom / Teams recordings**: record → upload → structured minutes in 5 minutes · **Lecture and seminar notes**: transcript + chapter screenshots from video lectures become complete notes · **Article from a YouTube video**: transcribe your own video → AI-format into an article · **Training material review**: watch a training video on the Free plan → minutes + key-scene archive

Technology stack

· **Audio extraction**: ffmpeg (LGPL license, commercially OK) cascades video to mp3/m4a/wav · **Transcription**: OpenAI Whisper API (`whisper-1`, language=ja for Japanese) · **Summarization**: Claude Sonnet 4.5 · **Screenshots**: ffmpeg scene change detection or fixed-interval captures

Privacy

Uploaded videos are stored temporarily in AWS S3 (Tokyo region) and automatically deleted within one hour. Whisper API and Anthropic API transfers are TLS-encrypted + zero-data-retention contracts. Safe for confidential internal meeting recordings.

File size: Free 20 MB / Light 100 MB / Premium 200 MB. Video processing runs as an async job — if you close the browser, processing continues.

Frequently Asked Questions

Can it process a 1-hour video?

Yes. Internal auto-chunking handles Whisper's 25 MB limit. The video file itself must be within plan limits: Free 20 MB / Light 100 MB / Premium 200 MB.

What video formats are supported?

MP4, MOV, WebM, AVI, and MKV. Videos taken with a smartphone (.mov / .mp4) can be uploaded directly.

Can it distinguish between speakers?

Speaker diarization is not currently supported — AI infers participants from the transcript. If Zoom/Teams speaker-labeled subtitles are available, pasting those separately improves accuracy.

How are screenshot timings determined?

"Scene change" mode auto-detects significant scene transitions. "Fixed interval" mode captures at a specified number of seconds (default 10s). Scene change is better for meetings; interval is better for lectures.

Can I use video processing on the Free plan?

Yes, but 3 reward ad views are required before processing (Light: 1 view; Premium: no ads). This offsets the higher Whisper API costs.

Related Tools

AI Meeting Notes →

Create meeting notes from audio or text rather than video.

Screenshots → PDF →

Combine the output screenshot ZIP into a single PDF.

AI Document Formatter →

Restructure the summary into a report or proposal format.