Extract and visualize document layout as markdown and JSON
Generate speaker‑labeled transcripts from audio files