Convert MOV to M4A Online
Convert MOV video from iPhone or QuickTime to M4A audio. Free, in your browser.
.mov · up to 100 MB
What you can do
MOV to M4A: from Apple video to Apple audio without leaving the ecosystem
Native Apple ecosystem
MOV and M4A are Apple formats. Conversion preserves metadata and is fully compatible with iTunes and Music.
Siri Shortcuts ready
M4A is the native iOS audio format for Shortcuts automations and transcription with SFSpeechRecognizer.
100% private
Your video never leaves your device. FFmpeg.wasm processes everything in local WebAssembly.
Apple Podcasts direct
Get audio ready to upload to Apple Podcasts Connect, Buzzsprout, or any podcast distributor.
How it works
Three steps, no hassle
Upload your MOV file
Drag or select your .mov video from iPhone, QuickTime, or Apple camera. Up to 500 MB, no signup.
M4A audio extraction
FFmpeg.wasm analyzes the QuickTime container. Extracts the audio track — stream copy if already AAC, or transcode to AAC if PCM or ALAC.
Download your M4A file
Audio ready for Apple Music, iTunes, Apple Podcasts distribution, Siri Shortcuts automations, or iPhone ringtone (.m4r).
FAQ
Got questions?
Apple uses the QuickTime container (.mov) for historical reasons and ecosystem compatibility. The QuickTime Movie format was developed by Apple in 1991 and is the native container for all macOS and iOS video APIs — AVFoundation, Core Media, Photos Framework. Although MOV and MP4 are technically very similar (both derive from MPEG-4 ISOBMFF), MOV allows Apple proprietary metadata extensions like photo geolocation, video stabilization data, slow-motion frame rate metadata in the cmov atom, and the Display P3 color profile on iPhones since the iPhone 7 (2016). These additional metadata are why Apple prefers MOV as the native capture container.
In the vast majority of cases, yes. Videos recorded with the iPhone camera app, FaceTime, Clips.app, and most third-party apps using AVFoundation capture audio in AAC-LC at 44.1 kHz or 48 kHz with bitrates between 64 and 320 kbps depending on the recording mode. However, some specific workflows generate different audio: iOS screen recordings may use high-resolution PCM; MOV files exported from Final Cut Pro or Logic Pro can contain ALAC (Apple Lossless Audio Codec) or 24-bit PCM/LPCM; and very old MOV files (pre-2010) may contain IMA4 (IMA ADPCM) audio.
Yes, and it's one of the most powerful workflows in the Apple ecosystem. Siri Shortcuts (iOS 12 and later) includes 'Encode Media' and other actions that can process M4A. For production audio workflows — automating voice memo conversion, generating audio notes for Obsidian or Notion, or sending audio clips via AirDrop — M4A is the recommended format because it's native to the OS and doesn't require additional conversion to play, analyze with SFSpeechRecognizer, or attach in Mail.app.
Both are based on ISOBMFF, but differ in the ftyp atom: MOV uses 'qt ' (QuickTime) as the major brand, while M4A uses 'M4A ' with compatible brands 'mp42' and 'isom'. This makes some players and DAWs that only handle standard MPEG-4 unable to open MOV correctly, while M4A is universally accepted. MOV can also contain Apple proprietary atoms ('wide', 'skip', 'free' from classic QuickTime) that don't exist in MPEG-4 standard. Converting to M4A removes these proprietary atoms, producing a clean file compatible with universal MPEG-4 standard.
iPhone Voice Memos (Voice Memos app) are saved directly as .m4a files — they don't need conversion. However, if you have voice recordings exported as video (for example, shared as attachments that some systems convert to MOV), you can use this tool to extract the audio. The most common case where MOV to M4A is useful for voice notes is when you record a FaceTime call or Zoom meeting on iOS, which generates MOV files, and want to archive only the audio in your library.
Yes, provided the audio meets minimum technical requirements. DistroKid, TuneCore, CD Baby, and Amuse accept M4A/AAC as an alternative delivery format to WAV for independent artists. Requirements are: AAC-LC (not HE-AAC), minimum 256 kbps for Apple Music, stereo, 44.1 kHz sample rate. If the original MOV video was recorded with a recent iPhone (iPhone 12 and later record audio at AAC 256 kbps stereo), the extracted M4A meets the requirements exactly without any additional processing.
Convert MOV to M4A: iPhone and QuickTime audio for iTunes, Podcasts, and Siri Shortcuts
Converting MOV to M4A is the central operation in the Apple ecosystem audio workflow, since the iPhone — the world's most widely used video recording device in 2025 — generates MOV files by default across all its recording modes: Camera.app in standard video, Cinematic Mode (iPhone 13 and later), screen recording from Control Center, and FaceTime calls recorded with ReplayKit. The QuickTime Movie container (.mov) was developed by Apple in 1991 as the native format for QuickTime Player and became the foundation upon which the ISO consortium built the MPEG-4 ISOBMFF (ISO/IEC 14496-12) in 2001: Apple directly contributed the QuickTime Box atom architecture to the international standard. This shared heritage makes MOV and M4A technically very similar, but with important differences in Apple proprietary metadata that third-party players and DAWs don't always interpret correctly. M4A, as a standard MPEG-4 container with AAC audio, is universally compatible with all players, streaming platforms, and editing tools, while MOV can have compatibility issues in non-Apple environments. Extracting M4A audio from an iPhone MOV allows separating audio content — which could be an interview, musical rehearsal, or field-recorded podcast — from Apple's proprietary video container for distribution, editing, and archiving on ecosystem-agnostic platforms.
In the Siri Shortcuts automation space, M4A holds a privileged position as the native iOS audio format. The Shortcuts architecture in iOS 16 and later includes actions that operate directly on M4A files: 'Encode Media' can reduce bitrate for sharing, 'Get File Detail' reads udta atom metadata, and SFSpeechRecognizer — Apple's speech recognition engine available via API since iOS 10 — accepts M4A as direct input for offline transcription without an internet connection. A practical common workflow: record an important video call in MOV from iPhone, convert to M4A here, import to Shortcuts where SFSpeechRecognizer transcribes the audio, and the resulting text is automatically saved to Notes or Notion. This local transcription pipeline with M4A has privacy advantages over sending audio to external APIs like cloud-based Whisper, especially for confidential meetings. For podcast distribution, creators who record field content with the iPhone get an M4A directly compatible with Apple Podcasts Connect, Buzzsprout, Transistor, and Podbean, without needing additional software in the workflow.
Convertir.ai runs MOV to M4A conversion entirely in the browser using FFmpeg.wasm with support for all Apple ecosystem audio codecs. The process begins with analysis of the MOV ftyp atom to confirm the 'qt ' (QuickTime) brand or 'isom'/'mp42' in cases of MOV with dual MPEG-4 compatibility. FFmpeg parses the moov atom structure to locate all trak atoms and their hdlr descriptors: tracks with hdlr='soun' are candidates for extraction. The stsd atom within the audio track reveals the codec: 'mp4a' indicates AAC (direct stream copy), 'alac' indicates Apple Lossless Audio Codec (requires decoding and re-encoding to AAC), 'lpcm' or 'sowt' indicate uncompressed PCM (requires encoding to AAC), and 'ima4' indicates IMA ADPCM (very old MOV files, requires decoding). For the most common case — iPhone MOV with AAC audio — extraction proceeds by stream copy in milliseconds regardless of the video file size. The resulting M4A moov atom includes the ftyp with major brand 'M4A ', duration and sample rate metadata from the mvhd atom, and user metadata from the udta atom transferred from the original MOV, preserving information like the recording date and device model for archiving. The absence of server communication guarantees absolute privacy for meeting recordings, interviews, and personal content.