Video search for local folders — built for real footage libraries
Point ClipCatalog at any folder on your PC or external drive. It builds a fast, searchable catalog — locally — so you can find clips by what’s in the shot (detected content), what was said (transcripts), who appears (optional face recognition), and classic production metadata. No cloud uploads. No manual labeling. Just search and find.
Try ClipCatalog free — up to 500 videos
No account required. Your footage stays on your computer.
Stop guessing which clip is GH010234.MP4. Search by what’s on screen, what was said, or who appears — ClipCatalog makes your footage searchable by meaning.
Processing happens once per clip. After that, searches feel instant because results come from your local library — not from re-scanning files every time.
Video files are huge and personal. ClipCatalog processes everything on your computer so your footage is never uploaded to a cloud service just to become searchable.
Three ways to search your footage
Most video search tools only work with file names and dates. ClipCatalog goes much deeper — it understands what’s on screen, what was said, and who appears, so you can search the way you actually remember things.
Search by what’s on screen
ClipCatalog’s AI watches every clip and automatically detects scenes, objects, and actions — cars, beaches, interviews, celebrations, you name it. Type what you’re looking for and matching clips surface instantly. No manual labeling required.
Whether you’re a vlogger digging through old shoots for a perfect sunset shot or a filmmaker scanning raw footage for an interview close-up, detected content replaces the guesswork of browsing folders.
Learn more about detected content →Search by what was said
Every word spoken in your footage becomes searchable. ClipCatalog transcribes audio locally using state-of-the-art speech recognition, so you can type a word and jump straight to the moment — perfect for pulling sound bites, interview quotes, or voiceover takes.
Think of it as Ctrl+F for every video you’ve ever recorded. Instead of scrubbing through hours of footage, you search for the word and ClipCatalog shows you exactly which clips contain it.
Learn more about transcript search →Search by who appears
Face recognition groups appearances automatically across your entire library. Need every clip featuring a guest, collaborator, or family member? Select their face and get every matching video — no scrubbing through timelines.
Great for vloggers working with recurring guests, wedding videographers with large events, or families organizing decades of home video.
Learn more about face recognition →How ClipCatalog video search works
ClipCatalog is designed for real-world libraries: project dumps, traveling SSDs, camera roll folders, and years of archive footage. It processes your clips locally and stores everything in a fast, searchable library — so searching feels instant later.
Point ClipCatalog at any folder. It processes every clip locally — generating thumbnails, detected content, speech transcripts, and optional face data.
Add one folder or many. ClipCatalog is built to handle big archives and external drives that get unplugged.
ClipCatalog generates thumbnails, extracts metadata, detects content on screen, builds transcripts, and (optionally) runs face recognition — all on your PC. GPU acceleration speeds things up.
Combine filters, preview results, and pull selects without rewatching hours. Save presets for your go-to “b-roll finder” searches.
More than keywords: advanced filters
When your library grows, “search” becomes a combination of content + context. ClipCatalog lets you layer AI-powered search with practical production metadata to narrow thousands of clips down to exactly what you need.
Technical filters
Filter by recording date, clip duration, resolution, frame rate, horizontal vs. vertical, audio/no-audio, camera name, and more. Narrow thousands of clips down to exactly what fits your project specs.


Smart footage type
ClipCatalog classifies every clip as dialog, voiceover, or scenic using smart heuristics. Instantly filter by footage type to find exactly the kind of shot you need — no manual labeling required.
Highlight Score
A smart heuristic ranks your clips by how engaging they are — so the most interesting footage floats to the top. Spend less time previewing and more time editing the good stuff.


Drone footage detection
Automatically identifies aerial shots across your library. Quickly limit results to drone or non-drone footage without manually labeling a single file.
Real searches creators use
Here are the kinds of queries that become possible once your library is indexed. Each one would be a painful manual hunt without searchable detected content and transcripts:
- beach
- Find b-roll that matches a memory
- birthday
- Find family moments quickly
- vertical
- Find Shorts/Reels-ready footage
- alex (face)
- Find every appearance of a person
- ALL · beach + sunset
- Detected content (All)
- ANY · dog + cat
- Detected content (Any)
- ALL · wedding + outdoor
- Detected content (All)
- ALL · launching + month
- Transcript words (All)
- ANY · alex + jamie (face)
- Faces (Any)
From video search to your editing timeline
Finding the right clip is only half the job. ClipCatalog is built for editing workflows so you can go from search results to your editing timeline without friction.
Drag & drop to your timeline
When you’ve found the clips you need, grab them from ClipCatalog and drop them right into your video editor. Multi-select is supported, so you can move entire batches at once. Drag & drop has been tested with CapCut, PowerDirector, Kdenlive, Premiere Pro, Vegas, and ShotCut. Many other editors will likely work fine too.
Or copy matches to any folder
Prefer to collect files first? Copy all matching clips to a location of your choice — great for handing off footage to collaborators or building project-specific folders.
Built-in video player
Preview clips right inside the app without switching windows. Quickly verify a shot before sending it to your editor.
Export transcripts
Download word-for-word captions as plain text or SRT subtitle files — ready for your editing timeline.
GPU-accelerated processing
Your GPU speeds up indexing via DirectML and Vulkan. Falls back to CPU automatically on any hardware. Learn more →
Built for serious footage libraries
Whether you have a few hundred clips or terabytes of raw footage across dozens of drives, ClipCatalog handles it. Your catalog stays searchable even when drives are unplugged.
Large library support
Import many terabytes of video. Lightning-fast thumbnail browsing with smart caching keeps everything responsive, even at massive scale.
External drives & traveling SSDs
Connect external drives, SD cards, or traveling SSDs — ClipCatalog remembers them. Search and plan edits even when the drive is disconnected. Learn more →
What to expect (and what it’s not)
ClipCatalog is a powerful tool, but it’s honest about what it can and can’t do. Here are a few things worth knowing before you get started.
ClipCatalog does the heavy lifting up front: thumbnails, detected content, transcripts, and optional faces are generated during indexing — not every time you search. After your library is processed, searches feel instant because results are already in your local library.
Detected content and transcripts are great for narrowing down clips, but accuracy can vary with motion blur, noisy audio, accents, and overlapping speech. The win is getting to “the right neighborhood” of clips quickly — then you pick the best take.
Your footage never uploads. The app downloads models on first run, and license checks happen periodically. After that, browsing and searching are local and fast. Learn about local-first privacy →
Build complex filter combinations once and save them as presets you can re-apply instantly — your “b-roll finder,” “interview puller,” or “drone-only” search, ready in one click.
Understanding video search for local files
Whether you call it video search, footage cataloging, or media asset management — the core problem is the same: you have hundreds or thousands of video files on your computer, and finding the right clip feels impossible.
Windows File Explorer can find video files by name, date, or size — but it can’t search by what’s in the video. If you named a file GH010234.MP4, good luck finding it months later. ClipCatalog adds the missing content layer: detected content, transcripts, and face data that make videos searchable by meaning, not just metadata.
Cloud-based video search tools require uploading your footage to someone else’s server. That means slow uploads, ongoing storage costs, and giving up control of your files. ClipCatalog processes everything on your machine — your footage never leaves your computer, and there are no recurring cloud fees.
Manual labeling means watching clips, writing descriptions, and maintaining spreadsheets or folder naming conventions. With automatic content detection, you point ClipCatalog at a folder and content labels appear during processing — no naming discipline required, no per-clip effort.
ClipCatalog supports 10 languages for both the UI and AI-generated content labels: English, German, Spanish, French, Portuguese, Japanese, Korean, Chinese, Russian, and Arabic. Your system language is detected automatically, and labels are translated so you search in your native language.
Frequently asked questions
No. ClipCatalog is local-first: your video files stay on your computer. Detected content, transcript generation, and optional face recognition run locally on your own hardware.
You can search by detected content (what's on screen), transcript words (what was said), and optional face recognition (who appears) — plus production metadata like date ranges, camera names, duration, resolution, frame rate, folders, and more.
Not yet. Transcript search is word-based (single spoken words), not exact phrases or in-order quotes. To narrow results, use All/Any matching (AND/OR) when combining multiple terms.
Yes. ClipCatalog is designed for footage on internal drives and external SSDs. Your catalog stays searchable even when a drive is unplugged.
Core indexing, browsing, and searching run locally. On first startup the app downloads AI models, and license validation needs internet from time to time.
It depends on your library size and your hardware. Indexing is a one-time processing step per clip (thumbnails, detected content, transcripts, optional faces). After that, searches feel fast because results are already in your local library.
ClipCatalog is built for editing workflows, including drag-and-drop into editing software. Drag & drop has been tested with CapCut, PowerDirector, Kdenlive, Premiere Pro, Vegas, and ShotCut. Many other editors will likely work fine too.
ClipCatalog runs on Windows 10/11. A capable GPU speeds up processing via DirectML (content detection) and Vulkan (transcription), but the app falls back to CPU automatically. You don’t need special hardware to get started.
Windows search finds files by name or basic metadata. ClipCatalog goes deeper — it indexes what’s visible on screen, what was said, and who appears, so you can search by content instead of file names.
Your library is stored in an encrypted SQLite database (SQLCipher) on your machine. Automatic hourly backups protect your index, and your database key can be escrowed for recovery.
Even more powerful together
Video search is the core experience, but the real advantage is combining it with other search dimensions in ClipCatalog to go from thousands of clips to exactly what you need.
Search by what’s on screen — scenes, objects, and actions, detected automatically.
Find clips by what was said — perfect for interviews, sound bites, and voiceover takes.
Find every appearance of a person across years of footage.
Search clips across archive drives — even when they’re unplugged.
Best for
- Family & travel archivists organizing years of footage.
- YouTubers & vloggers reusing b-roll and past shoots.
- Filmmakers & editors working with TB-scale projects.
Try it with one folder
The best way to see if ClipCatalog works for your footage: process a single project folder or shoot day, then try to retrieve 5–10 “I know I shot this somewhere” moments using detected content and transcript search.
Try ClipCatalog free — up to 500 videos
No account required. Your footage stays on your computer.