Supported File Formats
Picsha AI's ingestion pipeline is designed to handle a vast array of media types. From raw camera files to heavy design assets, our system automatically generates web-optimized proxies, extracts metadata, and prepares files for seamless delivery.
Ingest Formats
Standard Images
These formats are optimized and converted to next-gen formats (like WebP) for high-speed delivery.
- JPEG / JPG
- PNG
- WebP
- HEIC / HEIF
Complex & RAW Images
Heavy design files and raw camera outputs are automatically processed by our backend cluster (via ImageMagick and LibRaw) into standard web-safe proxy JPEGs, ensuring they can be viewed and analyzed by AI without compromising the original file.
- PSD (Adobe Photoshop)
- AI (Adobe Illustrator)
- EPS / PostScript
- RAW Camera Formats:
.cr2(Canon),.nef(Nikon),.arw(Sony),.dng(Digital Negative) - TIFF / TIF
- BMP
- SVG (Vector Graphics)
Documents
Text is automatically extracted for semantic search, and multi-page documents generate visual poster frames.
- PDF: Extracts text and creates high-quality preview posters.
- Word / DOCX: Extracts text and automatically converts to web-friendly PDFs.
- PowerPoint / PPTX: Extracts text from slides and automatically converts to web-friendly PDFs with preview posters.
- Text Files:
.txtand other plain-text formats.
Video & Audio
Media files undergo specialized processing to enable adaptive streaming and AI transcription.
- Video Formats: (e.g., MP4, MOV, OGG/OGV) Processed via AWS MediaConvert for Adaptive HLS streaming and automatic thumbnail generation. Transcribed by AI if speech is present.
- Audio Formats: (e.g., MP3, WAV, OGG) Transcribed automatically using AI for deep semantic search.
Output Formats
When requesting assets via the Delivery API, Picsha dynamically serves the best format based on the original content type and user request.
Image Outputs
- Original Format: The unmodified uploaded file (e.g., the original RAW or PSD file).
- WebP (
optimized.webp): Optimized, compressed next-gen format for rapid web delivery. - JPEG (
proxy.jpgorposter.jpg): Web-safe proxies for complex/RAW images and static preview frames for documents and videos.
Document Outputs
- Original Format: The unmodified uploaded document.
- PDF (
web.pdf): Dynamically converted PDF versions of non-PDF documents (like Word or PowerPoint) to ensure native web browser previewing.
Video Outputs
- Original Format: Directly serve the source video file.
- HLS Stream (
playlist.m3u8): Adaptive bitrate streaming playlist generated for seamless playback across different network conditions. - JPEG Poster: Static preview frame extracted from the video timeline.
Audio Outputs
- Original Format: The unmodified audio file.
- Audio Extract: Extracted audio tracks used internally for transcription.