Found Description
Role Overview:
We are looking for an AI Developer who can build intelligent document processing pipelines — primarily focused on extracting structured and unstructured data from PDFs, with principles that extend to image-based inputs. You will design and ship OCR-powered solutions that turn raw documents (contracts, forms, reports) into clean, queryable data, and integrate LLM reasoning layers on top using OpenAI and Anthropic APIs.
This is a hands-on engineering role. You will own the full pipeline: ingestion, OCR, parsing, prompt engineering, and API delivery via FastAPI.
What You'll Do:
Document Intelligence & OCR:
- Design and build end-to-end PDF and document extraction pipelines (flat text and structured output).
- Select and implement the right OCR strategy per document type — native PDF text layer, layout-aware parsing, or image-based OCR.
- Parse complex layouts: multi-co...