Vendor Data Ingestion, Processing & Standardization Engine
Infrastructure supporting two ingestion paths: high-throughput manual templates for standardized data, and AI-driven processing for complex, unstructured vendor documents.
Admin uploads CSV following strict schema.
API verifies data types and constraints immediately.
Records enter Products_Staging with a confidence score of 1.0.
Bypasses AI processing for maximum throughput.
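The template path above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the column names (sku, name, unit_price), the validation rules, and the validate_csv helper are hypothetical, and an in-memory list stands in for Products_Staging.

```python
import csv
import io
from decimal import Decimal, InvalidOperation


def _non_empty(value: str) -> str:
    """Reject blank cells; return the trimmed text."""
    value = value.strip()
    if not value:
        raise ValueError("must not be empty")
    return value


def _positive_decimal(value: str) -> Decimal:
    """Parse a finite decimal number greater than zero."""
    try:
        number = Decimal(value)
    except InvalidOperation:
        raise ValueError(f"not a number: {value!r}") from None
    if not number.is_finite() or number <= 0:
        raise ValueError("must be a finite number greater than zero")
    return number


# Hypothetical strict schema: column name -> parser that raises ValueError.
SCHEMA = {"sku": _non_empty, "name": _non_empty, "unit_price": _positive_decimal}


def validate_csv(text: str):
    """Return (staged_records, errors) for one uploaded CSV document."""
    reader = csv.DictReader(io.StringIO(text))
    if reader.fieldnames != list(SCHEMA):
        raise ValueError(f"unexpected header: {reader.fieldnames}")
    staged, errors = [], []
    # Data rows start on line 2 because line 1 is the header.
    for line_no, row in enumerate(reader, start=2):
        try:
            if None in row:  # DictReader stores surplus cells under None
                raise ValueError("too many columns")
            record = {col: parse(row[col] or "") for col, parse in SCHEMA.items()}
        except ValueError as exc:
            errors.append((line_no, str(exc)))
            continue
        # Template rows skip AI processing, so they are staged at full confidence.
        record["confidence"] = 1.0
        staged.append(record)
    return staged, errors
```

Calling validate_csv on the upload text returns the rows ready for staging alongside a list of (line number, message) pairs for rejected rows, so the API can report problems immediately.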
PDFs, catalogs, and Excel files are uploaded to the Data Lake.
Azure Document Intelligence extracts tables and key-value pairs.
An LLM (Claude or an OpenAI model) analyzes the extracted structure to map fields to the schema.
AI assigns each mapping a 0.0-1.0 confidence score to guide human review.
Valid mappings are saved to automate future uploads.
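The last two steps of the AI path (confidence scoring and mapping reuse) might look like the sketch below. Everything here is an assumption made for illustration: the FieldMapping and MappingStore types, the triage function, the 0.85 review threshold, and the idea of keying saved mappings by vendor and source field. The extraction and LLM calls are left out; the sketch starts from mappings that have already been proposed.

```python
from dataclasses import dataclass

# Hypothetical cutoff: proposals scoring below this go to a human reviewer.
REVIEW_THRESHOLD = 0.85


@dataclass(frozen=True)
class FieldMapping:
    source_field: str   # column or key name as it appears in the vendor document
    target_field: str   # schema field proposed by the AI
    confidence: float   # 0.0-1.0 score assigned by the AI


class MappingStore:
    """In-memory stand-in for wherever validated mappings are persisted."""

    def __init__(self):
        self._saved = {}  # (vendor_id, source_field) -> target_field

    def save(self, vendor_id: str, mapping: FieldMapping) -> None:
        self._saved[(vendor_id, mapping.source_field)] = mapping.target_field

    def lookup(self, vendor_id: str, source_field: str):
        return self._saved.get((vendor_id, source_field))


def triage(vendor_id: str, proposals, store: MappingStore):
    """Split proposals into accepted mappings and ones needing human review."""
    accepted, needs_review = {}, []
    for proposal in proposals:
        if not 0.0 <= proposal.confidence <= 1.0:
            raise ValueError(f"confidence out of range: {proposal.confidence}")
        known = store.lookup(vendor_id, proposal.source_field)
        if known is not None:
            # A previously validated mapping wins over a fresh AI guess.
            accepted[proposal.source_field] = known
        elif proposal.confidence >= REVIEW_THRESHOLD:
            accepted[proposal.source_field] = proposal.target_field
        else:
            needs_review.append(proposal)
    return accepted, needs_review
```

Once a reviewer approves a queued proposal, calling store.save(vendor_id, proposal) records it, and the next upload from that vendor resolves the same source field without review.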