AdviceOS API

Document Processing Pipeline Documentation

Orchestrator

The batch-workflow coordinates all document processing stages.

batch-workflow

Workflow

Orchestrates the full document processing pipeline: parse, classify, extract, second-pass, normalize, and rename.

D1R2Workflow6 Service Bindings

Workers

Individual processing services called by the orchestrator.

llama-parse

Worker

PDF and document parsing via LlamaCloud API with sync, async, and queue-based processing.

D1Queue

document-classification

Worker

Classifies documents against Fibery artifact catalog using hybrid search, reranking, and LLM agent review.

D1QueueOpenRouter

extraction-worker

Worker

Structured data extraction from documents using LlamaCloud Extract with auto-generated schemas.

D1QueueDLQ

second-pass-extraction

Worker

Gap-finding second-pass extraction that discovers missing data points via naive vs. first-pass comparison.

D1QueueDLQ

data-normalization

Worker

Cross-document field normalization and conflict resolution with heuristic and LLM auto-resolution.

D1QueueDLQ

document-renaming

Worker

Renames documents in R2 storage based on classification and normalized data with revert support.

D1R2QueueDLQ

fibery-relay

Worker

Syncs AdviceOS artifacts and attributes from Fibery workspace to D1 database.

D1QueueWorkflowCron