Tutorial
Tutorial
Ava Xia
October 2, 2025
Today we are sharing a powerful solution for a high-volume document extraction pipeline which lands data into Snowflake. Built specifically for Snowflake Data Engineers, the GitHub repo provides an end-to-end workflow which provides performance at scale: cost‑efficient document processing in the cloud and streaming ingestion into Snowflake. This agentic document extraction workflow offers: We’ll walk […]
Tutorial
Ava Xia
September 16, 2025
This tutorial goes step by step into the process of building a MCP Server for Intelligent Document Processing using LandingAI's Agentic Document Extraction
Tutorial

Reasoning models are good at thinking over text but documents aren’t just text. PDFs are visual artifacts—tables, columns, captions, footnotes—and flattening them erases structure and invites errors. This post shows how Model Context Protocol (MCP) lets an agent discover and call LandingAI’s Agentic Document Extraction (ADE) for layout-aware parsing.
Tutorial

Ankit Khare
August 28, 2025
Turn complex lab reports into clean, structured data ready for analysis and dashboards using zero-shot parsing and schema-guided extraction
Tutorial

Ankit Khare
August 14, 2025
Agenda Introduction Public companies in the United States must file an annual Form 10‑K with the U.S. Securities and Exchange Commission (SEC). The 10‑K is a comprehensive financial report that includes a company’s history, organizational structure, financial statements, earnings per share and other disclosures[1]. These filings provide investors with a detailed snapshot of a company’s operations […]
Tutorial

Ankit Khare
May 29, 2025
I’m sure—like myself and millions of others—you’ve spent hours copying and pasting the same information into job applications 🥱🎯. In this tutorial, we’ll walk through how you can build a powerful automation tool that extracts key data from your resume and fills out LinkedIn job applications for you—automatically! Thanks to LLMs, “developer” is now a […]
Tutorial

Ankit Khare
April 22, 2025
Overview: What You’ll Learn Complete code for the tutorial is available on GitHub — follow along and run the example app yourself👨🏼💻. Figure 1: A simple “Chat with PDF” app. The user has uploaded a PDF and the Agentic Document Extraction Python library has precomputed structured data for each page. Figure 2: The app’s response […]
Tutorial

Ankit Khare
March 31, 2025
Introduction Modern Large Language Models (LLMs) have revolutionized text analysis—until they encounter the complexities of PDFs. PDFs often feature intricate layouts, visual elements, flowcharts, images, and tables with interdependent contexts and relationships. This is where Agentic Document Extraction truly stands out. In Part 1, we demonstrated examples where traditional LLMs struggled, while Agentic Document Extraction […]
Tutorial

Ankit Khare
March 21, 2025
Introduction If you’ve ever tried to extract meaningful data from PDFs—especially documents with complex layouts like tables, charts, or forms—you’ve likely run into OCR’s limitations. OCR is great for raw text, but it ignores structural relationships critical for true comprehension. Enter Agentic Document Extraction: Instead of flattening everything into text, it retains visual and spatial […]
Get LandingAI's Monthly Newsletter
Stay updated with the latest computer vision and AI news and resources delivered to your inbox.