Agentic Object Detection and Document Extraction with Landing.ai

Published: July 22, 2025 at 2:05 PM UTC+0200
Last edited: July 22, 2025 at 5:48 PM UTC+0200
Author: Richard Djarbeng

This week, I dive into agentic object detection and document extraction using tools from Landing.ai, one of Andrew Ng’s innovative startups! Inspired by Andrew Ng’s recent post on X about their blazing-fast text extraction upgrades, I put their tech to the test. Here’s what I found:

Agentic detection with landing ai cover image

Agentic Object Detection

Forget training models with tons of coffee cup images! Just describe the object, and the model nails it. Simple, smart, and efficient. For example, I noticed one of the coffee cups has a design made with milk that looks like a tree leaf, so I asked it to detect the ‘coffee in a cup with plant design’, and it successfully identified those cups. This differs from typical computer vision tasks (e.g., object detection or instance segmentation) where models are trained on specific object classes like cars or license plates.

coffee cups with plant design detected by landing ai

In a screen recording, I asked it to detect ‘windows with room lights on’ in a picture of a building, and it highlighted all of them correctly in my test. Similarly, using the singular ‘building’ (as instructed by the app) on a skyline image detected every building. Besides drawing bounding boxes in the UI, it also provides JSON output with coordinates for API use.
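As a rough sketch of how that JSON output might be consumed downstream, the snippet below filters detections by confidence and computes each box's pixel area. The payload shape (`detections`, `label`, `score`, `box` as `[x_min, y_min, x_max, y_max]`) is an assumption made for illustration, not Landing.ai's documented schema, and `confident_boxes` is a hypothetical helper.

```python
import json

# Hypothetical payload: these field names are assumptions for illustration,
# not Landing.ai's documented response schema.
SAMPLE_RESPONSE = """
{
  "detections": [
    {"label": "window with room lights on", "score": 0.91, "box": [120, 40, 180, 110]},
    {"label": "window with room lights on", "score": 0.42, "box": [300, 60, 350, 125]}
  ]
}
"""


def confident_boxes(raw_json, min_score=0.5):
    """Return (label, box, area) tuples for detections scoring at least min_score."""
    results = []
    for det in json.loads(raw_json)["detections"]:
        if det["score"] < min_score:
            continue  # skip low-confidence detections
        x_min, y_min, x_max, y_max = det["box"]
        area = (x_max - x_min) * (y_max - y_min)
        results.append((det["label"], det["box"], area))
    return results


if __name__ == "__main__":
    for label, box, area in confident_boxes(SAMPLE_RESPONSE):
        print(f"{label}: box={box}, area={area} px")
```

If the real response uses different keys or a different coordinate order, only the two lines that read `det["score"]` and `det["box"]` would need to change.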

landing ai detecting windows with lights on

Agentic Document Extraction

Prompted by Andrew Ng’s tweet, I tested document extraction. It handled an invoice, outputting details in markdown or JSON, and a lab report with images and mixed layouts (two-column and single-column) effortlessly. It even described the logo and formatted results consistently. Here is a screenshot from the video showing the extracted text:

Extracted text from landing ai agentic document extraction

Andrew Ng’s X Post

Embedded below is Andrew Ng’s tweet:

He noted: “Agentic Document Extraction just got much faster! From previous 135sec median processing time down to 8sec. Extracts not just text but diagrams, charts, and form fields from PDFs to give LLM-ready output. Please see the video for details and some application ideas.”
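To put those two numbers in perspective, here is a quick back-of-the-envelope calculation. It uses only the median times quoted in the post; the variable names are mine.

```python
old_median_s = 135  # previous median processing time, as quoted in the post
new_median_s = 8    # new median processing time, as quoted in the post

speedup = old_median_s / new_median_s
reduction_pct = (1 - new_median_s / old_median_s) * 100

print(f"speedup: {speedup:.1f}x")
print(f"time saved: {reduction_pct:.1f}%")
```

That works out to roughly a 17x speedup, or about 94% less waiting on the median document.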

Importance and Potential Use Cases

Why This Matters

The advancements in agentic object detection and document extraction from Landing.ai are significant because they simplify complex tasks that were previously time-consuming or required specialized knowledge. For the average person, this means tools that can understand and process visual and textual information in ways that mimic human intuition but with greater speed and accuracy. This technology can transform how we interact with digital content, making it more accessible and useful in everyday life.

Potential Use Cases

Here are some ways this technology can benefit you, regardless of your technical background:

  1. Personal Organization and Productivity
  2. Education and Learning
  3. Home and Lifestyle
  4. Business and Entrepreneurship
  5. Accessibility and Inclusion
  6. Everyday Simplification

These use cases demonstrate how Landing.ai’s technology can simplify tasks, save time, and open up new possibilities for personal and professional growth. Whether you’re organizing your home, learning something new, or running a small business, these tools can make your life easier and more efficient. Imagine the time you’d save if this technology could handle your document clutter or help you find that perfect photo from years ago—it’s not just for tech experts; it’s for anyone looking to make their day-to-day easier.

Landing.ai Video Demo

Check out this video by Richard demonstrating the technology (note: the detection footage is sped up, so actual performance may be slower):

Playground

If you want to try this yourself, visit Landing.ai and use their playground.

Side Note: Landing.ai Support

I reported a non-working ‘Start for free’ button on their site. I received this email:

Hi Richard,
I hope all is well, and thank you for reaching out. I sincerely appreciate you letting us know the “Start for free” button isn’t working! The team is working on fixing it as we speak.
Best,
***

Then Adrian from Landing.ai confirmed via LinkedIn that the team is addressing the issue.

Landing ai thanks Richard for noticing issue

It seems fixed now—glad to help, and impressed by their proactive response!