Blogs
Blogs
Vision Language Models (VLMs) such as GPT-4o and Claude-3.5 have done well and continue to improve at textual tasks but they still struggle with visual tasks. For example, let’s ask these VLMs to count the number of missing soda cans in this image: The Soda Can Puzzle Failure Claude-3.5 (tested on 1/1/2025): “Looking at the […]
Blogs - Developers
Introduction If you’ve ever tried to extract meaningful data from PDFs—especially documents with complex layouts like tables, charts, or forms—you’ve likely run into OCR’s limitations. OCR is great for raw text, but it ignores structural relationships critical for true comprehension. Enter Agentic Document Extraction: Instead of flattening everything into text, it retains visual and spatial […]
Blogs
Discover how LandingLens' Vision Model significantly improved retinopathy classification in a benchmarking study using an open-access fundus image dataset. Achieving an impressive 92.1% F1 score, this result demonstrates the power of LandingLens in advancing AI models for medical applications with minimal setup and just 2 hours of work.
Blogs
Agentic Object Detection (OD) is one of the tools available to developers within VisionAgent. It achieves highly competent zero-shot object detection on complex tasks by reasoning agentically about images. By applying agentic patterns such as planning, code generation, and tool use, Agentic OD can reliably detect everyday objects (e.g. “person”, “motorcycle”) as well as more […]
Blogs - Developers
I recently decided to sell some of my old stuff online—shoes, furniture, random trinkets—and quickly realized how embarrassingly messy my apartment looked in photos. Think dustbins in the corner, socks tossed around, and random hair tufts on the floor. Definitely not the photo aesthetic that screams “Buy this now!” Instead of tidying up (who has […]
Blogs - News
This solution architecture helps you build and deploy Visual AI solutions in Snowflake for Healthcare & Life Sciences. Some of the use cases are described below.
Blogs
XBuild: Revolutionizing Construction Management with AI Automation XBuild is an AI-powered construction platform that automates scope generation and insurance supplementation for contractors. By streamlining these tasks, XBuild enables contractors to undertake larger projects and secure faster approvals while minimizing overhead costs. Documentation Roadblock: Manual Image Analysis Challenges XBuild’s application requires users to submit detailed damage […]
Get LandingAI's Monthly Newsletter
Stay updated with the latest computer vision and AI news and resources delivered to your inbox.