AI Document Search

Stop messy PDFs breaking your AI automationsSimply upload and we'll handle the rest

Ragextract is vision-based document search and retrieval pipelinebuilt for searching, citing and extracting from messy PDF, DocX and PPTX

Insurance Policies

Supplier Invoices

Financial Statements

Reconciliation Spreadsheets

Deep Research

Court Documents

Tenders & RFPs

Slide Decks

Sales Presentations

SEC Filings

Bills

Commercial Property Contracts

Legal Documents

Construction

Academic Papers

Technical Manuals

Get Started For Free

Request a Demo

14 Days Free Trial

GDPR Compliant

30 days Refund Policy

We turn your complex documents into an API

Solutions & Use Cases

AI document processing that scales as you grow

The missing piece for infrastructure for your AI agents

Cost reduction when parsing applications, claim forms and insurance Policies

For Insurance, Underwriters and Brokers

Insurance managers wanting better control over their automation budgets

Development teams looking to reduce infrastructure costs and delivery reliably

Insuratech startups who need to launch to market faster

Productivity boost when handling multi-page contracts, court documents and research

For Legal Teams, Lawyers and Consultants

Legal departments needing to increase their document processing capacity

Consultants who need answers quickly and accurately from case files and court documents

Legal Firms to expand AI usage for their workflows

hrs

Time saved per month in validation and extraction jobs From tenders, quotes and contracts

For Property Managers and Sales Teams

Sales managers who want to expand their volume of tenders and leads in review

Property managers who need to their time back to focus on more important tasks

Proptech companies who process technical site plans, tables and architecture diagrams

How it works

Just drop in your files and we'll do the rest

Seemlessly integrates into your existing workflows

Simple REST API that works with n8n, Make, Zapier and more

Upload documents directly to Ragextract and we'll take care of the storage, splitting, vectorizing and indexing in a matter of seconds.

curl -X POST https://api.ragextract.com/v1/vectorize \
  --header 'x-api-key: $RAGEXTRACT_API_KEY' \
  --form "file=insurance_claim_application.pdf"

Search across 100s of pages instantly using natural language and Ragextract will return only the relevant ones (score-based). No more praying you'll hit the right keywords and images also work - it's all contextual!

curl -X POST https://api.ragextract.com/v1/search \
  --header 'x-api-key: $RAGEXTRACT_API_KEY' \
  --data '{
    "query": "Find motor vehicle details and registration of all parties involved in claim",
  }'

Finally, pass those matching pages into an VLM/LLM of your choice. You decide how and what gets extracted depending of your use-case. Ragextract keeps your data safe from leaks by serving them as dynamic and expirying links.

curl https://api.openai.com/v1/responses \
  --header "Content-Type: application/json" \
  --header "Authorization: Bearer $OPENAI_API_KEY" \
  -d '{
    "model": "gpt-4.1",
    "input": [
      {
        "role": "user",
        "content": [
          { "type": "input_text", "text": "Extract claimant details in the following structure..." },
          {
            "type": "input_image", // Use Ragextract result here!
            "image_url": "https://api.ragextract.com/v1/share/dsx_B5bsOBDzsXsqfmLo?token=kCThsH"
          }
        ]
      }
    ]
  }'

Why use Ragextract?

The smarter way to increase document processing capabilitiesenabling your team to do more with less

Search first, extract later

Prioritizing AI Document Search to optimise long document extraction

Ragextract challenges the popular document processing model of other providers by avoiding the "parse-first-search-later" approach. In fact, we do the opposite! We believe our method is faster, cheaper and less wasteful.

Instant large document pipeline

Expand beyond LLM context windows to enable smarter and better tuned AI agents

Ragextract handles the long and large documents that most LLMs typically struggle with - scanned Pdfs, technical manuals, image heavy reports etc. Our users typically prefer our more advanced RAG and retrieval setup as they can do more with the output.

Full control for better results

Specially designed for real world document scenarios just like yours

Not only does this reduce our costs significantly, our belief is that bundling models is unnecessary as AI Developers actually prefer to handle their own prompts. Ragextract is built for developers for more challenge document extraction workloads.

Enhances low code platforms

Push your automation further ensuring complex documents won't break your workflow

Many low code platforms aren't designed for heavy document / high cpu-consumption tasks. Typically, they exhaust their limits which then crash your instance. Ragextract pairs perfectly with these services to offer the necessary processing power to make business flow.

Saves you time and resources

From 3 months to 3 days, get your project up and running with just an API key

Document processing infrastructure consumes a large amount of your team's time, budget and reputation if not done correctly. Adopting Ragextract means you can focus on building a great AI experience for your users and reduce development time by weeks.

Pricing

Simple Pricing that Scales as You Grow

All Features. All APIs. No Hidden Fees.

Monthly

Annually

Save

Save 100%

$15/month

/month

Billed $180 $0 annually

Starter

A limited time offer to use Ragextract for free! For internal operations, smaller documents and lighter workloads.

Features

Document processing for up to 100 pages

Multipart upload API for PDF, Docx, Pptx and Xlsx

Secure file storage and page-level retrieval API

Multimodal embeddings support and vector store

Document search API for text and image

Compatible with your favourite AI agents and workflows

Usage

Shared worker pool

10mb upload size limit

100 pages per document limit

100mb storage

14 day data retention

3 concurrent jobs

per workspace

1000 searches

per month

50000 retrievals

per month

Organisation

Up to 5 team members

Up to 30 workspaces

3 API keys

per workspace

Admin, manager, developer & readonly roles

SSO/OAuth Logins

Support

Community forum

Email support on best effort basis

No phone support or SLAs offered

Get Started For Free

Card check required

Save 20%

$149/month

$119.20

/month

Billed $1788 $1430.40 annually

Standard

Best for daily documents, startups or integrating into proprietary and commercial apps.

Features

Large document processing pipeline for 100+ pages

Multipart upload API for PDF, Docx, Pptx and Xlsx

Secure file storage and page-level retrieval API

Multimodal embeddings support and vector store

Document search API for text and image

Compatible with your favourite AI agents and workflows

Usage

Shared worker pool

300mb upload size limit

500 pages per document limit

30gb storage

90 days data retention

20 concurrent jobs

per workspace

15000 searches

per month

750000 retrievals

per month

Organisation

Up to 30 team members

Up to 150 workspaces

10 API keys

per workspace

Admin, manager, developer & readonly roles

SSO/OAuth Logins

Support

Community forum

Email and chat support

Phone support and SLAs available (+fees)

Get Started For Free

14 day free trial. Card required.

Dedicated Support

Base Price

Custom

Quote tailored to requirements

On-prem or custom build

For stricter regulatory requirements where own-cloud or on-prem is desired. Tailored resources to match need.

Features

Large document processing pipeline for 100+ pages

Multipart upload API for PDF, Docx, Pptx and Xlsx

Secure file storage and page-level retrieval API

Multimodal embeddings support and vector store

Document search API for text and image

Compatible with your favourite AI agents and workflows

Usage

Dedicated worker pool

Custom upload size limit

Custom pages per document limit

Custom storage

Custom data rentention

Custom number of searches

Custom number of retrievals

Organisation

Custom quantity of team members

Custom quantity of workspaces

Custom quantity of API keys

per workspace

Custom concurrent jobs

per workspace

Admin, manager, developer & readonly roles

SSO/OAuth Logins

Support

Community forum

Email and chat support

Phone support and SLAs available (+fees)

Get a Quote

or email us at sales@subworkflow.ai

14 Days Free Trial

Cancel Anytime

30 days Refund Policy

Migration Support Available

Help available for your document AI workflows!

Ragextract™ is brought to you by the document AI specialists at Subworkflow. We help automation teams, startups and small businesses get them their time back with simple AI automations that are easy to understand and maintain.

Book a free consultation to discuss your project or automation need!

FAQs

Not Finding What You're Looking For? Contact Us!

What is Ragextract?

Ragextract is an AI-powered document search service which let's users upload documents and search them semantically afterwards via API. Ragextract helps handle upload, storage, indexing, embeddings, vector stores and providing retrieval and search APIs compatible with AI agents.

Is Ragextract right for my use case?

Ragextract is a key abstraction for all AI document workflows where partial extraction of the contents is required. If you need to search for answers - not just keywords - within one or more documents then Ragextract is a perfect fit! Examples include extracting claimant details in insurace forms, filtering transcripts in case files and finding special clauses or signals in tenders or contracts.

Do you have a free plan?

Yes! Our free plan is designed to let you evaluate Ragextract but also functional for internal operations and occassional (weekly or monthly) documents processing. We recommend up upgrading for more capacity and production use cases.

How many documents can I upload per month?

As many as you like... just as long as you stay within the designated storage limit. We don't charge by the number of pages so whether 10k pages or 100k pages, the price stays the same.

Can I cancel my subscription at any time?

Yes of course! We want you to be happy to stay with us so if anything isn't working right, reach out and let's see if we can fix it or implement a new feature for you. Just head over to the billing page and you can manage your subscription from there.

Why is a credit card needed to sign up?

Though we'd like to offer an easier alternative, credit card registration is one of the few productive ways to protect the service from sign-up spam and fake users. This in-turn ensures better stability for the all users on our platform.

Will I get charged at the end of my trial?

Yes. If you're happy to continue with Ragextract, we'll automatically handle the billing and you don't need to take any further action. If it turns out Ragextract isn't for you, please cancel your subscription before the trial expires.

I'm not sure if Ragextract is for me. Can I get a demo?

Yes! Please reach out to us at sales@subworkflow.ai and we'll get back to you as soon as we can during business hours. Please include details of your organisation, team size and high level brief of what you're looking for as this will help us tailor the demo for you.

Ragextract is a AI document search for AI agents

Everything Else

Follow for Updates

Stop messy PDFs breaking your AI automationsSimply upload and we'll handle the rest

Solutions & Use Cases

How it works

Why use Ragextract?

Pricing

Subworkflow - Automation services for Startups and SMEs

FAQs