The Technology

From Documents to Answers

No AI expertise needed. No code required. Upload your documents, start asking questions.

1. Upload Your Documents

Bring your knowledge. We handle the rest.

Supported Formats

PDF, DOCX, TXT, HTML, Web Pages

Upload Methods

Drag & Drop

Simple browser upload. Select files or drop them into the upload zone.

Bulk Import

Upload entire folders at once. Hundreds of documents in a single operation.

Pro

Web Scraper

Auto-ingest entire websites. Point us at a URL and we crawl it for you.

REST API

Programmatic document management. Integrate uploads into your own systems.

What Should You Upload?

Think of it like building a library for your AI assistant. The more relevant material you give it, the more questions it can answer accurately.

Internal policies & handbooks
Contracts & agreements
Legal documents & regulations
Technical manuals & specs
Standard operating procedures
Customer FAQs & support docs
Training & onboarding materials
Industry regulations & compliance
Meeting notes & decision logs

From Chaos to Clarity

Every format. Every language. One pipeline.

Whether it's a scanned PDF from 2008 or a freshly exported Word document, CorpusAI's parser extracts clean text while preserving tables, headers, and footnotes. Your documents go in messy — and come out structured.

2. AI Processes & Learns

Your documents go through a five-stage pipeline — fully automatic.

1. Document Parsing

Clean text extraction from every supported format. Tables, headers, and footnotes are preserved intact.

2. Smart Chunking

Semantically meaningful splits — not dumb page breaks. Context stays together so answers make sense.

3. Embedding

Each chunk is converted into a 768-dimensional vector, a mathematical "fingerprint" that captures its meaning, so passages with similar meaning end up close together. Think of it as DNA for text.

4. Vector Indexing

Stored in a Qdrant vector database with HNSW indexing for millisecond-speed search across thousands of documents.

5. Quality Check

Validates that every chunk is searchable and traceable to its source document. Bad data never makes it into your corpus.
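For readers who want a feel for what the Smart Chunking and Quality Check stages involve, here is a minimal sketch in Python. It is an illustration under stated assumptions, not CorpusAI's actual pipeline: the names `chunk_paragraphs`, `Chunk`, and `quality_issues` are hypothetical, paragraphs (blank-line breaks) stand in for "semantically meaningful" units, and the specific checks are guesses at what "searchable" might mean in practice. Only the 768-dimension figure comes from the text above.

```python
from dataclasses import dataclass

EMBEDDING_DIM = 768  # vector size stated in the Embedding stage


def chunk_paragraphs(text: str, max_chars: int = 500) -> list[str]:
    """Pack whole paragraphs into chunks of at most max_chars characters.

    Assumption: a paragraph is the smallest unit of meaning, so a paragraph
    is never split, even if it is longer than max_chars on its own.
    """
    paragraphs = [p.strip() for p in text.split("\n\n") if p.strip()]
    chunks: list[str] = []
    current = ""
    for para in paragraphs:
        candidate = f"{current}\n\n{para}" if current else para
        if len(candidate) <= max_chars or not current:
            # The paragraph fits, or the chunk is empty and we keep it whole.
            current = candidate
        else:
            # Adding the paragraph would overflow: close the chunk, start anew.
            chunks.append(current)
            current = para
    if current:
        chunks.append(current)
    return chunks


@dataclass
class Chunk:
    text: str
    source_file: str
    embedding: list[float]


def quality_issues(chunk: Chunk) -> list[str]:
    """Return a list of problems; an empty list means the chunk passes."""
    issues = []
    if not chunk.text.strip():
        issues.append("empty text")
    if len(chunk.embedding) != EMBEDDING_DIM:
        issues.append("wrong embedding size")
    elif not any(chunk.embedding):
        issues.append("all-zero embedding")
    if not chunk.source_file:
        issues.append("missing source file")
    return issues


SAMPLE = (
    "Refunds are available within 14 days.\n\n"
    "Digital products must not be downloaded.\n\n"
    "Contact support to start a refund."
)
chunks = chunk_paragraphs(SAMPLE, max_chars=90)
```

With a 90-character limit, the first two paragraphs of `SAMPLE` share a chunk and the third starts a new one, so no sentence is cut in half. A chunk with blank text, a short vector, or no source file would be reported by `quality_issues` rather than indexed.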

Processing Speed

Most documents are ready in seconds. Large batches (1,000+ docs) complete within minutes. You can start asking questions as soon as the first documents finish processing.

Privacy First

All processing happens on our EU servers (Hetzner, Germany & Finland). Your data never leaves European infrastructure. No third-party APIs are used for processing.

3. Ask & Get Answers

Plain language in, sourced answers out. Every time.

How a Question Becomes an Answer

1. You type a question in plain language

No special syntax, no boolean operators. Just ask like you would ask a colleague.

2. Your question is embedded into the same vector space

The same embedding model converts your question into a 768-dimensional fingerprint.

3. Vector search finds the most relevant chunks

Like finding the nearest stars to a point in space — the closest chunks in meaning are retrieved in milliseconds.

4. A large language model reads the chunks and writes your answer

Synthesizes the retrieved passages into a clear, coherent response grounded in your documents.

5. Answer includes citations: exactly which document and section

No black box. Every answer links back to the source material so you can verify and trust the response.
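The retrieval and citation steps above can be sketched roughly as follows. This is a simplified illustration, not the production system: it uses tiny 3-dimensional vectors instead of 768-dimensional ones, a brute-force scan instead of an HNSW index, and assumes cosine similarity as the measure of "closeness" (the page does not say which metric is used). The function names `cosine_similarity`, `top_k`, and `build_prompt`, the prompt wording, and the sample data are all hypothetical.

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Similarity of two vectors by angle: 1.0 means same direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(query: list[float], corpus: list[dict], k: int = 2) -> list[dict]:
    """Return the k chunks whose vectors are closest to the query vector."""
    ranked = sorted(
        corpus,
        key=lambda chunk: cosine_similarity(query, chunk["vector"]),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(question: str, retrieved: list[dict]) -> tuple[str, list[str]]:
    """Number each retrieved chunk so the model can cite it as [n]."""
    lines = ["Answer using only the numbered sources below and cite them as [n].", ""]
    citations = []
    for n, chunk in enumerate(retrieved, start=1):
        label = f"{chunk['file']}, Section {chunk['section']}"
        lines.append(f"[{n}] ({label}) {chunk['text']}")
        citations.append(label)
    lines += ["", f"Question: {question}"]
    return "\n".join(lines), citations


# Toy corpus: vectors are made up for illustration only.
CORPUS = [
    {"file": "terms-of-service-v3.2.pdf", "section": "7.1",
     "text": "Digital products are refundable within 14 days.",
     "vector": [0.9, 0.1, 0.0]},
    {"file": "shipping.pdf", "section": "2",
     "text": "Orders ship within 3 business days.",
     "vector": [0.1, 0.9, 0.0]},
    {"file": "privacy.pdf", "section": "4",
     "text": "Data is stored in the EU.",
     "vector": [0.0, 0.1, 0.9]},
]

question = "What is our refund policy for digital products?"
question_vector = [1.0, 0.2, 0.0]  # stand-in for the embedded question

retrieved = top_k(question_vector, CORPUS, k=2)
prompt, citations = build_prompt(question, retrieved)
```

The prompt handed to the language model contains only the retrieved passages, each tagged with its file and section. Because the citation labels are built from the same retrieved chunks, they can be shown next to the answer for verification.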

Example Interaction

CorpusAI Chat your-company corpus

You: What is our refund policy for digital products?

CorpusAI: According to your Terms of Service (v3.2, Section 7.1), digital products are eligible for refund within 14 days of purchase, provided the product has not been downloaded or activated.

Source: terms-of-service-v3.2.pdf, Section 7.1

CorpusAI · qwen2.5:72b · 2.3s GPU

Trust the Source

Every answer links back to your documents

No black boxes. No hallucinated facts. Every response includes clickable citations — the exact document, section, and paragraph where the answer was found. Verify in seconds.

4. Choose Your AI Power

Every plan includes a capable AI. Higher plans unlock deeper reasoning.

Standard AI: Fine-Tuned for Norwegian Law

All plans
  • Trained on 4,754 Norwegian legal documents
  • Optimized for legal terminology and citation formats
  • Quick lookups, statute questions, and document search
  • Response time: 5–15 seconds
  • Included in every plan, including Starter

GPU AI: Deep Analysis with GPU Power

Pro & Enterprise
  • 27–72 billion parameter models for complex reasoning
  • Cross-reference multiple laws and precedents in a single answer
  • Process entire documents, not just excerpts
  • Response time: 2–5 seconds
  • Available on Pro and Enterprise plans

When to use which

Standard AI: “What does barneloven § 36 say?”
GPU AI: “Compare custody provisions in barneloven against recent Supreme Court practice and identify conflicts”

Standard AI: “Find our refund policy”
GPU AI: “Review this 40-page contract and flag all liability risks with severity assessment”

Standard AI: “What is the deadline for filing an appeal?”
GPU AI: “Analyze how NOU 2020:14 recommendations changed interpretation of barnevernsloven § 4-12 across three court decisions”

Need a model that thinks like your team?

Our Custom plan includes a dedicated AI trained on your documents — combining deep domain knowledge with real-time document search. We build and maintain the model for you. Contact sales →

What CorpusAI Doesn't Do

Not a Replacement for Human Judgment

CorpusAI helps you find information faster. Decisions are always yours to make.

Not Connected to the Internet

Answers come only from your uploaded documents. No web search, no external data leaking in.

Not Shared Between Clients

Your corpus is completely isolated. Other clients can never see or search your documents.

Not Storing Questions Permanently

Query logs are used only for billing and debugging. Your questions are not training data.

Ready to see it in action?

Start with a free trial — 100 queries, no credit card. Upload your first document in under 2 minutes.