Automate data capture in all your documents

Use LLMs and OCR technology to turn any document into structured data. Customizable, fast and reliable.

50 free credits

No credit card required

Data protection

Background gridBackground gridBackground grid

Over 10,000 users from leading companies rely on Koncile

logo Kuhn
logo Sightness
logo demathieu bard
logo SpendHQ
logo Giesper
logo Beal
logo Cosma
logo skyrm

Features

The complete tool for document automation

Custom Extractors

Define the exact fields you want to capture, fully tailored to your documents

API/SDK  integration

Seamlessly connect Koncile to your everyday tools with our full documentation

Confidence Score

Measure the accuracy of every extracted field and trust your data with confidence

Document Classification

Automatically recognize document types and route them to the right extractor

Intelligent Page Splitting

Automatically detect and separate multi-page or bundled documents into the right categories

Table Extraction

Capture every row and cell from complex tables with full customization

Data Categorization

Each data point is automatically matched to the right category for faster organization

Database Linking

Link your extracted data directly to your databases for instant use

Multi-format Exports

Generate outputs in the format you need ready for analysis and automation

Alerts

Get automatic alerts that flag inconsistencies and errors based on your business rules

Handwriting Recognition

Transform handwritten notes into structured, actionable data

Email Integration

Send unsorted attachments to a dedicated inbox

They trust us

Join 100+ clients who are streamlining their document management.

Philip Hamichi

Solution Architect at Grainz

"Thanks to Koncile, we capture key invoice data to enrich our management system. This makes it easier to manage collective catering, purchasing, and supplier monitoring."

Lire l’histoire

Alain Borri

Co-founder & CEO at Sightness

“Koncile has been a great help in extracting information from transport and logistics invoices for our clients.”

Lire l’histoire

Ludovic Belfadel

Purchasing Director at Demathieu Bard

“We had long dreamed of a tool that could automate the tedious task of invoice verification, since our P2P software doesn’t protect us against overbilling. Koncile efficiently extracts billing data and uncovers savings opportunities that had gone unnoticed by our teams for months.”

Lire l’histoire

Pierre Laprée

Founder & CPO at SpendHQ

“Koncile automates the intelligent extraction of structured contractual data. Despite the volume and complexity of our clients’ contracts, from major international groups, the tool delivers a level of quality and speed that saves us invaluable time.”

Lire l’histoire

Anne-Laure Chemin

CFO at Béal

“Koncile is a highly effective tool for extracting complex information from our transport invoices. Thanks to this tool, we analyze billing data each month and recover significant credits from our suppliers.”

Lire l’histoire

Olivier Hugo

Managing Director at Cosma Experts

“Koncile helps us process thousands of client documents and speed up our audits. It’s an intuitive, reliable OCR tool that handles even the most complex files with ease.”

Lire l’histoire

Charles Lacroux

Head of Venture at RewardPulse

“Thanks to Koncile, processing receipts no longer requires a whole team: one person is enough, and invoice validation is three times faster.”

Lire l’histoire

Mike Oskarson

CFO at Mark Young Construction

“We thought such a project would take months. In just a few days, Koncile was live, automating our insurance certificate extraction and strengthening compliance across sites.”

Lire l’histoire

Timothée Berthault

Project Manager at Nona

“The adaptability of Koncile was a real plus. It’s a simple yet powerful tool that let us boost efficiency without complicating our methods. We can configure accounts much faster than before, and chefs can get to grips with the software more quickly.”

Lire l’histoire

Start with a pre-built template

And customize it to perfectly match your data extraction needs.

BROWSE ALL TEMPLATES
Bank account details thumbnails

Accounting

Account holder's details, bank, and IBAN

An invoice with a dollar sign

Procurement

Supplier details and line-by-line details

Quote thumbnails

Procurement

Supplier information and line-by-line details

Identity document thumbnails

Compliance / KYC

Passports, ID cards, driving licenses, and identity documents

Logo de Koncile.

Try Koncile now

Create your extraction templates, test on a first document, and scale up with ease!

Security by design

Choose the secure OCR extraction tool

No Training on
your Data

Secure Data Storage

Fully GDPR Compliant

Advanced Encryption

Got more questions?

Need further assistance? Contact us at contact@koncile.ai, check out our documentation or book a demo.

What is an OCR software?

OCR (Optical Character Recognition) is a technology that allows different types of documents, such as scanned images, PDF files, or photos of text, to be converted into editable and searchable text data. In other words, OCR transforms an image containing text into a text file that you can edit.

This technology works by analyzing the image of a text, identifying individual characters and their layout, and then converting them into editable text. An OCR software typically uses artificial intelligence and machine learning algorithms to improve recognition accuracy.

"Traditional" OCR software simply transcribes raw text. However, advanced solutions like Koncile OCR go beyond simple transcription. They do not merely convert all the text in a document into data. Thanks to AI integration, particularly LLMs (large language models), these tools can identify and extract the specific data the user is looking for.

For example, in an invoice, Koncile can automatically find and extract the total amount, supplier name, date, line item details (products, quantities, unit prices), VAT numbers, and much more. Koncile understands the document and extracts relevant information in a structured way, ready to be used in other systems (accounting, ERP, etc.). This is known as intelligent data extraction.

What is Koncile?

Koncile is a French startup reinventing the management of unstructured documents in businesses. Our AI-powered SaaS solution automates data extraction from all types of documents. We combine a cutting-edge OCR engine (Optical Character Recognition) with LLMs (large language models) to transform raw, often unusable data into structured, ready-to-use information.

The Koncile tool is, above all, a simple interface accessible to everyone, allowing users to define the fields to capture in their documents. Once you have selected your fields, you can integrate the extracted data into your systems using our API / SDK.

How does data extraction work with Koncile?

The data extraction process with Koncile can be broken down into 3 steps:

  1. Pre-processing (Image Optimization): If the document is an image (scan, photo), Koncile improves it to facilitate text recognition. It can straighten the document, remove imperfections, adjust contrast, etc. The goal is to obtain the clearest possible image.
  2. Advanced OCR (Reading and Structuring): Koncile's OCR engine "reads" the text from the image and converts it into digital text. This OCR is "advanced" because it is optimized with machine learning, making it highly accurate. It doesn’t just recognize letters—it also understands the document's structure (tables, columns, paragraphs) to organize the information.
  3. LLM (Intelligent Understanding and Extraction): LLMs (large language models) analyze the text extracted by the OCR. They understand the meaning of words and sentences, allowing the system to find the specific information the user is looking for, such as the total amount of an invoice or the supplier's name, with the highest reliability.

In summary: Koncile cleans the image, reads the text and understands the structure, then comprehends the meaning to find the relevant information the user is seeking.

What are the benefits of an OCR solution?

An OCR (Optical Character Recognition) solution transforms the way businesses can utilize scanned documents, PDFs, or images. It allows businesses to make use of data that would often be lost. The main benefits are:

  • Major time savings: Automation of manual data entry, targeted extraction of relevant information. No need to search or retype.
  • Drastic reduction of errors: Minimization of human errors, more reliable data.
  • Increased productivity: Faster document processing, employees focused on value-added tasks instead of manual data entry.
  • Creation of usable databases: Transformation of unstructured documents (paper, PDFs, images) into structured data, ready for analysis and decision-making.
  • Process optimization: Better data, faster, helps improve the overall operations of the company.
  • Better decision-making, easier compliance, and a competitive advantage through optimized information management.

Thanks to LLMs, Koncile goes beyond traditional OCR by understanding the content of documents, enabling intelligent extraction and easy integration with your existing tools.

What types of documents can be processed by OCR?

An OCR software, especially an advanced solution like Koncile, can process a wide variety of documents. You can start from our library with all our document templates. Some of the most popular templates include OCR for invoices, OCR for identity documents, and OCR for bank statements. Here's a list of documents that can be processed by OCR:

Common professional documents:

  • Invoices: Supplier invoices, customer invoices, regardless of format (paper, PDF, image) or layout.
  • Purchase Orders: Extraction of product details, quantities, prices, etc.
  • Delivery Notes: Verification of received goods, tracking of deliveries.
  • Contracts: Extraction of key clauses, due dates, and stakeholders.
  • HR Documents: Summaries, cover letters, hiring forms, performance evaluations.
  • Legal Documents: Leases, non-disclosure agreements, various legal documents.
  • Financial Documents: Bank statements, transfer orders, financial reports.
  • Marketing Documents: Contact forms, survey responses, coupons.
  • Logistics Documents: Bills of lading, transport contracts, road or sea transport invoices.

Handwritten documents:

  • Handwritten forms: Questionnaires, surveys, etc.
  • Handwritten notes: Notes taken during meetings, annotations on documents.
  • Medical prescriptions: Koncile is particularly effective in this area.
  • Handwritten tables
  • Handwritten lists

Other types of documents:

  • Digitized documents: Scanned paper archives (books, newspapers, historical documents).
  • Photos of documents: Taken with a smartphone or camera.
  • Screenshots: Containing text.
  • PDF files: “Image” PDFs (scans) and native PDFs (generated by software).
  • Technical documents: Product sheets, manuals.
  • Multilingual documents: From any country and written in any language.

How does Koncile OCR handle poor-quality documents (blurry, poorly scanned)?

Koncile has a state-of-the-art OCR engine, optimized by machine learning. This engine is specifically trained to convert images into text with maximum accuracy, even when faced with documents of varying quality, unusual fonts, or complex layouts. It doesn't just read characters; it also analyzes the structure of the document (tables, columns) to faithfully reproduce the content.

Thanks to the integration of LLMs (large language models), the Koncile tool can overcome the traditional shortcomings of OCR engines when translating images into text. These AI models understand the context, allowing them to confirm or even infer certain information, even when a character is difficult to read or ambiguous. By relying on the overall meaning of the sentence or document, the LLMs surpass the limitations of a traditional OCR.

Can Koncile OCR read handwriting?

Yes, Koncile’s OCR handwriting capabilities allow it to accurately read handwritten content by combining advanced AI with large language models (LLMs). This makes it especially effective for extracting data from prescriptions, signatures, handwritten notes, as well as tables and forms filled out by hand. Each recognition is accompanied by a confidence score to indicate reliability, as handwriting is inherently more variable than printed text.

How can the Koncile OCR solution automate accounting tasks?

Koncile's OCR automates accounting tasks, including automatic categorization and reconciliation, by transforming a manual process into an efficient digital workflow:

  1. End of manual entry: Automatic extraction of data from various accounting documents (invoices, expense reports, bank statements, etc.).
  2. Intelligent extraction: Koncile understands the document and extracts key information (amounts, dates, supplier/client details, line item details, etc.), not just raw text.
  3. Structured data: The data is organized in a format compatible with accounting software (JSON, CSV, XLSX).
  4. Software integration: Automatic transfer of data to major accounting software (Sage, Cegid, etc.) via API or connectors.
  5. Advanced automation: Automatic categorization of transactions, automated bank reconciliation, and customizable workflows (e.g., automatic approval based on amount).

In short, Koncile automates the collection, extraction, structuring, integration, categorization, and reconciliation of accounting data, freeing up time for higher-value tasks.

How can an OCR solution automate transport and logistics management?

The Koncile OCR solution enables automated processing of key documents in transport and logistics:

  • Automatic extraction of essential data from delivery notes, bills of lading, transport invoices, proof of delivery (POD), and customs documents. No more manual entry, fewer errors.
  • The information is immediately usable, whether it's from shippers, recipients, tracking numbers, or product descriptions.

Thanks to AI, Koncile can extract information regardless of the document's format.

How can I integrate a data capture tool into my existing systems?

Integrating a tool like Koncile into your existing systems is mainly done through:

  1. API and SDK: Flexible, customizable, and automated, but requires technical expertise. Ideal for full and real-time integration.
  2. Pre-built connectors: Easy and quick to set up for popular applications (e.g., Zapier, accounting software).
  3. File exports (CSV, XLSX, JSON): Simple but manual and lacks real-time automation.

Does Koncile's OCR solution suit businesses of all sizes?

Koncile adapts to all sizes of businesses, from freelancers to multinationals, with two types of plans:

  1. Flexible subscriptions (by volume): Ideal for VSEs/SMEs, with a cost adjusted to the number of pages processed monthly. Maximum flexibility: you choose the volume as well as the duration of your commitment (monthly or annual).
  2. Enterprise solutions (tailor-made): For large accounts, with unlimited volume, advanced features, dedicated support, and personalized pricing.

Koncile offers a scalable solution, adapted to your budget and your growth.

How are my data secured?

Your data is secured with Koncile through a "security by design" approach:

  • No use of your data for training AI models.
  • Full encryption of the application.
  • Secure data storage (protected servers).
  • GDPR compliance ensured.

What is the difference between Koncile, scraping, and parsing?

Koncile is primarily an invoice parsing tool, but it uses techniques that can resemble scraping in certain situations. It's important to understand the difference:

  • Scraping: Extracting unstructured data from web pages (e.g., retrieving prices from an e-commerce site).
  • Parsing: Extracting structured data from documents with a known format (e.g., extracting the number and date from a PDF invoice).

What is the processing time for a document by an OCR?

The processing time for a document by Koncile's OCR is generally very fast (1 to 2 seconds), but it can vary depending on:

  • Document quality: A clear document is processed faster.
  • Complexity: A simple invoice is processed more quickly.
  • Format: Native PDFs are the fastest.
  • Amount of information: The more fields to extract, the longer it takes.
  • Subscription type: Business subscriptions are faster.

On average:

  • Simple invoice (< 3 pages, native PDF): a few seconds.
  • Complex invoice: 5-15 seconds.

Does Koncile handle multilingual documents and different currencies?

Koncile's OCR handles multilingual and international documents:

  • Recognition of multiple languages: Latin, Cyrillic, Greek alphabets, and ideograms (Chinese, Japanese, etc.), thanks to AI and LLMs. Automatic language detection in most cases.
  • Date and number formats: Koncile recognizes and interprets different international formats (DD/MM/YYYY, MM/DD/YYYY, thousand separators, etc.). Dates are reformatted for machine reading.
  • Currencies: Correct identification and extraction of amounts, even with various currency symbols (€ , $, £, ¥, etc.).

Can Koncile process tables and lists?

Yes, Koncile accurately handles tables and lists, even complex ones, thanks to advanced OCR with robust table detection. It identifies rows, columns, and cells, and uses AI (LLMs) to understand context, manage merged cells, and extract data in a structured, meaningful way. By combining OCR and LLMs, Koncile ensures reliable parsing of visually complex tables.

Can I customize the data extraction?

Yes, Koncile offers advanced customization for data extraction, allowing you to tailor it precisely to your needs. Through an intuitive interface, you can easily define the information to be extracted without requiring technical skills. It’s possible to create custom fields, such as "Contract Number," "Customer Reference," or "Due Date," and assign each one a specific data type (text, number, date, amount, email address, etc.). This helps optimize extraction and ensure data validity. Additionally, you can guide the algorithm with extraction rules, such as specifying that the VAT number is always near a certain keyword.

Can I train Koncile to recognize document types specific to my business?

Yes, Koncile allows you to train the platform to recognize document types specific to your business. You can define the key fields to extract based on each document type, ensuring precise and tailored extraction to meet your needs. Each document can have a different extraction model, optimizing the retrieval of relevant data without requiring complex configuration.

How can I control the quality of the data extracted by the OCR?

Koncile has a confidence score system that allows you to assess the reliability of the extracted data. This score takes into account several factors, including the readability of the text on the document, the complexity of the query, and the volume of data to process. For example, when a document contains a large amount of information, the extraction quality may be affected. The algorithm analyzes both visual aspects (image quality, text clarity) and semantic aspects (content coherence, contextual recognition) to produce a combined confidence score, helping you identify the most reliable data.

How is my data protected?

Data protection and security are Koncile's priorities. In line with our Security and Privacy Policy, all processing is carried out on ISO 27001-certified servers based in France. For Enterprise plans, deployment on a private cloud is available. Contact us to learn more.

What is Koncile’s pricing structure?

Koncile offers three plans, including enterprise options for handling large data volumes. Check out our pricing on the dedicated page in our documentation.

What are “General Fields” and “Repeated Fields”?

In each extraction template, you’ll find:

General fields: These are pieces of information that appear only once per document (e.g., an invoice number or date).

Repeated fields: These are elements that appear multiple times within a document, such as item descriptions or prices in each line of a quote. Use repeated fields to extract tables and structured data from your documents.

I need to parse a document that’s very specific to my industry. 
Can Koncile handle it?

Of course! Start by exploring our library of pre-built templates, covering a wide range of industries. If you don’t find one that fits your needs, you can easily create your own custom model.

What file formats does Koncile support?

Koncile allows you to import PDF files and all common image formats, including PNG and JPEG.

Can I extract a specific field?

Absolutely! Koncile’s OCR extraction service offers fully customizable fields. Our pre-built models are just a starting point—you can modify and add your own fields to meet your specific needs.

They trust us

Climbing Beal testimonial

“Koncile is a highly effective tool for extracting complex data from our transport invoices. With it, we can seamlessly analyze billing data every month and recover significant credit notes from our suppliers.”

Spend HQ testimonial

“Koncile automates the intelligent extraction of structured contract data. Despite the volume and complexity of our clients’ contracts—many from large international corporations—the tool delivers unmatched speed and quality, saving us invaluable time.”

demathieu bard construction visual

“We had long been looking for a tool to automate the tedious task of invoice verification, as our P2P software didn’t protect us from overbilling. Koncile efficiently extracts billing data and uncovers cost-saving opportunities that had gone unnoticed by our teams for months.”