Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Updated May 23, 2024 (Python)
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
A curated list of resources on the Document Understanding (DU) topic
A Repo For Document AI
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)
Document AI Toolbox is an SDK for Python that provides utility functions for managing, manipulating, and extracting information from the document response. It creates a "wrapped" document object from JSON files in Cloud Storage, local JSON files, or output directly from the Document AI API.
Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023
SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI 2023)
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents. ICDAR, 2021"
This repository includes all computer vision, audio, document AI, and multimodal projects.
Table detection and table structure recognition using YOLOv5
OCR Runner - Command Line Application for processing image files using Google Cloud Vision API and Google Cloud Document AI.
SamKenX applications and Document AI, the end-to-end document processing platform on Cloud Storage warehouse.
ReadingBank: A Benchmark Dataset for Reading Order Detection
Algorithms, papers, datasets, and performance comparisons for Document AI. Continuously updated.
AI & Data, Google Cloud Skills Boost
A hands-on CLI tool sample showcasing the integration of Dart with Google Cloud's Document AI.
Create an Identity Auto-Filler API with Google Cloud Document AI