OCR-paper-arxiv-daily latest papers

Automated deployment @ 2023-06-07 08:05:21 Asia/Shanghai

Contributions are welcome: add your topics and keywords to topic.yml. Historical data is also available through the storage.

OCR

| Publish Date | Title | Authors | PDF | Code |
|---|---|---|---|---|
| 2023-06-05 | Transformer-Based UNet with Multi-Headed Cross-Attention Skip Connections to Eliminate Artifacts in Scanned Documents | David Kreuzer et al. | 2306.02815v1 | null |
| 2023-06-03 | TransDocAnalyser: A Framework for Offline Semi-structured Handwritten Document Analysis in the Legal Domain | Sagar Chakraborty et al. | 2306.02142v1 | link |
| 2023-06-02 | DocFormerv2: Local Features for Document Understanding | Srikar Appalaraju et al. | 2306.01733v1 | null |
| 2023-06-01 | Layout and Task Aware Instruction Prompt for Zero-shot Document Image Question Answering | Wenjin Wang et al. | 2306.00526v1 | link |
| 2023-05-31 | Improving Handwritten OCR with Training Samples Generated by Glyph Conditional Denoising Diffusion Probabilistic Model | Haisong Ding et al. | 2305.19543v1 | null |
| 2023-05-30 | DuoSearch: A Novel Search Engine for Bulgarian Historical Documents | Angel Beshirov et al. | 2305.19392v1 | link |
| 2023-05-29 | GlyphControl: Glyph Conditional Control for Visual Text Generation | Yukang Yang et al. | 2305.18259v1 | link |
| 2023-05-28 | FuseCap: Leveraging Large Language Models to Fuse Visual Data into Enriched Image Captions | Noam Rotstein et al. | 2305.17718v1 | link |
| 2023-05-27 | Exploring Better Text Image Translation with Multimodal Codebook | Zhibin Lan et al. | 2305.17415v2 | link |
| 2023-05-27 | Super-Resolution of License Plate Images Using Attention Modules and Sub-Pixel Convolution Layers | Valfride Nascimento et al. | 2305.17313v1 | link |
| 2023-05-26 | People and Places of Historical Europe: Bootstrapping Annotation Pipeline and a New Corpus of Named Entities in Late Medieval Texts | Vít Novotný et al. | 2305.16718v1 | null |
| 2023-05-24 | Quantifying Character Similarity with Vision Transformers | Xinmei Yang et al. | 2305.14672v1 | link |
| 2023-05-21 | Measuring Intersectional Biases in Historical Documents | Nadav Borenstein et al. | 2305.12376v1 | link |
| 2023-05-19 | XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages | Sebastian Ruder et al. | 2305.11938v2 | link |
| 2023-05-18 | TextDiffuser: Diffusion Models as Text Painters | Jingye Chen et al. | 2305.10855v2 | link |
| 2023-05-16 | Sequence-to-Sequence Pre-training with Unified Modality Masking for Visual Document Understanding | Shuwei Feng et al. | 2305.10448v1 | null |
| 2023-05-16 | Mobile User Interface Element Detection Via Adaptively Prompt Tuning | Zhangxuan Gu et al. | 2305.09699v1 | link |
| 2023-05-13 | On the Hidden Mystery of OCR in Large Multimodal Models | Yuliang Liu et al. | 2305.07895v2 | link |
| 2023-05-12 | Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution | Jianfeng Kuang et al. | 2305.07498v1 | link |
| 2023-05-11 | Combining OCR Models for Reading Early Modern Printed Books | Mathias Seuret et al. | 2305.07131v1 | link |
| 2023-05-09 | E2TIMT: Efficient and Effective Modal Adapter for Text Image Machine Translation | Cong Ma et al. | 2305.05166v2 | link |
| 2023-05-04 | Text Reading Order in Uncontrolled Conditions by Sparse Graph Segmentation | Renshen Wang et al. | 2305.02577v1 | null |
| 2023-05-03 | Evaluating BERT-based Scientific Relation Classifiers for Scholarly Knowledge Graph Construction on Digital Library Collections | Ming Jiang et al. | 2305.02291v1 | null |
| 2023-04-28 | LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model | Peng Gao et al. | 2304.15010v1 | link |
| 2023-04-24 | DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents | Mohamed Dhouib et al. | 2304.12484v2 | null |
| 2023-04-24 | ICDAR 2023 Competition on Reading the Seal Title | Wenwen Yu et al. | 2304.11966v2 | null |
| 2023-04-17 | Multimodal Short Video Rumor Detection System Based on Contrastive Learning | Yuxing Yang et al. | 2304.08401v3 | null |
| 2023-04-15 | TransDocs: Optical Character Recognition with word to word translation | Abhishek Bamotra et al. | 2304.07637v1 | link |
| 2023-04-07 | Linking Representations with Multimodal Contrastive Learning | Abhishek Arora et al. | 2304.03464v2 | null |
| 2023-04-07 | Cleansing Jewel: A Neural Spelling Correction Model Built On Google OCR-ed Tibetan Manuscripts | Queenie Luo et al. | 2304.03427v1 | null |

scene text

| Publish Date | Title | Authors | PDF | Code |
|---|---|---|---|---|
| 2023-06-05 | Neuralangelo: High-Fidelity Neural Surface Reconstruction | Zhaoshuo Li et al. | 2306.03092v1 | null |
| 2023-06-05 | Brain Diffusion for Visual Exploration: Cortical Discovery using Large Scale Generative Models | Andrew F. Luo et al. | 2306.03089v1 | null |
| 2023-06-05 | Machine Learning and Statistical Approaches to Measuring Similarity of Political Parties | Daria Boratyn et al. | 2306.03079v1 | null |
| 2023-06-05 | Interactive Editing for Text Summarization | Yujia Xie et al. | 2306.03067v1 | link |
| 2023-06-05 | Of Mice and Mates: Automated Classification and Modelling of Mouse Behaviour in Groups using a Single Model across Cages | Michael P. J. Camilleri et al. | 2306.03066v1 | null |
| 2023-06-05 | Structured Voronoi Sampling | Afra Amini et al. | 2306.03061v1 | null |
| 2023-06-05 | ELEV-VISION: Automated Lowest Floor Elevation Estimation from Segmenting Street View Images | Yu-Hsuan Ho et al. | 2306.03050v1 | null |
| 2023-06-05 | Designing Equilibria in Concurrent Games with Social Welfare and Temporal Logic Constraints | Julian Gutierrez et al. | 2306.03045v1 | null |
| 2023-06-05 | HeadSculpt: Crafting 3D Head Avatars with Text | Xiao Han et al. | 2306.03038v1 | null |
| 2023-06-05 | Tackling Cooperative Incompatibility for Zero-Shot Human-AI Coordination | Yang Li et al. | 2306.03034v1 | null |
| 2023-06-05 | Interpretable Alzheimer's Disease Classification Via a Contrastive Diffusion Autoencoder | Ayodeji Ijishakin et al. | 2306.03022v1 | null |
| 2023-06-05 | Automating Style Analysis and Visualization With Explainable AI -- Case Studies on Brand Recognition | Yu-hsuan Chen et al. | 2306.03021v1 | link |
| 2023-06-05 | Using Sequences of Life-events to Predict Human Lives | Germans Savcisens et al. | 2306.03009v1 | null |
| 2023-06-05 | Nonparametric Iterative Machine Teaching | Chen Zhang et al. | 2306.03007v1 | null |
| 2023-06-05 | Unveiling the Two-Faced Truth: Disentangling Morphed Identities for Face Morphing Detection | Eduarda Caldeira et al. | 2306.03002v1 | link |
| 2023-06-05 | BeyondPixels: A Comprehensive Review of the Evolution of Neural Radiance Fields | AKM Shahariar Azad Rabby et al. | 2306.03000v1 | null |
| 2023-06-05 | Long-range UAV Thermal Geo-localization with Satellite Imagery | Jiuhong Xiao et al. | 2306.02994v1 | link |
| 2023-06-05 | Second-scale rotational coherence and dipolar interactions in a gas of ultracold polar molecules | Philip D. Gregory et al. | 2306.02991v1 | null |
| 2023-06-05 | Integrated Sensing, Computation, and Communication for UAV-assisted Federated Edge Learning | Yao Tang et al. | 2306.02990v1 | null |
| 2023-06-05 | Brain tumor segmentation using synthetic MR images -- A comparison of GANs and diffusion models | Muhammad Usman Akbar et al. | 2306.02986v1 | null |
| 2023-06-05 | A Term-based Approach for Generating Finite Automata from Interaction Diagrams | Erwan Mahe et al. | 2306.02983v1 | null |
| 2023-06-05 | Which Argumentative Aspects of Hate Speech in Social Media can be reliably identified? | Damián Furman et al. | 2306.02978v1 | link |
| 2023-06-05 | Best of Both Worlds: Hybrid SNN-ANN Architecture for Event-based Optical Flow Estimation | Shubham Negi et al. | 2306.02960v1 | null |
| 2023-06-05 | Complex Preferences for Different Convergent Priors in Discrete Graph Diffusion | Alex M. Tseng et al. | 2306.02957v1 | null |
| 2023-06-05 | Explicit Neural Surfaces: Learning Continuous Geometry With Deformation Fields | Thomas Walker et al. | 2306.02956v1 | null |
| 2023-06-05 | A Simple and Flexible Modeling for Mental Disorder Detection by Learning from Clinical Questionnaires | Hoyun Song et al. | 2306.02955v1 | null |
| 2023-06-05 | Color-aware Deep Temporal Backdrop Duplex Matting System | Hendrik Hachmann et al. | 2306.02954v1 | null |
| 2023-06-05 | INDigo: An INN-Guided Probabilistic Diffusion Algorithm for Inverse Problems | Di You et al. | 2306.02949v1 | null |
| 2023-06-05 | Continual Learning with Pretrained Backbones by Tuning in the Input Space | Simone Marullo et al. | 2306.02947v1 | null |
| 2023-06-05 | Human Spine Motion Capture using Perforated Kinesiology Tape | Hendrik Hachmann et al. | 2306.02930v1 | link |