Document Type
Article
Publication Date
2022
Publication Source
IEEE Access
Abstract
Document image understanding is increasingly useful since the number of digital documents is increasing day-by-day and the need for automation is increasing. Object detection plays a significant role in detecting vital objects and layouts in document images and contributes to providing a clearer understanding of the documents. Nonetheless, previous research mainly focuses on English document images, and studies on Vietnamese document images are limited. In this study, we extensively benchmark state-of-the-art object detectors and analyze the performance of each method on Vietnamese document images. Moreover, we also investigate the effectiveness of four different loss functions on the experimental object detection methods. Extensive experiments on the UIT-DODV dataset are conducted to provide insightful discussions.
Inclusive pages
108046-108066
ISBN/ISSN
2169-3536
Document Version
Published Version
Publisher
IEEE-INST Electrical Electronics Engineers Inc
Volume
10
Peer Reviewed
yes
Sponsoring Agency
VNUHCM-University of Information Technology's Scienti~c Research Support Fund
eCommons Citation
Nguyen, Khang; Nguyen, An; Vo, Nguyen D.; and Nguyen, Tam, "Vietnamese Document Analysis: Dataset, Method and Benchmark Suite" (2022). Computer Science Faculty Publications. 193.
https://ecommons.udayton.edu/cps_fac_pub/193
Comments
This open-access article is provided for download in compliance with the publisher’s policy on self-archiving. To view the version of record, use the DOI: https://doi.org/10.1109/ACCESS.2022.3211069