Keine peer reviewed Publikation (Wissenschaftlicher Artikel und Aufsatz, Proceeding, Artikel in Tagungsband)
Refine
Year of publication
Document Type
- Conference Proceeding (275)
- Article (129)
- Part of a Book (97)
- Working Paper (27)
- Other Publications (3)
- Report (2)
Language
- English (309)
- German (222)
- Multiple languages (2)
Has Fulltext
- no (533) (remove)
Keywords
- 360-degree coverage (1)
- 3D Extended Object Tracking (EOT) (2)
- 3D Skelett Wickeltechnik (1)
- 3D ship detection (1)
- AAL (3)
- ASEAN (1)
- Abrasive grain material (1)
- Academic german (1)
- Accelerometer (1)
- Accelerometer sensor (1)
Institute
- Fakultät Architektur und Gestaltung (5)
- Fakultät Bauingenieurwesen (14)
- Fakultät Elektrotechnik und Informationstechnik (7)
- Fakultät Informatik (27)
- Fakultät Maschinenbau (10)
- Fakultät Wirtschafts-, Kultur- und Rechtswissenschaften (27)
- Institut für Angewandte Forschung - IAF (32)
- Institut für Optische Systeme - IOS (9)
- Institut für Strategische Innovation und Technologiemanagement - IST (26)
- Institut für Systemdynamik - ISD (53)
Fast and reliable acquisition of truth data for document analysis using cyclic suggest algorithms
(2019)
In document analysis the availability of ground truth data plays a crucial role for the success of a project. This is even more true at the rise of new deep learning methods which heavily rely on the availability of training data. But even for traditional, hand crafted algorithms that are not trained on data, reliable test data is important for the improvement and evaluation of the methods. Because ground truth acquisition is expensive and time consuming, semi-automatic methods are introduced which make use of suggestions coming from document analysis systems. The interaction between the human operator and the automatic analysis algorithms is the key to speed up the process while improving the quality of the data. The final confirmation of data may always be done by the human operator. This paper demonstrates a use case for acquisition of truth data in a mail processing system. It shows why a new, extended view on truth data is necessary in development and engineering of such systems. An overview over the tool and the data handling is given, the advantages in the workflow are shown, and consequences for the construction of analysis algorithms are discussed. It can be shown that the interplay between suggest algorithms and human operator leads to very fast truth data capturing. The surprising finding is the fact that if multiple suggest algorithms circularly depend on data, they are especially effective in terms of speed and accuracy.