• No results found

Keypoint-based scene-text detection and character classification using color and gradient features

N/A
N/A
Protected

Academic year: 2021

Share "Keypoint-based scene-text detection and character classification using color and gradient features"

Copied!
2
0
0

Bezig met laden.... (Bekijk nu de volledige tekst)

Hele tekst

(1)

University of Groningen

Keypoint-based scene-text detection and character classification using color and gradient features

Sriman, Bowornrat

DOI:

10.33612/diss.118694101

IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document version below.

Document Version

Publisher's PDF, also known as Version of record

Publication date: 2020

Link to publication in University of Groningen/UMCG research database

Citation for published version (APA):

Sriman, B. (2020). Keypoint-based scene-text detection and character classification using color and gradient features. University of Groningen. https://doi.org/10.33612/diss.118694101

Copyright

Other than for strictly personal use, it is not permitted to download or to forward/distribute the text or part of it without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license (like Creative Commons).

Take-down policy

If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately and investigate your claim.

Downloaded from the University of Groningen/UMCG research database (Pure): http://www.rug.nl/research/portal. For technical reasons the number of authors shown on this cover page is limited to 10 maximum.

(2)

Propositions

1. If it is difficult to extract characters from a plain document image, then the application of this operator to a scene image in an uncontrolled environment will not be easier (Chapter 1).

2. Keypoint-descriptor vectors and keypoint locations of SIFT are useful for both character-shape recognition and for scene-text detection in Thai script (Chapter 2, Fig 2.10).

3. Applying autocorrelation function to a color space is robust to the lighting condition (Chapter 4 Fig. 4.2, Fig. 4.4). 4. Color is useful for the distinction between homogeneous and inhomogeneous image regions, thereby supporting object detection and classification (Chapter 4). 5. A Gaussian blur with small sigma is sufficient to homogenize patchy object areas for the general text-chunk localization in scene images. (Chapter 5, Fig. 5.12). 6. Heterogeneous features achieve higher classification performance than homogeneous features and are applicable to a variety of object classification tasks (Chapter 3 and 4). 7. The advantage of explicit feature methods and k-means clustering is that the computational load and imbalance between target and background patterns can be optimized in detail (Chapter 4).

Referenties

GERELATEERDE DOCUMENTEN

Scale-variant and scale-invariant image representations Training CNNs on large image collections that often exhibit variations in image resolution, depicted object sizes, and

Full body pose estimation: Comparison of body part detection rates and evaluation of different components of the model on the “Iterative Image Parsing” (IIP) dataset [15]

Using the generic HOG detection algorithm, we train a different detector for each class from the positive object samples and a common set of negative samples in the form of

We have evaluated the performance of the proposed skin detector versus an existing color detector [3] defined in HSV color space and versus a histogram-based

Best linear approximation H BLA , measured and calculated using the broadband excitation approach, as a function of input amplitude and frequency.. SINGLE

To provide increased flexibility it is also possible to import external spike templates into a recording. This feature allows for the creation of ground-truth data without the need

Figure S3: Average performance as a function of the fraction of the data that is deferred to a human annotator, for the CI SVM (p low = 5), with trust scores (trust) from a trust

The search for the optimal Λ involves evaluations of the dual objective function, i.e. maximizations of the Lagrangian, which is decoupled over the tones for a given set of λi ’s.