روش رگرسیون خطی افتراقی برای سازگاری طبقه بندی بر اساس چند نمونه و برنامه های کاربردی آن برای او سی آر چینی
|کد مقاله||سال انتشار||مقاله انگلیسی||ترجمه فارسی||تعداد کلمات|
|24650||2013||10 صفحه PDF||سفارش دهید||محاسبه نشده|
Publisher : Elsevier - Science Direct (الزویر - ساینس دایرکت)
Journal : Pattern Recognition, Volume 46, Issue 8, August 2013, Pages 2313–2322
This paper presents a new discriminative linear regression approach to adaptation of a discriminatively trained prototype-based classifier for Chinese OCR. A so-called sample separation margin based minimum classification error criterion is used in both classifier training and adaptation, while an Rprop algorithm is used for optimizing the objective function. Formulations for both model-space and feature-space adaptation are presented. The effectiveness of the proposed approach is confirmed by a series of experiments for adaptation of font styles and low-quality text, respectively.
With the fast development of mobile internet, OCR-based applications are becoming increasingly more popular (e.g., , ,  and ). However, the most off-the-shelf OCR engines were trained on scanned documents, and they may not work well for new application scenarios where the properties of the captured character images are significantly different from the ones in the training data set (e.g., ). One of the solutions to address this problem is to adapt a pre-trained classifier to deal with the new scenario by using the document to be recognized itself via an unsupervised adaptation strategy, or by using a small amount of adaptation data collected in the target scenario via a supervised adaptation strategy. Using adaptation to improve the OCR accuracy has been a research topic for several decades. Some ingenious ideas specific to OCR have been tried. For example, Nagy et al.  and  demonstrated that a character classifier trained on many typefaces can be adapted effectively to text in a single unknown typeface by using a self-adaptation strategy. Hong  showed how to use an adaptation strategy that alternates between applying “visual constraints” and “linguistic constraints” to reduce errors for recognizing books printed in a single typeface. Authors in , , ,  and  investigated a family of style-conscious algorithms to improve the recognition accuracy on documents which contain only a few typefaces and limited variations in image qualities and other variabilities. More recently, Xiu and Baird  demonstrated how to use a self-adaptation technique to improve the whole-book recognition, where a so-called iconic model and a linguistic model are mutually leveraged and adapted. In the past several decades, many effective adaptation techniques have also been invented for speaker adaptation in the area of speech recognition (see an overview paper  and the references therein) and for writer adaptation in the area of handwriting recognition (see a recent work in  and  and the references therein). Some of them have been applied to OCR adaptation. For example, in , a continuous density hidden Markov model (CDHMM) based English character recognizer was adapted to deal with unseen fonts with two popular adaptation algorithms, namely maximum-likelihood linear regression (MLLR)  and  and maximum a posteriori (MAP) estimation , which were developed originally in speech recognition area for speaker adaptation. Apparently, the style transfer mapping (STM) technique proposed in  and  for writer adaptation can also be used for OCR adaptation. In this paper, we study the adaptation techniques for Chinese OCR. One of the state-of-the-art techniques to build a Chinese OCR engine is to use a discriminatively trained prototype-based classifier as reported in . In spite of the large vocabulary of Chinese characters, such a classifier can be made both compact (e.g.,  and ) and efficient in the recognition stage (e.g., ). A high recognition accuracy can be achieved by using Gabor features, LDA (linear discriminant analysis) or MCE (minimum classification error) based discriminative feature extraction, and MCE-based classifier parameter training . Recently, a so-called sample separation margin (SSM) based MCE training approach was proposed in  for training prototype-based classifiers, which performs better than the MCE training approach in . In , the SSM-MCE training approach has been used to construct a state-of-the-art compact prototype-based handwritten Chinese character recognizer where a batch-mode Quickprop algorithm  is used for optimizing the SSM-MCE objective function. In , the SSM-MCE formulation has been extended to training pattern classifiers with quadratic discriminant functions (QDF), including the “modified quadratic discriminant function (MQDF)”  popular in the areas of OCR and handwriting recognition for East Asian languages. In this study, we have built our baseline classifier for Chinese OCR by using the techniques described in ,  and  with a minor difference: we used an Rprop algorithm (e.g.,  and ) to optimize the SSM-MCE objective function because the setting of control parameters is much easier than the Quickprop algorithm used in . The main contribution of this paper is to propose a new SSM-MCE linear regression (LR) approach to adaptation of an SSM-MCE trained prototype-based classifier and demonstrate its effectiveness for Chinese OCR as an illustrative example. Formulations for both model-space and feature-space adaptation are presented. In terms of general concept, our work is related to the MCE-LR approach reported in  for speaker adaptation of CDHMM-based speech recognizer, where a traditional MCE objective function is used. Our work is also relevant to the STM work on writer adaptation for handwritten Chinese character recognition reported in  and , where a similar MCE training approach as in  is used to train a prototype-based classifier, but a least regularized weighted squared error approach is used to estimate a global feature transform (a.k.a. style transfer mapping (STM)) for writer adaptation. The adaptation capability of the STM approach is similar to our feature-space adaptation approach, but is inferior to our model-space adaptation approach because multiple transforms can be used for model adaptation. Even for feature-space approach, our experimental results show that our approach performs significantly better than the original STM approach in  for both supervised and unsupervised adaptation of font styles and low-quality text, respectively, which confirms that SSM-MCE is a better objective function to learn the feature transform. Fig. 1 illustrates an overall system development flow of our work in this paper. In training stage, after feature extraction of training samples, an LBG clustering algorithm  is used to construct multiple prototypes for each character class. Then a baseline classifier is constructed by using the SSM-MCE training. For model-space adaptation, an adapted classifier is constructed by using the linear regression transform(s) estimated from the adaptation data and the baseline classifier, which will be used to recognize unknown characters in the target scenario. For feature-space adaptation, a global feature transform is estimated from the adaptation data, which will be used to transform the feature vector of the unknown character back into the feature space of the training data so that the baseline classifier can be used in recognition stage. Full-size image (48 K) Fig. 1. Overall flow of system development. Figure options It is noted that the preliminary results of this study have been published in . The current paper is an extended version of the above report by including more detailed descriptions of relevant procedures, reporting additional experimental results and findings, and adding new figures and references to make the presentation more readable and accessible. The remainder of the paper is organized as follows. In Section 2, we describe briefly how to construct a multi-prototype based classifier by using the SSM-MCE training. In Section 3, we present formulations of SSM-MCE LR for both model-space and feature-space adaptation. Several important implementation issues are discussed in Section 4. In Section 5, we report experimental results for adaptation of font styles and low-quality text, respectively. Finally we conclude the paper in Section 6.
نتیجه گیری انگلیسی
In this paper, we have proposed a new SSM-MCE linear regression approach to adaptation of an SSM-MCE trained prototype-based classifier and demonstrated its application for Chinese OCR. In real-world application, the feature-space adaptation method can be used for fast adaptation with a small amount of adaptation data, while the model-space adaptation method can be used to upgrade the performance of the classifier by using increasingly more adaptation data. The proposed hybrid adaptation approach offers a good practical solution for cases with different amount of adaptation data. In this study, we have confirmed the effectiveness of the proposed approach for supervised adaptation of font styles and low-quality text, respectively. As future work, we will: • study more adaptation scenarios with mismatched training and recognition conditions; • study how to further improve the effectiveness of unsupervised adaptation for those difficult recognition tasks; and • apply the proposed approach to writer adaptation for handwriting recognition. Results will be reported elsewhere once they become available.