Combined method for scanned documents images segmentation using sequential extraction of regions
Journal Title: Восточно-Европейский журнал передовых технологий - Year 2018, Vol 5, Issue 2
Abstract
<p>We propose a combined method to segment the images of scanned documents, which, in contrast to known methods, implies a preliminary separation of the graphics and photograph regions from the text regions and a background. In this case, an analysis of the connected components is performed, which are different for graphics, photographs, and text regions. In order to classify the selected regions into the photograph and graphics regions, a block method is employed. It was established that such a technique for splitting the regions into blocks less affects the quality of segmentation when compared to applying the block method directly to the original image. To extract the text regions that are more complex in their shape from the background, the neighborhood of each pixel was processed.</p><p>To detect the boundaries of illustrations on the images of scanned documents, we applied the bloomberg method. In order to classify into photographs and graphics, it is proposed to split an illustration into blocks of pixels. Each block of pixels is identified with a vector of two features: the mean value of the local gradient magnitude, and the mean value of the function that localizes at the images of scanned documents the linear objects (graphics and text characters). The derived feature vectors were classified using a support vector machine.</p><p>When extracting the text regions, we applied a low-frequency filtering and a thresholding.</p><p>The combined method was implemented in practice to segment the test images of scanned newspaper articles from the document database mediateam at oulu university (finland). It was established that the combined method is characterized by an increase in performance speed during image segmentation at high quality processing.</p>
Authors and Affiliations
Marina Polyakova, Alesya Ishchenko, Natalya Volkova, Oleg Pavlov
Research into aero acoustic characteristics of two-row impellers of the axial compressor
<p>We have conducted numerical simulation of current in the axial impellers with a single-row and a two-row geometrically equivalent blade crowns with a density of blade crowns over average radius of 1...2.5. The pressur...
Using the intensity of absorbed gamma radiation to control the content of iron in ore
<p>The paper reports results of mathematical modeling of the intensity of absorbed gamma radiation for determining the iron content in IOR. It was shown that to enhance the accuracy of rapid control of the iron content i...
Synthesis of the structure for the optimal system of flow treatment of raw materials
<p>This paper demonstrates that contemporary studies into optimization of technological processes do not take into consideration in the models of systems and in the applied criteria the requirements to the overall effici...
Estimation of carrying capacity of metallic corrugated structures of the type Multiplate MP 150 during interaction with backfill soil
<p>We estimated the stressed state of a railroad structure with a large cross section spanning more than 6 m, which is made from metallic corrugated sheets of the type Multiplate MP 150. The stressed-strained state of th...
A method for determining information diffusion cascades on social networks
Information diffusion on social networks has many potential real-world applications such as online marketing, e-government campaigns, and predicting large social events. Modeling information diffusion is therefore a cruc...