• Title/Summary/Keyword: shape descriptor

Search Result 103, Processing Time 0.025 seconds

Score Image Retrieval to Inaccurate OMR performance

  • Kim, Haekwang
    • Journal of Broadcast Engineering
    • /
    • v.26 no.7
    • /
    • pp.838-843
    • /
    • 2021
  • This paper presents an algorithm for effective retrieval of score information to an input score image. The originality of the proposed algorithm is that it is designed to be robust to recognition errors by an OMR (Optical Music Recognition), while existing methods such as pitch histogram requires error induced OMR result be corrected before retrieval process. This approach helps people to retrieve score without training on music score for error correction. OMR takes a score image as input, recognizes musical symbols, and produces structural symbolic notation of the score as output, for example, in MusicXML format. Among the musical symbols on a score, it is observed that filled noteheads are rarely detected with errors with its simple black filled round shape for OMR processing. Barlines that separate measures also strong to OMR errors with its long uniform length vertical line characteristic. The proposed algorithm consists of a descriptor for a score and a similarity measure between a query score and a reference score. The descriptor is based on note-count, the number of filled noteheads in a measure. Each part of a score is represented by a sequence of note-count numbers. The descriptor is an n-gram sequence of the note-count sequence. Simulation results show that the proposed algorithm works successfully to a certain degree in score image-based retrieval for an erroneous OMR output.

Shape similarity measure for M:N areal object pairs using the Zernike moment descriptor (저니키 모멘트 서술자를 이용한 M:N 면 객체 쌍의 형상 유사도 측정)

  • Huh, Yong;Yu, Ki-Yun
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.30 no.2
    • /
    • pp.153-162
    • /
    • 2012
  • In this paper, we propose a new shape similarity measure for M:N polygon pairs regardless of different object cardinalities in the pairs. The proposed method compares the projections of two shape functions onto Zernike polynomial basis functions, where the shape functions were obtained from each overall region of objects, thus not being affected by the cardinalities of object pairs. Moments with low-order basis functions describe global shape properties and those with high-order basis functions describe local shape properties. Therefore several moments up to a certain order where the original shapes were similarly reconstructed can efficiently describe the shape properties thus be used for shape comparison. The proposed method was applied for the building objects in the New address digital map and a car navigation map of Seoul area. Comparing to an overlapping ratio method, the proposed method's similarity is more robust to object cardinality.

The Correlation Analysis Between New Catchment Shape Descriptor and The Lag Time of Nash Model (신집수형상디스크립터와 Nash 모형의 지체시간 사이의 상관성 분석)

  • Kim, Joo-Cheol;Jung, Kwan-Sue;Kim, Jae-Han
    • Journal of Korea Water Resources Association
    • /
    • v.37 no.12
    • /
    • pp.1065-1074
    • /
    • 2004
  • This study aims at the introduction of new catchment shape descriptor, developed by Moussa(2003), based on equivalent ellipse and the assessment of its hydrologic applicability. Two descriptors a+b and a+b+${\varepsilon}OM$were correlated to the lag time and those were applied to the estimation of representative values of Nash model parameters. They are applied in order to examine the practicality to 3 catchments in Korea, catchments in Korea, respectively, i.e. Pyeongchanggang catchment in Han river, Bocheongcheon catchment in Geum river and Wicheon catchment in Nakdong river. As a result both of two descriptors show higher correlations to the lag lime than classical geomorphologic factors and hereby Moussa's suggestion(2003) is confirmed. For the sake of simplicity the former is recommended. Also representative IUHs derived from this study show consistent basin response characteristics. It is desirable to conduct further more case studies on many other basins.

Robust 3D Model Hashing Scheme Based on Shape Feature Descriptor (형상 특징자 기반 강인성 3D 모델 해싱 기법)

  • Lee, Suk-Hwan;Kwon, Seong-Geun;Kwon, Ki-Ryong
    • Journal of Korea Multimedia Society
    • /
    • v.14 no.6
    • /
    • pp.742-751
    • /
    • 2011
  • This paper presents a robust 3D model hashing dependent on key and parameter by using heat kernel signature (HKS), which is special shape feature descriptor, In the proposed hashing, we calculate HKS coefficients of local and global time scales from eigenvalue and eigenvector of Mesh Laplace operator and cluster pairs of HKS coefficients to 2D square cells and calculate feature coefficients by the distance weights of pairs of HKS coefficients on each cell. Then we generate the binary hash through binarizing the intermediate hash that is the combination of the feature coefficients and the random coefficients. In our experiment, we evaluated the robustness against geometrical and topological attacks and the uniqueness of key and model and also evaluated the model space by estimating the attack intensity that can authenticate 3D model. Experimental results verified that the proposed scheme has more the improved performance than the conventional hashing on the robustness, uniqueness, model space.

3D Model Retrieval Using Sliced Shape Image (단면 형상 영상을 이용한 3차원 모델 검색)

  • Park, Yu-Sin;Seo, Yung-Ho;Yun, Yong-In;Kwon, Jun-Sik;Choi, Jong-Soo
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.45 no.6
    • /
    • pp.27-37
    • /
    • 2008
  • Applications of 3D data increase with advancement of multimedia technique and contents, and it is necessary to manage and to retrieve for 3D data efficiently. In this paper, we propose a new method using the sliced shape which extracts efficiently a feature description for shape-based retrieval of 3D models. Since the feature descriptor of 3D model should be invariant to translation, rotation and scale for its model, normalization of models requires for 3D model retrieval system. This paper uses principal component analysis(PCA) method in order to normalize all the models. The proposed algorithm finds a direction of each axis by the PCA and creates orthogonal n planes in each axis. These planes are orthogonalized with each axis, and are used to extract sliced shape image. Sliced shape image is the 2D plane created by intersecting at between 3D model and these planes. The proposed feature descriptor is a distribution of Euclidean distances from center point of sliced shape image to its outline. A performed evaluation is used for average of the normalize modified retrieval rank(ANMRR) with a standard evaluation from MPEG-7. In our experimental results, we demonstrate that the proposed method is an efficient 3D model retrieval.

Robust Head Tracking using a Hybrid of Omega Shape Tracker and Face Detector for Robot Photographer (로봇 사진사를 위한 오메가 형상 추적기와 얼굴 검출기 융합을 이용한 강인한 머리 추적)

  • Kim, Ji-Sung;Joung, Ji-Hoon;Ho, An-Kwang;Ryu, Yeon-Geol;Lee, Won-Hyung;Jin, Chung-Myung
    • The Journal of Korea Robotics Society
    • /
    • v.5 no.2
    • /
    • pp.152-159
    • /
    • 2010
  • Finding a head of a person in a scene is very important for taking a well composed picture by a robot photographer because it depends on the position of the head. So in this paper, we propose a robust head tracking algorithm using a hybrid of an omega shape tracker and local binary pattern (LBP) AdaBoost face detector for the robot photographer to take a fine picture automatically. Face detection algorithms have good performance in terms of finding frontal faces, but it is not the same for rotated faces. In addition, when the face is occluded by a hat or hands, it has a hard time finding the face. In order to solve this problem, the omega shape tracker based on active shape model (ASM) is presented. The omega shape tracker is robust to occlusion and illuminationchange. However, whenthe environment is dynamic,such as when people move fast and when there is a complex background, its performance is unsatisfactory. Therefore, a method combining the face detection algorithm and the omega shape tracker by probabilistic method using histograms of oriented gradient (HOG) descriptor is proposed in this paper, in order to robustly find human head. A robot photographer was also implemented to abide by the 'rule of thirds' and to take photos when people smile.

The Usage of Color & Edge Histogram Descriptors for Image Mining (칼라와 에지 히스토그램 기술자를 이용한 영상 마이닝 향상 기법)

  • An, Syungog;Park, Dong-Won;Singh, Kulwinder;Ma, Ming
    • The Journal of Korean Association of Computer Education
    • /
    • v.7 no.5
    • /
    • pp.111-120
    • /
    • 2004
  • The MPEG-7 standard defines a set of descriptors that extracts low-level features such as color, texture and object shape from an image and generates metadata in order to represent these extracted information. But the matching performance for image mining ma y not be satisfactory by u sing only on e of these features. Rather than by combining these features we can achieve a better query performance. In this paper we propose a new image retrieval technique for image mining that combines the features extracted from MPEG-7 visual color and texture descriptors. Specifically, we use only some specifications of Scalable Color Descriptor (SCD) and Non-Homogeneous Texture Descriptor also known as Edge Histogram Descriptor (EHD) for the implementation of the color and edge histograms respectively. MPEG-7 standard defines $l_{1}$-norm based matching in EHD and SCD. But in our approach, for distance measurement, we achieve a better result by using cosine similarity coefficient for color histograms and Euclidean distance for edge histograms. Our approach toward this system is more experimental based than hypothetical.

  • PDF

MPEG-7 Texture Descriptor (MPEG-7 질감 기술자)

  • 강호경;정용주;유기원;노용만;김문철;김진웅
    • Journal of Broadcast Engineering
    • /
    • v.5 no.1
    • /
    • pp.10-22
    • /
    • 2000
  • In this paper, we present a texture description method as a standardization of multimedia contents description. Like color, shape, object and camera motion information, texture is one of very important information in the visual part of international standard (MPEG-7) in multimedia contents description. Current MPEG-7 texture descriptor has been designed to fit human visual system. Many psychophysical experiments give evidence that the brain decomposes the spectra into perceptual channels that are bands in spatial frequency. The MPEG-7 texture description method has employed Radon transform that fits with HVS behavior. By taking average energy and energy deviation of HVS channels, the texture descriptor is generated. To test the performance of current texture descriptor, experiments with MPEG-7 Texture data sets of T1 to T7 are performed. Results show that the current MPEG-7 texture descriptor gives better retrieval rate and fast and fast extraction time for texture feature.

  • PDF

Complex Color Model for Efficient Representation of Color-Shape in Content-based Image Retrieval (내용 기반 이미지 검색에서 효율적인 색상-모양 표현을 위한 복소 색상 모델)

  • Choi, Min-Seok
    • Journal of Digital Convergence
    • /
    • v.15 no.4
    • /
    • pp.267-273
    • /
    • 2017
  • With the development of various devices and communication technologies, the production and distribution of various multimedia contents are increasing exponentially. In order to retrieve multimedia data such as images and videos, an approach different from conventional text-based retrieval is needed. Color and shape are key features used in content-based image retrieval, which quantifies and analyzes various physical features of images and compares them to search for similar images. Color and shape have been used as independent features, but the two features are closely related in terms of cognition. In this paper, a method of describing the spatial distribution of color using a complex color model that projects three-dimensional color information onto two-dimensional complex form is proposed. Experimental results show that the proposed method can efficiently represent the shape of spatial distribution of colors by frequency transforming the complex image and reconstructing it with only a few coefficients in the low frequency.

Soil Particle Shape Analysis Using Fourier Descriptor Analysis (퓨리에 기술자 분석을 이용한 단일 흙 입자의 형상 분석)

  • Koo, Bonwhee;Kim, Taesik
    • Journal of the Korean GEO-environmental Society
    • /
    • v.17 no.3
    • /
    • pp.21-26
    • /
    • 2016
  • Soil particle shape analysis was conducted with sands from Jumujun, Korea and Ras Al Khair, Saudi Arabia. Two hundred times enlarged digital images of the particles of those two sands were obtained with an optical microscope. The resolution of the digital images was $640{\times}320$. By conducting digital image processing, the coordinates of the soil particle boundary were extracted. After mapping those coordinates to the complex space, Fourier transformation was performed and the coefficients of each trigonometry term were computed. The coefficients reflect the shape characteristics of the sand grains and are invariant to translation. To evaluate the shape itself excluding the size of the soil particle, the coefficient was normalized by the equivalent radius of soil particle; this is called Fourier descriptor. After analyzing the Fourier descriptors, it was found that the major characteristics of Jumunjin and Ras Al Khair sands were elongation and asymmetry. Furthermore, it was found that the particle shapes reflect the self-similar, fractal nature of the textural features. The effects of resolution on soil particle shape analysis was also studied. Regarding this, it was found that the significant Fourier descriptors were not significantly affected by the image resolution investigated in this study, but the descriptors associated with textural features were affected.