• Title/Summary/Keyword: approximate similarity


An Improvement in K-NN Graph Construction using re-grouping with Locality Sensitive Hashing on MapReduce (MapReduce 환경에서 재그룹핑을 이용한 Locality Sensitive Hashing 기반의 K-Nearest Neighbor 그래프 생성 알고리즘의 개선)

  • Lee, Inhoe;Oh, Hyesung;Kim, Hyoung-Joo
    • KIISE Transactions on Computing Practices
    • /
    • v.21 no.11
    • /
    • pp.681-688
    • /
    • 2015
  • The k nearest neighbor (k-NN) graph construction is an important operation with many web-related applications, including collaborative filtering, similarity search, and many others in data mining and machine learning. Despite its many elegant properties, the brute-force k-NN graph construction method has a computational complexity of $O(n^2)$, which is prohibitive for large-scale data sets. Thus, MapReduce, a (Key, Value)-based distributed framework, is increasingly used together with Locality Sensitive Hashing, which is efficient for high-dimensional and sparse data. Following a two-stage strategy, we use the locality sensitive hashing technique to divide users into small subsets, and then calculate the similarity between pairs within each subset using a brute-force method on MapReduce. The candidate-group generation stage is particularly important because the brute-force calculation is performed in the following step. However, existing methods do not prevent large candidate groups. In this paper, we propose an efficient algorithm for approximate k-NN graph construction that regroups candidate groups. Experimental results show that our approach is more effective than existing methods in terms of graph accuracy and scan rate.
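
The two-stage strategy in this abstract can be illustrated with a minimal single-process sketch. It is not the authors' implementation: the abstract does not give the regrouping rule, so the sketch assumes that oversized candidate groups are simply re-hashed with one extra random hyperplane, and it assumes cosine similarity and random-hyperplane LSH. All function names and parameters (`lsh_groups`, `regroup`, `knn_graph`, `max_size`, `max_depth`) are hypothetical. In a MapReduce setting the hashing would roughly correspond to the map phase and the per-group brute force to the reduce phase.

```python
import random
from collections import defaultdict
from itertools import combinations


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = sum(a * a for a in u) ** 0.5
    nv = sum(b * b for b in v) ** 0.5
    return dot / (nu * nv) if nu and nv else 0.0


def lsh_groups(vectors, ids, n_planes, rng):
    """Stage 1: bucket ids by the sign pattern of n_planes random hyperplanes."""
    dim = len(next(iter(vectors.values())))
    planes = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(n_planes)]
    groups = defaultdict(list)
    for i in ids:
        signature = tuple(
            sum(p * x for p, x in zip(plane, vectors[i])) >= 0 for plane in planes
        )
        groups[signature].append(i)
    return list(groups.values())


def regroup(vectors, groups, max_size, rng, depth=0, max_depth=8):
    """Assumed regrouping rule: split any group larger than max_size by
    re-hashing it with one more hyperplane, up to max_depth attempts."""
    out = []
    for group in groups:
        if len(group) <= max_size or depth >= max_depth:
            out.append(group)
        else:
            halves = lsh_groups(vectors, group, 1, rng)
            out.extend(regroup(vectors, halves, max_size, rng, depth + 1, max_depth))
    return out


def knn_graph(vectors, k, n_planes=4, max_size=50, seed=0):
    """Stage 2: brute-force similarity only within each candidate group."""
    rng = random.Random(seed)
    groups = lsh_groups(vectors, list(vectors), n_planes, rng)
    groups = regroup(vectors, groups, max_size, rng)
    scored = defaultdict(list)
    for group in groups:
        for a, b in combinations(group, 2):
            s = cosine(vectors[a], vectors[b])
            scored[a].append((s, b))
            scored[b].append((s, a))
    return {
        i: [j for _, j in sorted(scored[i], key=lambda t: t[0], reverse=True)[:k]]
        for i in vectors
    }
```

Because only pairs inside the same group are compared, the result is approximate: true neighbors that land in different groups are missed, and capping the group size bounds the quadratic cost of the brute-force step.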

Behavior of Poisson Bracket Mapping Equation in Studying Excitation Energy Transfer Dynamics of Cryptophyte Phycocyanin 645 Complex

  • Lee, Weon-Gyu;Kelly, Aaron;Rhee, Young-Min
    • Bulletin of the Korean Chemical Society
    • /
    • v.33 no.3
    • /
    • pp.933-940
    • /
    • 2012
  • Recently, it has been shown that quantum coherence appears in the energy transfer of various photosynthetic light-harvesting complexes at temperatures ranging from cryogenic to even room temperature. Because photosynthetic systems are inherently complex, these findings have interested many researchers in both experiment and theory. On the theoretical side, simplified dynamics or semiclassical approaches have been widely used. In these approaches, the quantum-classical Liouville equation (QCLE) is the fundamental starting point. To reach a semiclassical scheme, approximations are needed to simplify the equations of motion of the various degrees of freedom. Here, we have adopted the Poisson bracket mapping equation (PBME) as an approximate form of the QCLE and applied it to find the time evolution of the excitation in a photosynthetic complex from marine algae. The benefit of using PBME is its similarity to conventional Hamiltonian dynamics. With it, we confirmed the coherent population transfer behavior in the short-time domain, as previously reported with a more accurate but more time-consuming iterative linearized density matrix approach. However, we find that the site populations do not behave according to the Boltzmann law in the long-time limit. We also test the effect of adding spurious high-frequency vibrations to the spectral density of the bath, and find that their existence does not alter the dynamics to any significant extent as long as the associated reorganization energy is not changed too drastically. This suggests that adopting classical-trajectory-based ensembles in semiclassical simulations should not influence the coherence dynamics in any practical manner, even though the classical trajectories often yield spurious high-frequency vibrational features in the spectral density.

Search Space Reduction Techniques in Small Molecular Docking (소분자 도킹에서 탐색공간의 축소 방법)

  • Cho, Seung Joo
    • Journal of Integrative Natural Science
    • /
    • v.3 no.3
    • /
    • pp.143-147
    • /
    • 2010
  • Since it is of great importance to know how a ligand binds to a receptor, considerable effort has gone into improving the prediction quality of docking poses. Earlier efforts focused on improving the search algorithm and scoring function of a docking program, resulting in partial improvement with considerable variation. Although these are important and essential, more tangible improvements came from the reduction of the search space. In a normal docking study, the approximate active site is assumed to be known. After the active site is defined, scoring functions and search algorithms are used to locate the expected binding pose within this search space. A good search algorithm will sample wisely toward the correct binding pose. By careful study of the receptor structure, it was possible to prioritize sub-spaces in the active site using "receptor-based pharmacophores" or "hot spots". In a sense, these techniques reduce the search space from the beginning. Further improvements were made when the bound ligand structure was available, i.e., the search could be directed by molecular similarity using ligand information. This could be very helpful in increasing the accuracy of the binding pose. In addition, if biological activity data are available, a docking program could be improved to the level of being useful in affinity prediction for a series of congeneric ligands. Since the number of co-crystal structures in the Protein Data Bank is increasing, "Ligand-Guided Docking" to reduce the search space will become more important for improving the accuracy of docking pose prediction and the efficiency of virtual screening. Further improvements in this area would be useful for producing more reliable docking programs.

Measurement of the Laminar Boundary Layer in a Streamwise Corner by using PIV Technique (PIV 기법을 이용한 Streamwise Corner 층류 경계층 측정 연구)

  • Park, Dong-Hun;Park, Seung-O;Kwon, Ki-Jung;Shim, Ho-Joon
    • Journal of the Korean Society for Aeronautical & Space Sciences
    • /
    • v.37 no.12
    • /
    • pp.1165-1172
    • /
    • 2009
  • The laminar boundary layer along a streamwise corner formed by two flat plates intersecting at a right angle is measured using the Particle Image Velocimetry (PIV) technique. The free stream velocity ranges from 2.96 m/s to 3.0 m/s. The angle of incidence of the corner is set to 1.2 degrees, providing a slightly favourable pressure gradient to ensure laminar flow in the corner region. A round leading edge is used and the length of the model is about 1000 mm. In the bisector plane, the measurement data show separation-type velocity profiles having an inflection point, which is a typical characteristic of laminar corner boundary layers. As the distance from the bisector plane increases, the velocity profiles are found to change into the Blasius profile. The change is complete at a distance from the bisector plane, measured along the plate, of about half the boundary layer thickness in the bisector plane. In the bisector plane, the growth characteristic of the boundary layer thickness and the approximate similarity of velocity profiles are confirmed from the measurement data.

Partial Substitution of Copper with Nickel for the Superconducting Bismuth Compound and Its Effect on the Physical and Electrical Properties

  • Kareem Ali Jasim;Riyam Abd Al-Zahra Fadil;Kassim Mahdi Wadi;Auday Hattem Shaban
    • Korean Journal of Materials Research
    • /
    • v.33 no.9
    • /
    • pp.360-366
    • /
    • 2023
  • This study focuses on how the partial substitution of copper by nickel nanoparticles affects the electrical and structural properties of the Bi2Ba2Ca2Cu2.9Ni0.1O10+δ, Bi2Ba2Ca2Cu2.8Ni0.2O10+δ and Bi2Ba2Ca2Cu2.6Ni0.4O10+δ compounds. Approximate values of crystallization size and crystallization percentage for the three compounds were calculated using the Scherrer, modified Scherrer, and Williamson-Hall methods. A great similarity was observed in the crystal size values from the Scherrer method, 243.442 nm, and the Williamson-Hall method, 243.794 nm, for the second sample. At the same time, this sample exhibited the highest crystal size value for the three methods. In the examination of electrical properties, the sample with 0.1 partial substitution, Bi2Ba2Ca2Cu2.9Ni0.1O10+δ, was determined to be the best, with a critical temperature of 100 K and an energy gap of $6.57639 \times 10^{-21}$ MeV. Using the SEM technique to analyze the structural morphology of the three phases, it was discovered that the size of the granular forms exceeds 25 nm. It was determined that the samples' shapes alter when the nickel concentration rises. The patterns that reveal the distribution of the crystal structure also exhibit clear homogeneity.

Characterization of Mamestra brassicae Nucleopolyhedrovirus (MabrNPV)-K1 Isolated in Korea

  • Lee, Jae-Kyung;Shin, Tae-Young;Bae, Sung-Min;Choi, Jae-Bang;Oh, Jeong-Mi;Koo, Hyun-Na;Kim, Ju-Il;Kwon, Min;Woo, Soo-Dong
    • International Journal of Industrial Entomology and Biomaterials
    • /
    • v.17 no.1
    • /
    • pp.125-129
    • /
    • 2008
  • The purpose of this study was to investigate the characteristics of Mamestra brassicae nucleopolyhedrovirus (MabrNPV)-K1 isolated in Korea. Polyhedra of MabrNPV-K1 were irregular in shape, with an average diameter of $1.8\ \mu$m. MabrNPV-K1 contained a number of nucleocapsids within a viral envelope embedded in the polyhedron. The polyhedrin of MabrNPV-K1 was composed of a single polypeptide with a M.W. of approximately 31 kDa, which is identical to that of the commercialized MabrNPV, Mamestrin, used as a biological control agent. The nucleotide and amino acid sequences within the coding region of MabrNPV-K1 polyhedrin shared 99.0% similarity with the polyhedrin gene from previously reported MabrNPVs. The median lethal concentrations ($LC_{50}$) of MabrNPV-K1 and Mamestrin to M. brassicae larvae were $3.9 \times 10^3$ PIBs/larva and $6.0 \times 10^4$ PIBs/larva, respectively. Mortality of MabrNPV-K1 against third-instar larvae was 15 times higher than that of Mamestrin. The median lethal times ($LT_{50}$) of MabrNPV-K1 by the concentration of polyhedra were lower ($4.4 \sim 6.1$ days) than those of Mamestrin ($4.1 \sim 8.6$ days). These results suggest that the local strain MabrNPV-K1 is highly pathogenic to M. brassicae and may be useful for the development of a biological control agent against it.

Efficient Multi-Step k-NN Search Methods Using Multidimensional Indexes in Large Databases (대용량 데이터베이스에서 다차원 인덱스를 사용한 효율적인 다단계 k-NN 검색)

  • Lee, Sanghun;Kim, Bum-Soo;Choi, Mi-Jung;Moon, Yang-Sae
    • Journal of KIISE
    • /
    • v.42 no.2
    • /
    • pp.242-254
    • /
    • 2015
  • In this paper, we address the problem of improving the performance of multi-step k-NN search using multi-dimensional indexes. Due to information loss caused by lower-dimensional transformations, existing multi-step k-NN search solutions produce a large tolerance (i.e., a large search range), and thus incur a large number of candidates, which are retrieved by a range query. These many candidates lead to overwhelming I/O and CPU overheads in the postprocessing step. To overcome this problem, we propose two efficient solutions that improve the search performance by reducing the tolerance of a range query and, accordingly, the number of candidates. First, we propose a tolerance reduction-based (approximate) solution that forcibly decreases the tolerance, which is determined by a k-NN query on the index, by the average ratio of high- and low-dimensional distances. Second, we propose a coefficient control-based (exact) solution that uses $c \cdot k$ instead of $k$ in a k-NN query to obtain a tighter tolerance and performs a range query using this tighter tolerance. Experimental results show that the proposed solutions significantly reduce the number of candidates and, accordingly, improve the search performance in comparison with the existing multi-step k-NN solution.
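
The multi-step scheme and the coefficient control idea can be sketched as follows. This is one reading of the abstract, not the authors' code: it assumes Euclidean distance, uses truncation to the first `m` coordinates as the lower-dimensional transformation (which lower-bounds the full distance), replaces the multidimensional index with linear scans, and takes the tolerance to be the k-th smallest high-dimensional distance among the `c * k` low-dimensional nearest neighbors. The names `multistep_knn`, `m`, and `c` are hypothetical.

```python
import math


def dist(u, v):
    """Euclidean distance between two equal-length sequences."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def multistep_knn(data, query, k, m, c=1):
    """Return (indices of the k nearest points, number of range-query candidates).

    Assumes 1 <= k <= len(data) and 1 <= m <= dimensionality.
    """
    low = [p[:m] for p in data]  # low-dimensional transformation (truncation)
    q_low = query[:m]

    # Step 1: k-NN query in the low-dimensional space, asking for c*k objects.
    n_first = min(len(data), c * k)
    by_low = sorted(range(len(data)), key=lambda i: dist(low[i], q_low))
    first = by_low[:n_first]

    # Step 2: the k-th smallest true distance among them is a safe tolerance,
    # because the true k-th nearest neighbor cannot be farther than that.
    tolerance = sorted(dist(data[i], query) for i in first)[k - 1]

    # Step 3: range query in the low-dimensional space with that tolerance.
    candidates = [i for i in range(len(data)) if dist(low[i], q_low) <= tolerance]

    # Step 4: postprocessing with true distances on the candidates only.
    result = sorted(candidates, key=lambda i: dist(data[i], query))[:k]
    return result, len(candidates)
```

With `c = 1` this reduces to the conventional multi-step search. A larger `c` can only keep or shrink the tolerance, since the k-th smallest distance over a superset is never larger, so the answer stays exact while the candidate set does not grow. The tolerance reduction-based (approximate) variant is not shown here.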

A Design and Implementation of a Content_Based Image Retrieval System using Color Space and Keywords (칼라공간과 키워드를 이용한 내용기반 화상검색 시스템 설계 및 구현)

  • Kim, Cheol-Ueon;Choi, Ki-Ho
    • The Transactions of the Korea Information Processing Society
    • /
    • v.4 no.6
    • /
    • pp.1418-1432
    • /
    • 1997
  • Most general content_based image retrieval techniques use color and texture as retrieval indices. Among color techniques, color histogram and color pair based retrieval techniques suffer from a lack of spatial information and text. This paper describes the design and implementation of a content_based image retrieval system using color space and keywords. The preprocessor for image retrieval uses the existing HSI (Hue, Saturation, Intensity) coordinate system and splits each image into a chromatic region and an achromatic region. It is necessary to normalize the image size to 200*N or N*200 and to convert true colors into 256 colors. Two color histograms, for background and object, are used to decide on color selection in the color space. Spatial information is obtained using maximum entropy discretization. The class, color, shape, location and size of an image can be chosen by keyword. An input color is limited to 15 keywords for chromatic and achromatic colors from the Korea Industrial Standards. The image retrieval method uses similarity as the key retrieval property. The weight values of color space $\alpha$ (%) and keyword $\beta$ (%) can be chosen by the user when entering the query words, controlling the values according to the properties of the image contents. The retrieval results in the test using extracted features such as color space and keyword for the query image are lower than those with weight values. With weight values, the averages of the measuring parameters are approximately Precision (0.858), Recall (0.936), RT (1), MT (0). These results show higher retrieval effectiveness than content_based image retrieval using color space of keywords.


LED Sensitive Light System Development by Brain-wave (LED감성조명 장치 개발을 통한 뇌파분석)

  • Choi, Keum-Yeon;Eo, Ik-Soo
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.11 no.1
    • /
    • pp.61-66
    • /
    • 2010
  • The purpose of this experiment is to analyze the basic states of the brain, which consist of rest, attention and concentration, by varying the color temperature through changes in RGB color after manufacturing an LED illumination stand. The basic states (rest, attention and concentration) of the subjects were measured at three different color temperatures: 2,300 K, 4,000 K and 6,000 K. The results show that subjects feel more comfortable and relaxed as the color temperature decreases. For example, a small increase in the concentration index is seen at the 4,000 K condition, and we can estimate that the right brain is more activated at the 4,000 K condition. However, no difference was found at the 6,000 K condition. The main cause of this lack of difference was the similarity of that color temperature to that of a general fluorescent lamp. In the heat-dissipation design, the interface temperature between the LED and the PCB obtained with COMSOL Multiphysics was approximately 80 degrees, falling to approximately 50 degrees at the bottom plane of the PCB. This was verified with chip LEDs arranged on a metal PCB, and the manufactured device confirmed values approximately matching the heat-dissipation design.

A Study on Fractionation and Characterization of Water-Soluble Natural Fe-Chelates From Garbage Compost and Activated Sewage Sludge (활성오니(活性汚泥) 및 진개퇴비중(塵芥堆肥中) 수용성(水溶性) 철(鐵) 킬레이트의 분리(分離)와 특성(特性)에 관(關)한 연구(硏究))

  • Park, Nae-Joung;Lindsay, W.L.
    • Applied Biological Chemistry
    • /
    • v.18 no.4
    • /
    • pp.194-202
    • /
    • 1975
  • This study was conducted to examine the properties of the water-soluble natural chelating agents from garbage compost and activated sewage sludge responsible for Fe chelation, which is closely associated with their effectiveness in correcting iron chlorosis in plants. The water-soluble fraction of these materials was fractionated by means of Sephadex gel filtration, and the fractions of Fe chelates were traced by radioactive $^{59}Fe$. The fractions were examined by ultraviolet and infrared spectroscopy and by stability constants for Fe. The water-soluble fraction from garbage compost was separated by Sephadex G-25 into approximately four fractions. Most of the added $^{59}Fe$ was associated with fraction I, which appeared at the void volume. Further fractionation by Sephadex G-50 indicated that the molecular weight of the water-soluble chelating agents is in the approximate range of 5,000 to 10,000. The water-soluble fraction from activated sewage sludge gave six fractions by Sephadex G-25. Most of the added $^{59}Fe$ was found in fractions I, II, and III. The molecular weights of most chelating agents associated with $^{59}Fe$ appeared to be less than 5,000, and those of fraction I, which appeared at the void volume, were in the range of 5,000 to 1,000. The discrepancy between radioactivity count and UV absorption indicated the heterogeneity of the fractions obtained by Sephadex gel filtration. Ultraviolet absorption spectra of all fractions separated by Sephadex G-25 and containing chelating agents showed no differences. Fractions IV and V of the sewage extract showed absorption maxima and shifting similar to nucleic acid components, suggesting the presence of decomposition products of nucleic acid. Similarly, fraction VI contained phenolic-type amino acid groups.
Fraction I of the compost extract contained most of the added $^{59}Fe$ and showed weak but definite extra absorption in the 1230 and 1270 cm$^{-1}$ region, suggesting that extra oxygen groups in a polyphenolic structure were probably involved in Fe chelation. In the sewage extract, fractions I, II, and III, in which most of the $^{59}Fe$ was found, showed strong, definite polypeptide absorption in the region of 1540 cm$^{-1}$ due to NH deformation and C-N stretching of amide groups in the peptide bond. These extra functional groups in fractions I, II, and III appeared to be associated with Fe chelation. The other fractions, not associated with $^{59}Fe$, still have carboxyl and hydroxyl groups, suggesting that these functional groups in these water extracts may not independently form the Fe chelates. Precipitation of ferric hydroxide precluded measuring the stability constants for Fe chelates. However, the formation constants for Zn chelates, as log K values at pH 4.0, from which the strength of chelation with Fe could be presumed, were 8.23 for the compost extract and 9.75 for the sewage extract, indicating strong complexation with metals. The chelating capacity of compost extract containing 6.5 g organic matter per liter was 0.82 mM, and that of sewage extract containing 5.3 g per liter was 0.64 mM.
