• Title/Summary/Keyword: Similarity rule

An Innovative Approach of Bangla Text Summarization by Introducing Pronoun Replacement and Improved Sentence Ranking

  • Haque, Md. Majharul; Pervin, Suraiya; Begum, Zerina
    • Journal of Information Processing Systems / v.13 no.4 / pp.752-777 / 2017
  • This paper proposes an automatic method for summarizing Bangla news documents. In the proposed approach, pronoun replacement is performed for the first time to minimize dangling pronouns in the summary. After pronoun replacement, sentences are ranked using term frequency, sentence frequency, numerical figures, and title words. If two sentences have at least 60% cosine similarity, the frequency of the larger sentence is increased and the smaller sentence is removed to eliminate redundancy. Moreover, the first sentence is always included in the summary if it contains any title word. In Bangla text, numerical figures can be written in words or in digits in a variety of forms; all of these forms are identified to assess the importance of sentences. The approach uses a rule-based system together with a hidden Markov model and a Markov chain model. To derive the rules, we analyzed 3,000 Bangla news documents and studied several Bangla grammar books. A series of experiments was performed on 200 Bangla news documents and 600 summaries (three summaries per document). The evaluation results demonstrate the effectiveness of the proposed technique over the four latest methods.
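The redundancy step described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the whitespace tokenizer, the bag-of-words vectors, the function names, and the fixed score boost of 1.0 are all assumptions, since the abstract states only the 60% cosine threshold and that the larger sentence is kept.

```python
import math
from collections import Counter


def cosine_similarity(a: str, b: str) -> float:
    """Cosine similarity of two sentences over whitespace-token counts (assumed tokenizer)."""
    va, vb = Counter(a.lower().split()), Counter(b.lower().split())
    dot = sum(va[t] * vb[t] for t in va.keys() & vb.keys())
    norm = math.sqrt(sum(c * c for c in va.values())) * math.sqrt(
        sum(c * c for c in vb.values())
    )
    return dot / norm if norm else 0.0


def drop_redundant(sentences, scores, threshold=0.6, boost=1.0):
    """Remove the shorter sentence of each near-duplicate pair and boost the longer one.

    `boost` is a hypothetical amount; the abstract does not say how much the
    frequency of the larger sentence is increased.
    """
    scores = list(scores)
    removed = set()
    for i in range(len(sentences)):
        for j in range(i + 1, len(sentences)):
            if i in removed or j in removed:
                continue
            if cosine_similarity(sentences[i], sentences[j]) >= threshold:
                len_i, len_j = len(sentences[i].split()), len(sentences[j].split())
                keep, drop = (i, j) if len_i >= len_j else (j, i)
                scores[keep] += boost
                removed.add(drop)
    return [
        (s, sc) for k, (s, sc) in enumerate(zip(sentences, scores)) if k not in removed
    ]
```

For example, given "the river flooded the town" and "the river flooded the town yesterday", the sketch keeps only the longer sentence and raises its score.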

Ontology Knowledge Base Scheme for User Query Semantic Interpretation (사용자 질의 의미 해석을 위한 온톨로지 지식베이스 스키마 구축)

  • Doh, Hana; Lee, Moo-Hun; Jeong, Hoon; Choi, Eui-In
    • Journal of Digital Convergence / v.11 no.3 / pp.285-292 / 2013
  • Information retrieval has recently been moving toward semantic search, which provides more accurate results than keyword-based search. Most users, however, are still accustomed to keyword-based search and therefore find it hard to write a typed, structured query. In this paper, we propose an ontology knowledge-base scheme for interpreting such users' queries. The proposed scheme is designed on OWL-DL for description logic reasoning and can provide a richer representation of the relationships between objects by using SWRL (Semantic Web Rule Language). Finally, we describe experimental results of similarity measurement that verify the semantic interpretation of user queries.

Hydraulic Tests of Lox Pump for 75-ton class Liquid Rocket Engines (75톤급 로켓엔진용 산화제펌프의 수력성능시험)

  • Kim, Dae-Jin; Hong, Soon-Sam; Choi, Chang-Ho; Kim, Jin-Han
    • Proceedings of the Korean Society of Propulsion Engineers Conference / 2010.05a / pp.77-80 / 2010
  • A series of hydraulic tests of a Lox pump was performed using water at room temperature. According to the test results, the Lox pump satisfies its design requirement, but neither the head nor the efficiency fully follows the conventional similarity rule. The deviation of the head from the rule is assumed to be due to increased volute loss at high rotational speed. Furthermore, it is found that when the pump operates at a flow ratio less than the design requirement, the leakage flow rate appears to increase.
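The abstract does not spell out the similarity rule it refers to. Assuming it means the standard pump affinity laws for a fixed impeller diameter (flow proportional to rotational speed, head proportional to speed squared), a minimal sketch of the prediction and of the deviation from it might look like this. The function names and the sample numbers are hypothetical.

```python
def scale_by_affinity_laws(flow, head, speed_ref, speed_new):
    """Predict flow and head at a new rotational speed from a reference operating point.

    Assumes the conventional affinity laws: Q ~ N and H ~ N**2 (same impeller diameter).
    """
    ratio = speed_new / speed_ref
    return flow * ratio, head * ratio ** 2


def relative_deviation(measured, predicted):
    """Fractional deviation of a measured value from the similarity-rule prediction."""
    return (measured - predicted) / predicted


# Hypothetical operating point: doubling the speed doubles the flow and quadruples the head.
predicted_flow, predicted_head = scale_by_affinity_laws(10.0, 100.0, 1000.0, 2000.0)
```

A measured head below `predicted_head` would give a negative `relative_deviation`, which is the kind of departure the abstract attributes to volute loss.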

Recommendation System using Associative Web Document Classification by Word Frequency and α-Cut (단어 빈도와 α-cut에 의한 연관 웹문서 분류를 이용한 추천 시스템)

  • Jung, Kyung-Yong; Ha, Won-Shik
    • The Journal of the Korea Contents Association / v.8 no.1 / pp.282-289 / 2008
  • Although there have been technological developments in improving collaborative filtering, they have yet to fully reflect the actual relations among items. In this paper, we propose a recommendation system using associative web document classification by word frequency and ${\alpha}$-cut to address the shortcomings of collaborative filtering. The proposed method extracts words from web documents through morpheme analysis and accumulates term-frequency weights. It generates association rules with the Apriori algorithm and applies the term-frequency weights to their confidence. It then calculates the similarity among the words using hypergraph partitioning. Lastly, it classifies related web documents using ${\alpha}$-cut and calculates similarity using adjusted cosine similarity. The results show that the proposed method significantly outperforms the existing methods.
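For readers unfamiliar with the adjusted cosine similarity mentioned at the end of this abstract, the sketch below shows its usual item-to-item form, in which each user's mean rating is subtracted before the cosine is taken. The rating dictionary layout and the function name are assumptions for illustration; the paper applies the measure to web documents and its exact inputs are not given here.

```python
import math


def adjusted_cosine(ratings, item_i, item_j):
    """Adjusted cosine similarity between two items.

    `ratings` maps user -> {item: rating} (assumed layout). Only users who rated
    both items contribute, and each rating is centered on that user's mean.
    """
    num = den_i = den_j = 0.0
    for user_ratings in ratings.values():
        if item_i not in user_ratings or item_j not in user_ratings:
            continue
        mean = sum(user_ratings.values()) / len(user_ratings)
        di = user_ratings[item_i] - mean
        dj = user_ratings[item_j] - mean
        num += di * dj
        den_i += di * di
        den_j += dj * dj
    if den_i == 0 or den_j == 0:
        return 0.0  # undefined similarity is reported as 0 in this sketch
    return num / math.sqrt(den_i * den_j)
```

Centering on the user mean is what distinguishes this from plain cosine similarity: two users who rate on different scales but agree on the ordering of items contribute the same signal.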

An Analysis of the Student's Algebra Word Problem Solving Process (대수 문장제 해결을 위한 학생들의 풀이 과정 분석: 일련의 표시(Chain of signification) 관점의 사례연구)

  • Park, Hyun-Jeong; Lee, Chong-Hee
    • School Mathematics / v.9 no.1 / pp.141-160 / 2007
  • The purpose of this paper was to evaluate how students apply prior knowledge or experience in solving algebra word problems, from a chain-of-signification perspective. Three middle school students were evaluated in this case study. The results showed that the subjects formed similarities in the process of applying the knowledge needed to solve a problem. Students A and C used semi-open-end formulas and closed formulas as solutions. They then gave each solution a concrete shape using the chain of signification applied to the solution, thereby forming procedural similarity. Here, the chain of signification could be a combination of numbers, words, and pictures (such as diagrams or graphs), or just numbers or words. On the other hand, student C, who recognized closed formulas and her own rule as a solution method, could not fully form procedural similarity because of many errors arising from number information. Nonetheless, all of the subjects showed something in common in the process of arriving at an algorithm that was a semi-open-end formula or a closed formula.

Content Recommendation Techniques for Personalized Software Education (개인화된 소프트웨어 교육을 위한 콘텐츠 추천 기법)

  • Kim, Wan-Seop
    • Journal of Digital Convergence / v.17 no.8 / pp.95-104 / 2019
  • Recently, software education has been emphasized as a key element of the fourth industrial revolution, and many universities are strengthening software education for all students accordingly. Online content is an effective way to introduce software education for all students. However, uniform online content is limited in that it does not consider students' individual characteristics (major, interest in software, comprehension, interests, etc.). In this study, we propose a recommendation method that uses the directional similarity between content items in a Boolean view-history data environment. We propose a new item-based recommendation formula that uses the confidence value from association rule analysis as the similarity level, and we apply it to data from a domestic paid content site. Experimental results show that recommendation accuracy is higher than with traditional collaborative recommendation using cosine or Jaccard similarity measures.
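The idea of using association-rule confidence as a directional similarity can be illustrated with a short sketch. It assumes Boolean view histories stored as one set of item IDs per user, and it ranks candidates by the plain confidence conf(A→B) = count(A and B) / count(A). The paper's own recommendation formula is not given in the abstract, so the ranking function here is only a hypothetical stand-in.

```python
def confidence(histories, antecedent, consequent):
    """conf(antecedent -> consequent) over Boolean view histories (one set per user)."""
    seen_a = sum(1 for h in histories if antecedent in h)
    seen_both = sum(1 for h in histories if antecedent in h and consequent in h)
    return seen_both / seen_a if seen_a else 0.0


def rank_by_confidence(histories, item):
    """Rank every other item by conf(item -> other), highest first (ties by name)."""
    others = set().union(*histories) - {item}
    scored = [(other, confidence(histories, item, other)) for other in others]
    return sorted(scored, key=lambda pair: (-pair[1], pair[0]))


# Hypothetical histories: everyone who viewed "b" also viewed "a", but not vice versa.
histories = [{"a", "b"}, {"a"}, {"a", "c"}, {"a", "b"}]
```

With these histories, conf(a→b) is 0.5 while conf(b→a) is 1.0. That asymmetry is what makes the measure directional, unlike cosine or Jaccard similarity.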

Cryptocurrency Recommendation Model using the Similarity and Association Rule Mining (유사도와 연관규칙분석을 이용한 암호화폐 추천모형)

  • Kim, Yechan; Kim, Jinyoung; Kim, Chaerin; Kim, Kyoung-jae
    • Journal of Intelligence and Information Systems / v.28 no.4 / pp.287-308 / 2022
  • The explosive growth of cryptocurrency, led by Bitcoin, has recently emerged as a major issue in the financial market. As a result, interest in cryptocurrency investment is increasing, but the market is open 24 hours a day, 365 days a year, and price volatility and the exponentially increasing number of cryptocurrencies pose risks to cryptocurrency investors. For these reasons, there is a growing need for research that reduces investors' risks by screening out cryptocurrencies that are not suitable for recommendation. Unlike previous studies that maximize returns by simply predicting future cryptocurrency prices or that construct cryptocurrency portfolios focused on returns, this paper reflects investors' tendencies and presents an interpretable recommendation method that can reduce investors' risks by selecting suitable altcoins. The altcoins are recommended using the Apriori algorithm, a machine learning technique, based on similarity to and association rules with Bitcoin.

A Multi-Phase Decision Making Model for Supplier Selection Under Supply Risks (공급 리스크를 고려한 공급자 선정의 다단계 의사결정 모형)

  • Yoo, Jun-Su; Park, Yang-Byung
    • Journal of Korean Society of Industrial and Systems Engineering / v.40 no.4 / pp.112-119 / 2017
  • Selecting suppliers in a global supply chain is a very difficult and complicated decision-making problem, particularly because of the various types of supply risk in addition to the uncertain performance of potential suppliers. This paper proposes a multi-phase decision-making model for supplier selection under supply risks in global supply chains. In the first phase, the model suggests supplier selection solutions suitable for a given decision-making condition using a rule-based expert system. The expert system consists of a knowledge base of supplier selection solutions and an "if-then" rule-based inference engine. The knowledge base contains information about options and their consistency for seven characteristics of 20 supplier selection solutions chosen from articles published in SCIE journals since 2010. In the second phase, the model computes the potential suppliers' general performance indices using the technique for order preference by similarity to ideal solution (TOPSIS), based on their scores obtained by applying the suggested solutions. In the third phase, the model computes their risk indices using TOPSIS, based on their historical and predicted scores obtained by applying a risk evaluation algorithm. The evaluation algorithm deals with seven types of supply risk that significantly affect a supplier's performance and eventually influence the buyer's production plan. In the fourth phase, the model selects Pareto-optimal suppliers based on their general performance and risk indices. An example demonstrates the implementation of the proposed model. The proposed model provides supply chain managers with a practical tool for effectively selecting the best suppliers while considering supply risks as well as general performance.
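The TOPSIS step used in the second and third phases can be sketched as below. This follows the common textbook formulation (vector normalization, weighted matrix, Euclidean distance to the ideal and anti-ideal points); the criteria, weights, and supplier scores are hypothetical, and the paper's own scoring inputs are not reproduced here.

```python
import math


def topsis(matrix, weights, benefit):
    """Return a closeness coefficient in [0, 1] for each alternative (row).

    matrix  : rows are alternatives (e.g. suppliers), columns are criteria
    weights : one weight per criterion
    benefit : True if larger is better for that criterion, False if smaller is better
    """
    n_crit = len(weights)
    # Vector-normalize each column, then apply the criterion weights.
    norms = [math.sqrt(sum(row[j] ** 2 for row in matrix)) or 1.0 for j in range(n_crit)]
    weighted = [[weights[j] * row[j] / norms[j] for j in range(n_crit)] for row in matrix]
    columns = list(zip(*weighted))
    best = [max(c) if benefit[j] else min(c) for j, c in enumerate(columns)]
    worst = [min(c) if benefit[j] else max(c) for j, c in enumerate(columns)]
    scores = []
    for row in weighted:
        d_best = math.dist(row, best)
        d_worst = math.dist(row, worst)
        total = d_best + d_worst
        scores.append(d_worst / total if total else 0.0)
    return scores


# Hypothetical example: the second supplier is better on both benefit criteria.
scores = topsis([[3, 4], [6, 8]], [0.5, 0.5], [True, True])
```

A higher closeness coefficient means the alternative lies nearer the ideal solution, so the suppliers can be ranked by sorting on the returned scores.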

Flow Resistance and Modeling Rule of Fishing Nets 3. Establishment of Modeling Rule and its Theoritical Examination (그물어구의 유수저항과 모형수칙 3. 모형수칙의 수립 및 이론적 검토)

  • KIM Dae-An
    • Korean Journal of Fisheries and Aquatic Sciences / v.30 no.4 / pp.543-549 / 1997
  • The problems in the existing modeling rules for fishing nets, especially Tauti's rule, which has been used most commonly, were investigated. It was found that these rules could not give a good similarity between the prototype and model nets because they neither analyzed the flow resistance of nets accurately nor determined the ratio of flow velocity between the two nets properly. Thus, a modeling rule was newly derived by regarding the nets as holey structures that suck water into their mouths and then filter it through their meshes, as in the previous paper. The similarity conditions obtained between the two nets, distinguished by subscripts 1 and 2, are as follows: $$\frac{d_2}{d_1}=\sqrt{\frac{l_2}{l_1}},\;\frac{N_2}{N_1}=\left(\frac{d_1}{d_2}\right)^{1.5}\frac{L_2}{L_1},\;\varphi_1=\varphi_2,\;\frac{d_{r2}}{d_{r1}}=\sqrt{\frac{L_2(\rho_{r1}-\rho_{w1})}{L_1(\rho_{r2}-\rho_{w2})}},$$ $$\frac{N_{a2}}{N_{a1}}=\frac{W_{a1}}{W_{a2}}\left(\frac{L_2}{L_1}\right)^2,\;\nu_1=\nu_2\;\text{and}\;\frac{R_2}{R_1}=\left(\frac{L_2}{L_1}\right)^2,$$ where $L$ is the length of nettings, $d$ the diameter of netting twines, $2l$ the mesh size, $2\varphi$ the angle between two adjacent bars, $N$ the number of meshes at the sides of nettings, $d_r$ the diameter of ropes, $\rho_r$ the specific gravity of ropes, $W_a$ the weight in water of one piece of float or sinker, $N_a$ the number of floats or sinkers, $\nu$ the flow velocity, and $R$ the flow resistance of the net. When the model experiments aim at investigating the influence of the weight in water of nettings on their shapes in nets subjected to water flow of very low velocity, however, the following condition is added: $$\frac{\rho_2-\rho_{w2}}{\rho_1-\rho_{w1}}=\frac{d_1}{d_2},$$ where $\rho$ is the specific gravity of netting twines.
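As a worked illustration of three of the similarity conditions above, the sketch below computes the twine-diameter ratio, mesh-count ratio, and flow-resistance ratio from a chosen length ratio and mesh-size ratio. The function name and the sample ratios (a 1/16 length scale with a 1/4 mesh scale) are hypothetical; the formulas are taken directly from the stated conditions.

```python
import math


def model_net_ratios(length_ratio, mesh_ratio):
    """Model-to-prototype ratios from the stated similarity conditions.

    length_ratio : L2 / L1 (netting length)
    mesh_ratio   : l2 / l1 (half mesh size)
    """
    twine_ratio = math.sqrt(mesh_ratio)                            # d2/d1 = sqrt(l2/l1)
    mesh_count_ratio = (1.0 / twine_ratio) ** 1.5 * length_ratio   # N2/N1 = (d1/d2)^1.5 * L2/L1
    resistance_ratio = length_ratio ** 2                           # R2/R1 = (L2/L1)^2
    return {
        "d2/d1": twine_ratio,
        "N2/N1": mesh_count_ratio,
        "R2/R1": resistance_ratio,
    }


# Hypothetical scales: model 1/16 the length of the prototype, meshes 1/4 the size.
ratios = model_net_ratios(1 / 16, 1 / 4)
```

With these sample scales the twine diameter is halved, the mesh count at the sides falls to about 0.177 of the prototype's, and the flow resistance falls to 1/256.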

K-means Clustering for Environmental Indicator Survey Data

  • Park, Hee-Chang; Cho, Kwang-Hyun
    • 한국데이터정보과학회:학술대회논문집 / 2005.04a / pp.185-192 / 2005
  • There are many data mining techniques, such as association rules, decision trees, neural network analysis, clustering, genetic algorithms, Bayesian networks, and memory-based reasoning. We analyze the 2003 Gyeongnam social indicator survey data using the k-means clustering technique to obtain environmental information. Clustering is the process of grouping data into clusters so that objects within a cluster are highly similar to one another. In this paper, we use k-means clustering from among several clustering techniques. K-means clustering is classified as a partitional clustering method. The k-means clustering outputs can be applied to environmental preservation and environmental improvement.
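The partitional procedure named in this abstract can be sketched with the basic Lloyd iteration: assign each point to its nearest centroid, recompute the centroids as cluster means, and repeat until nothing changes. The sample points and the explicitly supplied starting centroids are hypothetical (practical implementations usually pick the starting centroids randomly), and the survey data from the paper are not reproduced.

```python
import math


def kmeans(points, centroids, max_iter=100):
    """Basic k-means (Lloyd's algorithm) with caller-supplied initial centroids."""
    centroids = [tuple(c) for c in centroids]
    clusters = [[] for _ in centroids]
    for _ in range(max_iter):
        # Assignment step: each point joins the cluster of its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)), key=lambda k: math.dist(p, centroids[k]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster (keep it if empty).
        new_centroids = [
            tuple(sum(x) / len(c) for x in zip(*c)) if c else centroids[k]
            for k, c in enumerate(clusters)
        ]
        if new_centroids == centroids:
            break
        centroids = new_centroids
    return centroids, clusters


# Hypothetical 2-D data with two well-separated groups.
points = [(0, 0), (0, 2), (10, 10), (10, 12)]
final_centroids, final_clusters = kmeans(points, [(0, 0), (10, 10)])
```

Because every point ends up in exactly one cluster, the result is a partition of the data, which is why k-means is described as a partitional method.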
