• Title/Summary/Keyword: Similar Software Classification

Search Result 51, Processing Time 0.027 seconds

An Approach to Applying Multiple Linear Regression Models by Interlacing Data in Classifying Similar Software

  • Lim, Hyun-il
    • Journal of Information Processing Systems
    • /
    • v.18 no.2
    • /
    • pp.268-281
    • /
    • 2022
  • The development of information technology is bringing many changes to everyday life, and machine learning can be used as a technique to solve a wide range of real-world problems. Analysis and utilization of data are essential processes in applying machine learning to real-world problems. As a method of processing data in machine learning, we propose an approach based on applying multiple linear regression models by interlacing data to the task of classifying similar software. Linear regression is widely used in estimation problems to model the relationship between input and output data. In our approach, multiple linear regression models are generated by training on interlaced feature data. A combination of these multiple models is then used as the prediction model for classifying similar software. Experiments are performed to evaluate the proposed approach as compared to conventional linear regression, and the experimental results show that the proposed method classifies similar software more accurately than the conventional model. We anticipate the proposed approach to be applied to various kinds of classification problems to improve the accuracy of conventional linear regression.

Advanced Faceted Classification Scheme and Semantic Similarity Measure for Reuse of Software Components (소프트웨어 부품의 재사용을 위한 개선된 패싯 분류 방법과 의미 유사도 측정)

  • Gang, Mun-Seol
    • The Transactions of the Korea Information Processing Society
    • /
    • v.3 no.4
    • /
    • pp.855-865
    • /
    • 1996
  • In this paper, we propose a automation of the classification process for reusable software component and construction method of structured software components library. In order to efficient and automatic classification of software component, we decide the facets to represent characteristics of software component by acquiring semantic and syntactic information from software components descriptions in natural language, and compose the software component identifier or automatic extract terms corresponds to each facets. And then, in order to construct the structured software components library, we sore in the near location with software components of similar characteristic according to semantic similarity of the classified software components. As the result of applying proposed method, we can easily identify similar software components, the classification process of software components become simple, and the software components store in the structured software components library.

  • PDF

An Extended Faceted Classification Scheme and Hybrid Retrieval Model to Support Software Reuse (소프트웨어 재사용을 지원하는 확장된 패싯 분류 방식과 혼합형 검색 모델)

  • Gang, Mun-Seol;Kim, Byeong-Gi
    • The Transactions of the Korea Information Processing Society
    • /
    • v.1 no.1
    • /
    • pp.23-37
    • /
    • 1994
  • In this paper, we design and implement the prototype system, and propose the Extended Faceted Classification. Scheme and the Hybrid Retrieval Method that support classifying the software components, storing in library, and efficient retrieval according to user's request. In order to designs the classification scheme, we identify several necessary items by analyzing basic classes of software components that are to be classified. Then, we classify the items by their characteristics, decide the facets, and compose the component descriptors. According to their basic characteristics, we store software components in the library by clustering their application domains and are assign weights to the facets and its items to describe the component characteristics. In order to retrieve the software components, we use the retrieval-by-query model, and the weights and similarity for easy retrieval of similar software components. As the result of applying proposed classification scheme and retrieval model, we can easily identify similar components and the process of classification become simple. Also, the construction of queries becomes simple, the control of the size and order of the components to be retrieved possible, and the retrieval effectiveness is improved.

  • PDF

A Study for the Effective Classification and Retrieval of Software Component (효과적인 소프트웨어 컴포넌트 분류 및 검색에 관한 연구)

  • Cho, Byung-Ho
    • Journal of Internet Computing and Services
    • /
    • v.7 no.6
    • /
    • pp.1-10
    • /
    • 2006
  • A software development using components reuse is an useful method to reduce the software development cost. But a retrieval method by the keyword and category classifications is difficult to search an exact matching component due to components complexity in component reuse. Therefore, after different existing methods are examined and analyzed, an effective classification and retrieval method using XML specifications and the system architecture of components integrated management based on it are presented. Many discording elements of DTD which is component meta-expression exist in components retrieval. To compensate it, this retrieval method using estimations of precision and concision is effective one to catch considerable matching preference components. This method makes possible to retrieve suitable components having better priority due to searching similar matching components that are difficult in an existing keyword matching method.

  • PDF

The Application of RS and GIS Technologies on Landslide Information Extraction of ALOS Images in Yanbian Area, China

  • Quan, He Chun;Lee, Byung Gul
    • Journal of Korean Society for Geospatial Information Science
    • /
    • v.23 no.3
    • /
    • pp.85-93
    • /
    • 2015
  • This paper mainly introduces the methods of extracting landslide information using ALOS(Advanced Land Observing Satellite) images and GIS(Geographical Information System) technology. In this study, we classified images using three different methods which are the unsupervised the supervised and the PCA(Principal Components Analysis) for extracting landslide information based on characteristics of ALOS image. From the image classification results, we found out that the quality of classified image extracted with PCA supervised method was superior than the other images extracted with the other methods. But the accuracy of landslide information extracted from this image classification was still very low as the pixels were very similar between the landslide and safety regions. It means that it is really difficult to distinguish those areas with an image classification method alone because the values of pixels between the landslide and other areas were similar, particularly in a region where the landslide and other areas coexist. To solve this problem, we used the LSM(Landslide Susceptibility Map) created with ArcView software through weighted overlay GIS method in the areas. Finally, the developed LSM was applied to the image classification process using the ALOS images. The accuracy of the extracted landslide information was improved after adopting the PCA and LSM methods. Finally, we found that the landslide region in the study area can be calculated and the accuracy can also be improved with the LSM and PCA image classification methods using GIS tools.

A Study on Changes in Achievement Goals According to Course Classification in a Liberal Arts Software Education

  • Seung-Hun Shin;Joo-Young Seo
    • Journal of the Korea Society of Computer and Information
    • /
    • v.28 no.10
    • /
    • pp.301-311
    • /
    • 2023
  • In university liberal arts education, learners' achievement goals are an important research topic, and this also applies to liberal arts software education. In this paper, we analyzed changes in learning motivation of learners taking liberal arts software courses according to course classification using a 3 × 2 achievement goal model. The analysis was conducted on a discussion-oriented class taken together by learners receiving credits for different purposes, such as required and elective. As a result, it was confirmed that learners begin the semester with similar achievement goals. However, the avoidance goals of learners taking elective courses decreased, showing a significant difference at the end of the semester. It was a different result from the existing liberal arts software education studies that pointed to mandatory enrollment as the cause of lack of motivation to learn. In addition, it was confirmed that learners who take elective courses relatively focus on achievement rather than competition.

A Component storage Design Supporting formalization of Game Engine Development Process (게임엔진 개발 공정의 정형화를 지원하는 컴포넌트 저장소의 설계)

  • Song, Eui-Cheol
    • Journal of Korea Game Society
    • /
    • v.3 no.2
    • /
    • pp.35-41
    • /
    • 2003
  • There arose problems of double investment about the game engine part when a lot of game software similar to the property and procedure processed in the game engine develop new game without the reference or reuse in the other games. In particular, using various software development processes is one of main problems of double investment when the enterprises for the game software development develop games now Accordingly, because it does not make standardization of process about the game engine, it does not understand and reuse products created in process of the other software development process in development now. Accordingly, the newly analyzed and designed software was big problems with the present game software about the game engine process similar to the other game software when the enterprises for any game software develop a special game. For solving these problems, this study is to suggest the process improvement about the game engine development, analysis of structure and relation, classification and combination method by the class and module, implementation of storage, and processor model in order to apply the development method based on the component.

  • PDF

SCLC-Edge Detection Algorithm for Skin Cancer Classification (피부암 병변 분류를 위한 SCLC-Edge 검출 알고리즘)

  • June-Young Park;Chang-Min Kim;Roy C. Park
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.23 no.4
    • /
    • pp.256-263
    • /
    • 2022
  • Skin cancer is one of the most common diseases in the world, and the incidence rate in Korea has increased by about 100% over the past five years. In the United States, more than 5 million people are diagnosed with skin cancer every year. Skin cancer mainly occurs when skin tissue is damaged for a long time due to exposure to ultraviolet rays. Melanoma, a malignant tumor of skin cancer, is similar in appearance to Atypical melanocytic nevus occurring on the skin, making it difficult for the general public to be aware of it unless secondary signs occur. In this paper, we propose a skin cancer lesion edge detection algorithm and a deep learning model, CRNN, which performs skin cancer lesion classification for early detection and classification of these skin cancers. As a result of the experiment, when using the contour detection algorithm proposed in this paper, the classification accuracy was the highest at 97%. For the Canny algorithm, 78% was shown, 55% for Sobel, and 46% for Laplacian.

Estimating the Time to Fix Bugs Using Bug Reports (버그 리포트를 이용한 버그 정정 시간 추정)

  • Kwon, Kimun;Jin, Kwanghue;Lee, Byungjeong
    • Journal of KIISE
    • /
    • v.42 no.6
    • /
    • pp.755-763
    • /
    • 2015
  • As fixing bugs is a large part of software development and maintenance, estimating the time to fix bugs -bug fixing time- is extremely useful when planning software projects. Therefore, in this study, we propose a way to estimate bug fixing time using bug reports. First, we classify previous bug reports with meta fields by applying a k-NN method. Next, we compute the similarity of the new bug and previous bugs by using data from bug reports. Finally, we estimate how long it will take to fix the new bug using the time it took to repair similar bugs. In this study, we perform experiments with open source software. The results of these experiments show that our approach effectively estimates the bug fixing time.

Similar Question Search System for Q&A board of The National Institute of the Korean Language using Topic Classification (주제 분류를 활용한 국립국어원 질의응답 게시판 유사 질문 검색 시스템)

  • Mun, Jung-Min;Song, Yeong-Ho;Jin, Ji-Hwan;Lee, Hyun-Seob;Lee, Hyun-Ah
    • Annual Conference on Human and Language Technology
    • /
    • 2014.10a
    • /
    • pp.201-205
    • /
    • 2014
  • 국립국어원의 온라인 가나다 서비스는 한국어에 대한 다양한 질문과 정확한 답변을 제공한다. 만일 새롭게 등록되는 질문에 대해 유사한 질문을 자동으로 찾을 수 있다면, 질문자는 빠른 시간에 답변을 얻을 수 있고 서비스 관리자는 수동 답변 작성의 부담을 덜 수 있다. 본 논문에서는 국립국어원 질의응답게시판의 특성을 분석하여 질문의 주제를 6가지로 분류하고, 주제 분류 정보와 벡터 유사도, 수열 유사도를 결합하여 유사한 질문을 검색하는 시스템을 제안한다. 평가에서는 본 논문에서 제시한 주제 분류 정보를 활용한 결과 1위 정답 검색 정확률이 향상되는 결과를 얻었다. 최종 실험에서는 MRR이 0.62, 정답이 1위, 5위내에 검색될 확률은 각각 54.2%, 78.2%를 보였다.

  • PDF