• Title/Summary/Keyword: Accuracy

Measuring Similarity of Android Applications Using Method Reference Frequency and Manifest Information (메소드 참조 빈도와 매니페스트 정보를 이용한 안드로이드 애플리케이션들의 유사도 측정)

  • Kim, Gyoosik;Hamedani, Masoud Reyhani;Cho, Seong-je;Kim, Seong Baeg
    • The Journal of Korean Institute of Next Generation Computing / v.13 no.3 / pp.15-25 / 2017
  • As the value and importance of software grow, software theft and piracy are becoming a much larger problem. Tackling this problem requires an accurate method for detecting software theft and piracy. In particular, although software theft is relatively easy for Android applications (apps), Android markets have not properly screened illegal apps. In this paper, we propose a method that effectively measures the similarity between Android apps to detect software theft at the executable file level. The proposed method uses static analysis of executable Android apps to extract method reference frequency and manifest information as the main features for similarity measurement. Each app is represented as an n-dimensional vector of these features, and cosine similarity is used as the similarity measure. We demonstrate the effectiveness of the proposed method by evaluating its accuracy against typical source code-based similarity measurement methods. In experiments on Android apps for which both source files and executable files are available, the similarity measured at the executable file level was almost equivalent to the existing well-known similarity measured at the source file level.
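The similarity measure named in this abstract can be illustrated with a minimal sketch. It assumes each app has already been reduced to a list of method references; the reference strings and the names `app_a_refs` and `app_b_refs` are hypothetical, and the abstract does not say how manifest features are combined with the frequencies, so they are left out here.

```python
import math
from collections import Counter


def cosine_similarity(freq_a, freq_b):
    """Cosine similarity between two sparse frequency vectors stored as dicts."""
    # Only keys present in both vectors contribute to the dot product.
    dot = sum(count * freq_b.get(name, 0) for name, count in freq_a.items())
    norm_a = math.sqrt(sum(c * c for c in freq_a.values()))
    norm_b = math.sqrt(sum(c * c for c in freq_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # convention chosen here for empty vectors
    return dot / (norm_a * norm_b)


# Hypothetical method references, as a static analyzer might list them.
app_a_refs = [
    "Landroid/util/Log;->d",
    "Landroid/util/Log;->d",
    "Ljava/lang/String;->length",
    "Landroid/app/Activity;->onCreate",
]
app_b_refs = [
    "Landroid/util/Log;->d",
    "Ljava/lang/String;->length",
    "Ljava/lang/String;->length",
    "Landroid/net/Uri;->parse",
]

app_a = Counter(app_a_refs)  # method reference -> frequency
app_b = Counter(app_b_refs)
print(round(cosine_similarity(app_a, app_b), 3))  # prints 0.667
```

Storing the vectors as sparse dictionaries avoids fixing the dimension n in advance; any method reference missing from one app simply counts as zero.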

Parameter search methodology of support vector machines for improving performance (속도 향상을 위한 서포트 벡터 머신의 파라미터 탐색 방법론)

  • Lee, Sung-Bo;Kim, Jae-young;Kim, Cheol-Hong;Kim, Jong-Myon
    • Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology / v.7 no.3 / pp.329-337 / 2017
  • This paper proposes a search method that explores the parameters C and σ of support vector machines (SVM) to improve performance while maintaining search accuracy. The traditional grid search method requires tremendous computation time because it searches all available combinations of C and σ values to find the combination that gives the best SVM performance. To address this issue, this paper proposes a deep search method that reduces computation time. In the first stage, it divides the C-σ accuracy metric into four regions, evaluates the median value of each region, and selects the point with the highest accuracy as the start point. In the second stage, the region around the selected start point is re-divided into four regions, and the point with the highest accuracy is assigned as the new search point. In the third stage, the eight points near the search point are explored and the one with the highest accuracy is assigned as the new search point; the corresponding points are then divided into four parts and the accuracy is calculated. In the last stage, the search continues until the accuracy at the search point is the highest among its neighboring points. If this condition is not satisfied, the procedure is repeated from the second stage with the input level value. Experimental results using normal and defective bearings show that the proposed deep search algorithm outperforms conventional algorithms in terms of performance and search time.
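The general idea of the staged search can be sketched as a quadrant refinement followed by an eight-neighbor local search over a grid of candidate (C, σ) indices. This is a simplified illustration, not the authors' exact four-stage procedure: the function names are hypothetical, and `toy_accuracy` is a stand-in for what would in practice be a cross-validated SVM accuracy at grid cell (i, j).

```python
def quadrant_search(score, lo, hi, depth=2):
    """Repeatedly split the index box into four quadrants and keep the one
    whose center point scores best. Returns the center of the final box."""
    (r0, c0), (r1, c1) = lo, hi

    def center(box):
        (a, b), (c, d) = box
        return ((a + c) // 2, (b + d) // 2)

    for _ in range(depth):
        rm, cm = (r0 + r1) // 2, (c0 + c1) // 2
        quads = [((r0, c0), (rm, cm)), ((r0, cm), (rm, c1)),
                 ((rm, c0), (r1, cm)), ((rm, cm), (r1, c1))]
        (r0, c0), (r1, c1) = max(quads, key=lambda q: score(*center(q)))
    return center(((r0, c0), (r1, c1)))


def neighbor_search(score, start, n_rows, n_cols):
    """Move to the best of the eight neighbors until none improves the score."""
    cur = start
    while True:
        r, c = cur
        neighbors = [(r + dr, c + dc)
                     for dr in (-1, 0, 1) for dc in (-1, 0, 1)
                     if (dr, dc) != (0, 0)
                     and 0 <= r + dr < n_rows and 0 <= c + dc < n_cols]
        best = max(neighbors, key=lambda p: score(*p))
        if score(*best) <= score(*cur):
            return cur  # current point beats or ties every neighbor
        cur = best


N = 16              # hypothetical 16 x 16 grid of (C, sigma) candidates
evaluated = set()   # distinct grid cells actually scored


def toy_accuracy(i, j):
    """Stand-in for SVM accuracy at grid cell (i, j); peaks at (11, 4)."""
    evaluated.add((i, j))
    return -((i - 11) ** 2 + (j - 4) ** 2)


start = quadrant_search(toy_accuracy, (0, 0), (N - 1, N - 1))
best_cell = neighbor_search(toy_accuracy, start, N, N)
print(best_cell, len(evaluated), "of", N * N, "cells evaluated")
```

On this single-peaked toy surface the search reaches the maximum after scoring only a fraction of the grid. On a surface with several local maxima, a search of this kind can stop at a suboptimal cell, which exhaustive grid search avoids at higher cost.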

Sea Ice Drift Tracking from SAR Images and GPS Tracker (SAR 영상과 GPS 추적기를 이용한 여름철 해빙 이동 궤적 추적)

  • Jeong-Won Park;Hyun-Cheol Kim;Minji Seo;Ji-Eun Park;Jinku Park
    • Korean Journal of Remote Sensing / v.39 no.3 / pp.257-268 / 2023
  • Sea ice plays an important role in Earth's climate by regulating the amount of solar energy absorbed and controlling the exchange of heat and material across the air-sea interface. Its growth, drift, and melting are monitored regularly by satellite observations. However, low-resolution products from passive microwave radiometers have reduced accuracy from summer to autumn, when the ice surface changes rapidly. Synthetic aperture radar (SAR) observations are emerging as a powerful complement, but previous research has mainly focused on winter ice. In this study, sea ice drift tracking was evaluated and analyzed using SAR images and a global positioning system (GPS) tracker during the late summer to early autumn period, when ice surface conditions change substantially. The results showed that observational uncertainty increases compared to the winter period; nevertheless, the correlation coefficient with GPS measurements was excellent at 0.98, and the performance of the ice tracking algorithm was proportional to the sea ice concentration, with a correlation coefficient of 0.59 for ice concentrations above 50%.

Corporate Bankruptcy Prediction Model using Explainable AI-based Feature Selection (설명가능 AI 기반의 변수선정을 이용한 기업부실예측모형)

  • Gundoo Moon;Kyoung-jae Kim
    • Journal of Intelligence and Information Systems / v.29 no.2 / pp.241-265 / 2023
  • A corporate insolvency prediction model serves as a vital tool for objectively monitoring the financial condition of companies. It enables timely warnings, facilitates responsive actions, and supports the formulation of effective management strategies to mitigate bankruptcy risks and enhance performance. Investors and financial institutions use default prediction models to minimize financial losses. As interest in applying artificial intelligence (AI) technology to corporate insolvency prediction grows, extensive research has been conducted in this domain. However, there is an increasing demand for explainable AI models in corporate insolvency prediction that emphasize interpretability and reliability. The SHAP (SHapley Additive exPlanations) technique has gained significant popularity and has demonstrated strong performance in various applications. Nonetheless, it has limitations in computational cost, processing time, and scalability as the number of variables grows. This study introduces a novel approach to variable selection that reduces the number of variables by averaging SHAP values from bootstrapped data subsets instead of using the entire dataset. This technique aims to improve computational efficiency while maintaining excellent predictive performance. To obtain classification results, random forest, XGBoost, and C5.0 models are trained on the selected variables, which have high interpretability. The classification accuracy of the ensemble model, generated through soft voting with the goal of a high-performance model design, is compared with that of the individual models. The study uses data from 1,698 Korean light industrial companies and employs bootstrapping to create distinct data groups. Logistic regression is used to calculate SHAP values for each data group, and these are averaged to derive the final SHAP values. The proposed model enhances interpretability and aims to achieve superior predictive performance.
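Soft voting, the ensemble step mentioned in this abstract, averages the class probabilities predicted by each model and picks the class with the highest average. The sketch below assumes the three models have already produced probability arrays; the numbers and the variable names `rf_probs`, `xgb_probs`, and `c50_probs` are made up for illustration and are not results from the paper.

```python
import numpy as np


def soft_vote(prob_list, weights=None):
    """Average per-model class probabilities and return (averages, labels).

    prob_list: sequence of arrays, each of shape (n_samples, n_classes).
    weights:   optional per-model weights; equal weighting if None.
    """
    probs = np.asarray(prob_list, dtype=float)  # (n_models, n_samples, n_classes)
    avg = np.average(probs, axis=0, weights=weights)
    return avg, avg.argmax(axis=1)


# Hypothetical predicted probabilities for two companies,
# with columns [P(healthy), P(insolvent)].
rf_probs = [[0.6, 0.4], [0.3, 0.7]]
xgb_probs = [[0.4, 0.6], [0.2, 0.8]]
c50_probs = [[0.7, 0.3], [0.6, 0.4]]

avg, labels = soft_vote([rf_probs, xgb_probs, c50_probs])
print(avg.round(3))
print(labels)  # prints [0 1]
```

In the first row the second model alone would predict class 1, but the averaged probability favors class 0; unlike majority voting on hard labels, soft voting lets confident models carry more influence.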

Toward understanding learning patterns in an open online learning platform using process mining (프로세스 마이닝을 활용한 온라인 교육 오픈 플랫폼 내 학습 패턴 분석 방법 개발)

  • Taeyoung Kim;Hyomin Kim;Minsu Cho
    • Journal of Intelligence and Information Systems / v.29 no.2 / pp.285-301 / 2023
  • Due to the increasing demand for and importance of non-face-to-face education, open online learning platforms are attracting interest both domestically and internationally. These platforms exhibit different characteristics from online courses offered by universities and other educational institutions. In particular, students on these platforms have greater learner autonomy, so tools that assist learning need to be developed. Researchers have long attempted to use process mining to understand realistic study behaviors and derive learning patterns. However, its application to open online learning platforms has been insufficient. Moreover, existing research has primarily focused on the process model perspective, including process model discovery, and lacks a method for the process pattern and instance perspectives. In this study, we propose a method to identify learning patterns within an open online learning platform using process mining techniques. To achieve this, we suggest three viewpoints for comprehending the learning patterns, namely model-level, variant-level, and instance-level, and employ various techniques such as process discovery, conformance checking, autoencoder-based clustering, and predictive approaches. To validate this method, we collected a learning log of machine learning-related courses on a domestic open education platform. The results revealed a spaghetti-like process model that can be differentiated into a standard learning pattern and three abnormal patterns. Furthermore, the derived pattern classification model achieved a high accuracy of 0.86 when predicting the pattern of instances based on the initial 30% of the entire flow. This study contributes to the systematic analysis of learners' patterns using process mining.

Development of an IMU-based Wearable Ankle Device for Military Motion Recognition (군사 동작 인식을 위한 IMU 기반 발목형 웨어러블 디바이스 개발)

  • Byeongjun Jang;Jeonghoun Cho;Dohyeon Kim;Kyeong-Won Park
    • Journal of Intelligence and Information Systems / v.29 no.2 / pp.23-34 / 2023
  • Wearable technology for military applications has received considerable attention as a means of checking and monitoring personal status. In particular, recognizing specific human motion states is promising in that it allows active management of troops by immediately collecting the operational and movement status of individual soldiers. In this study, as an extension of military wearable application research, a new ankle-worn wearable device is proposed that can capture which action a soldier takes in which environment on the battlefield. In the presumed situation, the soldier's upper limbs are easily exposed to situational uncertainties. Therefore, a sensing module is attached to the soldier's ankle, which always interacts with the ground. The obtained data comprise 3-axis accelerations and 3-axis rotational velocities, which cannot be interpreted by hand-crafted algorithms. To discern human behavioral characteristics from these dynamic data, a data-driven model is introduced; four features extracted from sliced data (minimum, maximum, mean, and standard deviation) are used as the model input to learn and classify eight primary military movements (Sitting, Standing, Walking, Running, Ascending, Descending, Low Crawl, and High Crawl). As a result, the proposed device recognized the movement status of a soldier with 95.16% accuracy in an arbitrary test situation. This research is meaningful because it introduces an effective method of motion recognition that can be further extended to various military applications by combining wearable technology and artificial intelligence.
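The feature extraction described in this abstract (minimum, maximum, mean, and standard deviation over sliced data) can be sketched as follows. The window length, step size, and the random stand-in signal are assumptions made for illustration; the abstract does not state how the data were sliced.

```python
import numpy as np


def window_features(signal, window, step):
    """Slice a (n_samples, n_channels) signal into windows and compute
    min, max, mean, and standard deviation per channel for each window.

    Returns an array of shape (n_windows, 4 * n_channels).
    """
    rows = []
    for start in range(0, signal.shape[0] - window + 1, step):
        seg = signal[start:start + window]
        rows.append(np.concatenate([
            seg.min(axis=0), seg.max(axis=0),
            seg.mean(axis=0), seg.std(axis=0),
        ]))
    return np.array(rows)


# Stand-in for IMU data: 500 samples of 3-axis acceleration
# plus 3-axis rotational velocity (6 channels), random values.
rng = np.random.default_rng(0)
imu = rng.normal(size=(500, 6))

# Hypothetical slicing: 100-sample windows with 50% overlap.
X = window_features(imu, window=100, step=50)
print(X.shape)  # prints (9, 24)
```

Each row of `X` is one feature vector with 24 values (4 statistics times 6 channels), which is the kind of input a classifier for the eight movement classes would receive.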

Predicting Future ESG Performance using Past Corporate Financial Information: Application of Deep Neural Networks (심층신경망을 활용한 데이터 기반 ESG 성과 예측에 관한 연구: 기업 재무 정보를 중심으로)

  • Min-Seung Kim;Seung-Hwan Moon;Sungwon Choi
    • Journal of Intelligence and Information Systems / v.29 no.2 / pp.85-100 / 2023
  • Corporate ESG (environmental, social, and corporate governance) performance, which reflects a company's strategic sustainability, has emerged as one of the main factors in today's investment decisions. The traditional ESG performance rating process is largely qualitative and subjective, based on institution-specific criteria, which limits its reliability, predictability, and timeliness for investment decisions. This study attempted to predict corporate ESG ratings through automated machine learning based on quantitative, disclosed corporate financial information. Using 12 types (21,360 cases) of market-disclosed financial information and 1,780 ESG measures available through the Korea Institute of Corporate Governance and Sustainability from 2019 to 2021, we propose a deep neural network prediction model. Our model achieved a classification accuracy of about 86% in predicting ESG ratings, outperforming the other models compared. This study contributes to the literature in that the model achieved relatively accurate ESG rating predictions through an automated process using quantitative and publicly available corporate financial information. In terms of practical implications, general investors can benefit from the prediction accuracy and time efficiency of the proposed model at nominal cost. In addition, this study can be extended in the future by accumulating more Korean and international data and by developing a more robust and complex model.

Algorithm for Correcting Error in Smart Card Data Using Bus Information System Data (버스정보시스템 데이터를 활용한 교통카드 정류장 정보 오류 보정 알고리즘)

  • Hye Inn Song;Hwa Jeong Tak;Kang Won Shin;Sang Hoon Son
    • The Journal of The Korea Institute of Intelligent Transport Systems / v.22 no.3 / pp.131-146 / 2023
  • Smart card data is widely used in the public transportation field. Although various errors inevitably occur during data collection and storage, smart card data errors have not been extensively studied. This paper investigates inherent errors in boarding and alighting station information in smart card data. A comparison of smart card data and bus boarding and alighting survey data for the same time frame shows that boarding station names differ by 6.2% between the two data sets. This indicates that the error rate of smart card data is 6.2% in terms of boarding station information, given that the bus boarding and alighting survey data can be considered ground truth. This paper proposes a 6-step algorithm for correcting errors in smart card boarding station information by linking them to the corresponding information in Bus Information System (BIS) data. A comparison of BIS data and bus boarding and alighting survey data for the same time frame reveals that boarding station names match in 98.3% of cases between the two data sets, indicating that BIS data can be used as a reliable reference for ground truth. To evaluate its performance, the proposed 6-step algorithm was applied to a smart card data set; the error rate of boarding station information was reduced from 6.2% to 1.0%, a 5.2%p improvement in the accuracy of smart card data. The proposed algorithm is expected to enhance the process of adjusting bus routes and making decisions related to public transportation infrastructure investments.
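The reported figures distinguish a percentage-point change from a relative change, which a short calculation makes explicit. In the sketch below, the station names and the helper `mismatch_rate` are hypothetical; only the 6.2% and 1.0% error rates come from the abstract.

```python
def mismatch_rate(recorded, reference):
    """Share of records whose station name differs from the reference."""
    if len(recorded) != len(reference) or not reference:
        raise ValueError("inputs must be non-empty and of equal length")
    mismatches = sum(a != b for a, b in zip(recorded, reference))
    return mismatches / len(reference)


# Hypothetical boarding records: smart card data vs. survey ground truth.
card = ["City Hall", "Station A", "Market", "Station B", "Library"]
survey = ["City Hall", "Station A", "Market", "Station C", "Library"]
toy_rate = mismatch_rate(card, survey)  # 1 of 5 records differs

# Error rates reported in the abstract.
before, after = 0.062, 0.010
point_gain = (before - after) * 100              # percentage points
relative_reduction = (before - after) / before   # share of errors removed

print(f"{point_gain:.1f} percentage points, "
      f"{relative_reduction:.1%} relative reduction")
# prints: 5.2 percentage points, 83.9% relative reduction
```

The 5.2%p figure is the absolute drop in the error rate; relative to the original 6.2%, it corresponds to removing roughly 84% of the boarding station errors.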

Biomechanical Research Trends for Alpine Ski Analysis (알파인 스키 분석을 위한 운동역학 연구 동향)

  • Lee, Jusung;Moon, Jeheon;Kim, Jinhae;Hwang, Jinny;Kim, Hyeyoung
    • 한국체육학회지인문사회과학편 / v.57 no.6 / pp.293-308 / 2018
  • This study investigated current trends in skiing-related research in the existing literature in the fields of kinematics, measurement sensors, and computer simulation. In the field of kinematics, research is being conducted on the mechanism of the ski turn, posture analysis according to the grade and skill level of skiers, the friction force between ski and snow, and air resistance. In the field of measurement sensors and computer simulation, research is being conducted to develop equipment using IMU sensors and GPS. The results of this study are as follows. First, to go beyond the limits of existing kinematic analysis, it is necessary to develop measurement equipment that can analyze the entire skiing area and can be deployed easily in the field. Second, research on the accuracy of information obtained from measurement sensors, and on various analysis techniques based on these measurements, should be carried out continuously to provide data that is useful in the field. Third, it is necessary to use computer simulation methods to clarify injury mechanisms and discover ways to prevent skiing-related injuries. Fourth, it is necessary to provide an optimized ski trajectory algorithm by developing a 3D ski model using computer simulation and comparing it with actual skiing data.

Digital Twin-based Cadastral Resurvey Performance Sharing Platform Design and Implementation (디지털트윈 기반의 지적재조사 성과공유 플랫폼 설계 및 구현)

  • Kim, IL
    • Journal of Cadastre & Land InformatiX / v.53 no.1 / pp.37-46 / 2023
  • As real estate values rise, interest in cadastral resurvey is increasing. Accordingly, cadastral resurvey projects are actively using drones to secure work efficiency and improve accuracy. The need to use and manage cadastral resurvey results (drone images) is also growing. This study proposes building and using a 3D spatial information platform that overcomes the existing limitations in managing and using drone images and provides drone image-based 3D cadastral information. The study area was a district that had completed the latest cadastral resurvey project when the study was organized in February 2023. A web-based 3D platform was then adopted to remove spatial constraints on users, and the platform was designed and implemented based on drone images, spatial information, and attribute information. Major functions were implemented, such as visualization of cadastral resurvey results based on 3D information and comparison of results between the previous and final cadastral maps. Through the open platform established in this study, anyone can easily use the cadastral resurvey results, and the platform is expected to support systematic use and sharing of cadastral resurvey results based on 3D information that reflects the actual project district. In addition, a continuous management plan was proposed that integrates the distributed results into one platform. The usability of the 3D platform is expected to improve further if a platform is established for the whole country and a service linked to the cadastral resurvey administrative system is provided.