• Title/Summary/Keyword: Ranking Data


Determining Contingency Ranking Using the Probabilistic Method of the Power System (확률적 방법을 이용한 전력계통의 상정사고 순위 결정)

  • Kim, Kyoung-Young;Lee, Seung-Hyuk;Kim, Jin-O;Kim, Tae-Kyun
    • Proceedings of the KIEE Conference / 2003.07a / pp.113-115 / 2003
  • The electric power industry throughout the world is undergoing considerable change, from the vertically integrated utility structure to the deregulated market. Because the deregulated electricity market is operated on the principle of economic efficiency, the system operator requires fast contingency ranking data to keep the bulk power system secure. This paper presents a fast calculation method for determining contingency ranking using the weather-dependent probabilistic risk index (PRI). The probabilistic risk index is classified by weather condition into normal weather and adverse weather. The proposed method uses this index to determine the contingency ranking required for security in the deregulated electricity market.

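
The abstract above does not give the PRI formula, so the following is only a minimal sketch under a common risk convention (probability times consequence), weighted over the two weather states the abstract names. The field names `p_outage_normal`, `p_outage_adverse`, and `severity`, and all numbers, are hypothetical.

```python
def probabilistic_risk_index(contingency, p_adverse):
    """Assumed form: weather-weighted outage probability times severity."""
    p_normal = 1.0 - p_adverse
    outage_probability = (
        p_normal * contingency["p_outage_normal"]
        + p_adverse * contingency["p_outage_adverse"]
    )
    return outage_probability * contingency["severity"]


def rank_contingencies(contingencies, p_adverse):
    """Return contingency names ordered from highest to lowest assumed PRI."""
    return sorted(
        contingencies,
        key=lambda name: probabilistic_risk_index(contingencies[name], p_adverse),
        reverse=True,
    )


# Hypothetical data: two line outages with different weather sensitivity.
contingencies = {
    "line_A": {"p_outage_normal": 0.01, "p_outage_adverse": 0.10, "severity": 50.0},
    "line_B": {"p_outage_normal": 0.02, "p_outage_adverse": 0.03, "severity": 80.0},
}

mostly_normal = rank_contingencies(contingencies, p_adverse=0.2)
mostly_adverse = rank_contingencies(contingencies, p_adverse=0.5)
```

With these made-up numbers the ranking flips between the two weather mixes, which is the kind of weather dependence the abstract describes.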

Analysis of Correlation between Real-time Sales Ranking and Information Provided by Mobile Movie Platform: Focus on Non-descriptive Information in Google Play Store's Best-selling Movies

  • Nam, Sangzo
    • Journal of Advanced Information Technology and Convergence / v.9 no.2 / pp.41-54 / 2019
  • The cinema circuit is facing a digital, network, and mobile age, which expands non-theater accessibility to movies. Application platforms are positioned as the most competitive business model for providing digital content such as games, music, books, and movies. Consumers can acquire content-related information not just offline, but online as well. Therefore, item information provided by application platforms is required. The information provided by application platforms consists of richly descriptive information such as storyline summary, consumer reviews, and related articles, while non-descriptive normative information covers data such as sales ranking, release date, genre, rental or purchase cost, domestic/foreign classification, consumer rating, number of consumer ratings, film rating, and so on. In this study, we surveyed and statistically analyzed the correlation between real-time sales ranking and other comparable non-descriptive information.
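
The abstract does not say which correlation statistic was used. As one plausible illustration for ranking data, the sketch below computes Spearman's rank correlation between sales rank and another non-descriptive field; the choice of Spearman and the sample values are assumptions.

```python
from statistics import mean


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the rank-transformed data."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical sample: sales rank (1 = best seller) vs. number of consumer ratings.
sales_rank = [1, 2, 3, 4, 5]
num_ratings = [900, 700, 750, 300, 100]
rho = spearman(sales_rank, num_ratings)
```

A strongly negative `rho` here would mean that better-selling titles (smaller rank numbers) tend to have more ratings.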

Derivation of Digital Music's Ranking Change Through Time Series Clustering (시계열 군집분석을 통한 디지털 음원의 순위 변화 패턴 분류)

  • Yoo, In-Jin;Park, Do-Hyung
    • Journal of Intelligence and Information Systems / v.26 no.3 / pp.171-191 / 2020
  • This study focused on digital music, which is the most valuable cultural asset in modern society and occupies a particularly important position in the flow of the Korean Wave. Digital music was collected based on the "Gaon Chart," a well-established music chart in Korea. Through this, the ranking changes of the music that entered the chart over 73 weeks were collected. Afterwards, patterns with similar characteristics were derived through time series cluster analysis. Then, a descriptive analysis was performed on the notable features of each pattern. The research process suggested by this study is as follows. First, in the data collection process, time series data was collected to check the ranking change of digital music. Subsequently, in the data processing stage, the collected data was matched with the rankings over time, and the music title and artist name were processed. Each analysis is then performed sequentially in two stages, exploratory analysis and explanatory analysis. The data collection period was limited to the period before 'the music bulk buying phenomenon', a reliability issue related to music ranking in Korea. Specifically, it spans 73 weeks, from the first week (December 31, 2017 to January 6, 2018) to the last week (May 19, 2019 to May 25, 2019). The analysis targets were limited to digital music released in Korea. In particular, digital music was collected based on the "Gaon Chart", a well-known music chart in Korea. Unlike the private music charts serviced in Korea, the Gaon Chart is approved by government agencies and has basic reliability. Therefore, it can be considered to have more public confidence than the ranking information provided by other services. The contents of the collected data are as follows. 
Data on the period and ranking, the name of the music, the name of the artist, the name of the album, the Gaon index, the production company, and the distribution company were collected for the music that entered the top 100 on the music chart within the collection period. Through data collection, 7,300 top-100 chart entries were identified over the 73 weeks. Because digital music frequently stays on the chart for more than two weeks, duplicates were removed during pre-processing. The number and location of the duplicated songs were checked with a duplicate check function, and the duplicates were then deleted to form the data for analysis. Through this, a list of 742 unique songs was secured from the 7,300 entries. In addition, a total of 16 patterns were derived through time series cluster analysis on the ranking change. From these patterns, two representative patterns were identified: 'Steady Seller' and 'One-Hit Wonder'. Furthermore, the two patterns were subdivided into five patterns in consideration of the survival period of the music and the music ranking. The important characteristics of each pattern are as follows. First, the artist's superstar effect and bandwagon effect were strong in the one-hit wonder pattern. Therefore, when consumers choose digital music, they are strongly influenced by the superstar effect and the bandwagon effect. Second, through the Steady Seller pattern, we confirmed the music that has been chosen by consumers for a very long time. In addition, we checked the patterns of the most selected music through consumer needs. Contrary to popular belief, the 'steady seller: mid-term' pattern, not the one-hit wonder pattern, received the most choices from consumers. 
Particularly noteworthy is that the 'Climbing the Chart' phenomenon, which is contrary to the existing pattern, was confirmed through the steady-seller pattern. This study focuses on the change in the ranking of music over time, a topic that has been relatively neglected in research on digital music. In addition, a new approach to music research was attempted by subdividing the pattern of ranking change rather than predicting the success and ranking of music.
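
The pipeline described above (weekly chart rows, de-duplication into unique songs, then clustering of rank trajectories) can be sketched roughly as follows. The abstract does not name the clustering algorithm or distance measure, so plain k-means with Euclidean distance is used here as a stand-in; the `OFF_CHART` placeholder, the seeding rule, and the sample rows are all assumptions.

```python
from collections import defaultdict

OFF_CHART = 101  # assumed placeholder rank for weeks outside the top 100


def build_trajectories(rows, n_weeks):
    """Collapse weekly rows (week, title, artist, rank) into one series per song."""
    series = defaultdict(lambda: [OFF_CHART] * n_weeks)
    for week, title, artist, rank in rows:
        series[(title, artist)][week] = rank
    return dict(series)


def distance(a, b):
    """Euclidean distance between two equal-length rank series."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5


def kmeans(series, k, iterations=20):
    """Plain k-means; the first k series seed the centroids (deterministic)."""
    centroids = [list(s) for s in series[:k]]
    labels = [0] * len(series)
    for _ in range(iterations):
        labels = [
            min(range(k), key=lambda c: distance(s, centroids[c])) for s in series
        ]
        for c in range(k):
            members = [s for s, lab in zip(series, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


# Hypothetical four-week chart: two long-lived songs, two short-lived ones.
rows = [
    (0, "Song A", "Artist 1", 5), (1, "Song A", "Artist 1", 6),
    (2, "Song A", "Artist 1", 7), (3, "Song A", "Artist 1", 8),
    (0, "Song B", "Artist 2", 1), (1, "Song B", "Artist 2", 60),
    (0, "Song C", "Artist 3", 10), (1, "Song C", "Artist 3", 9),
    (2, "Song C", "Artist 3", 9), (3, "Song C", "Artist 3", 11),
    (0, "Song D", "Artist 4", 3), (1, "Song D", "Artist 4", 80),
]

trajectories = build_trajectories(rows, n_weeks=4)
labels = kmeans(list(trajectories.values()), k=2)
```

Keying the trajectories by `(title, artist)` is what removes the week-to-week duplicates: twelve chart rows collapse into four songs before clustering.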

Analysis of cycle racing ranking using statistical prediction models (통계적 예측모형을 활용한 경륜 경기 순위 분석)

  • Park, Gahee;Park, Rira;Song, Jongwoo
    • The Korean Journal of Applied Statistics / v.30 no.1 / pp.25-39 / 2017
  • Over 5 million people participate in cycle racing betting and its revenue is more than 2 trillion won. This study predicts the ranking of cycle racing using various statistical analyses and identifies important variables which have influence on ranking. We propose competitive ranking prediction models using various classification and regression methods. Our model can predict rankings with low misclassification rates most of the time. We found that the ranking increases as the grade of a racer decreases and as overall scores increase. Inversely, we can observe that the ranking decreases when the grade of a racer increases, race number four is given, and the ranking of the last race of a racer decreases. We also found that prediction accuracy can be improved when we use centered data per race instead of raw data. However, the real profit from the future data was not high when we applied our prediction model because our model can predict only low-return events well.
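
The abstract reports that centering data per race improved prediction accuracy. A minimal sketch of that preprocessing step, assuming it means subtracting each race's mean from a racer's feature value; the field names `race_id` and `score` are hypothetical.

```python
from collections import defaultdict
from statistics import mean


def center_per_race(records, feature):
    """Subtract the per-race mean of `feature` from every record in that race."""
    by_race = defaultdict(list)
    for record in records:
        by_race[record["race_id"]].append(record[feature])
    race_mean = {race: mean(values) for race, values in by_race.items()}
    return [
        {**record, feature: record[feature] - race_mean[record["race_id"]]}
        for record in records
    ]


# Hypothetical raw data: overall scores of racers in two races.
records = [
    {"race_id": 1, "racer": "a", "score": 90.0},
    {"race_id": 1, "racer": "b", "score": 80.0},
    {"race_id": 1, "racer": "c", "score": 70.0},
    {"race_id": 2, "racer": "d", "score": 60.0},
    {"race_id": 2, "racer": "e", "score": 50.0},
]

centered = center_per_race(records, "score")
centered_scores = [r["score"] for r in centered]
```

After centering, a value expresses how a racer compares with the others in the same race rather than with the whole population, which is what a within-race ranking depends on.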

Seismic vulnerability of reinforced concrete structures using machine learning

  • Ioannis Karampinis;Lazaros Iliadis
    • Earthquakes and Structures / v.27 no.2 / pp.83-95 / 2024
  • The prediction of the seismic behavior of the existing building stock is one of the most impactful and complex problems faced by countries with frequent and intense seismic activity. Human lives can be threatened or lost, economic life is disrupted, and large monetary reparations may be required. However, authorities at a regional or national level have limited resources to allocate to preventative measures. To allocate them effectively, it is essential to be able to rank a given population of structures according to their expected degree of damage in an earthquake. In this paper, the authors present a ranking approach based on Machine Learning (ML) algorithms for pairwise comparisons, coupled with ad hoc ranking rules. The case study employed data from 404 reinforced concrete structures with various degrees of damage from the Athens 1999 earthquake. The two main components of our experiments pertain to the performance of the ML models and the success of the overall ranking process. The former was evaluated using the well-known metrics of Precision, Recall, F1-score, Accuracy and Area Under Curve (AUC). The performance of the overall ranking was evaluated using Kendall's tau distance and by viewing the problem as a classification into bins. The obtained results were promising and were shown to outperform currently employed engineering practices. This demonstrated the capabilities and potential of these models in identifying the most vulnerable structures and, thus, mitigating the effects of earthquakes on society.
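
To illustrate the two ideas named in the abstract, the sketch below turns pairwise comparisons into a ranking and scores it with Kendall's tau distance (the number of item pairs ordered differently by two rankings). The win-counting rule is an assumed stand-in for the paper's ad hoc ranking rules, and the `more_damaged` comparator is a hypothetical substitute for a trained pairwise classifier.

```python
from itertools import combinations


def rank_by_wins(items, more_damaged):
    """Order items by how many pairwise comparisons each one wins."""
    wins = {item: 0 for item in items}
    for a, b in combinations(items, 2):
        if more_damaged(a, b):
            wins[a] += 1
        else:
            wins[b] += 1
    return sorted(items, key=lambda item: -wins[item])


def kendall_tau_distance(order_a, order_b):
    """Count the item pairs that the two orderings rank in opposite order."""
    pos_a = {item: i for i, item in enumerate(order_a)}
    pos_b = {item: i for i, item in enumerate(order_b)}
    return sum(
        1
        for x, y in combinations(order_a, 2)
        if (pos_a[x] - pos_a[y]) * (pos_b[x] - pos_b[y]) < 0
    )


# Hypothetical damage scores standing in for a pairwise ML model's judgment.
predicted_damage = {"s1": 0.2, "s2": 0.9, "s3": 0.5}
predicted_order = rank_by_wins(
    list(predicted_damage),
    lambda a, b: predicted_damage[a] > predicted_damage[b],
)

# Hypothetical ground-truth ordering, most damaged first.
observed_order = ["s2", "s1", "s3"]
tau_distance = kendall_tau_distance(predicted_order, observed_order)
```

A distance of 0 means the two rankings agree on every pair; the maximum for n items is n(n-1)/2.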

An Experimental Study on Feature Ranking Schemes for Text Classification (텍스트 분류를 위한 자질 순위화 기법에 관한 연구)

  • Pan Jun Kim
    • Journal of the Korean Society for Information Management / v.40 no.1 / pp.1-21 / 2023
  • This study reviewed the performance of ranking schemes as an efficient feature selection method for text classification. Until now, feature ranking schemes have mostly been based on document frequency, and relatively few have used term frequency. Therefore, the performance of single ranking metrics using term frequency and document frequency individually was examined as a feature selection method for text classification, and then the performance of combination ranking schemes using both was reviewed. Specifically, a classification experiment was conducted using two data sets (Reuters-21578, 20NG) and five classifiers (SVM, NB, ROC, TRA, RNN), and 5-fold cross-validation and t-tests were applied to secure the reliability of the results. As a result, among single ranking schemes, the document frequency-based metric (chi) showed good performance overall. In addition, no significant difference was found between the highest-performing single ranking scheme and the combination ranking schemes. Therefore, in an environment where sufficient training documents can be secured for text classification, it is more efficient to use a single ranking metric (chi) based on document frequency as a feature selection method.
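
As a rough illustration of a document frequency-based chi-square ranking, the sketch below scores each term against one target class from a 2x2 table of document counts and sorts terms by score. It uses the standard chi-square statistic for a 2x2 contingency table; whether the paper's "chi" metric matches this exact form is an assumption, and the tiny corpus is made up.

```python
def chi_square(docs, labels, term, target):
    """Chi-square of term vs. class from document counts (2x2 contingency table)."""
    a = b = c = d = 0
    for tokens, label in zip(docs, labels):
        has_term = term in tokens
        in_class = label == target
        if has_term and in_class:
            a += 1  # term present, target class
        elif has_term:
            b += 1  # term present, other class
        elif in_class:
            c += 1  # term absent, target class
        else:
            d += 1  # term absent, other class
    n = a + b + c + d
    denom = (a + c) * (b + d) * (a + b) * (c + d)
    return n * (a * d - c * b) ** 2 / denom if denom else 0.0


def rank_terms(docs, labels, target):
    """Rank vocabulary terms by chi-square, highest first (ties by term name)."""
    vocabulary = set().union(*docs)
    scores = {t: chi_square(docs, labels, t, target) for t in vocabulary}
    return sorted(vocabulary, key=lambda t: (-scores[t], t))


# Hypothetical corpus: each document is a set of tokens.
docs = [
    {"oil", "price"},
    {"oil", "export"},
    {"game", "score"},
    {"game", "price"},
]
labels = ["energy", "energy", "sport", "sport"]

ranked = rank_terms(docs, labels, target="energy")
```

Feature selection then keeps only the top-ranked terms. Note that the score is symmetric: a term that perfectly signals the other class ("game") ranks as high as one that perfectly signals the target class ("oil").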

How Role Overload Affects Physical and Psychological Health of Low-ranking Government Employees at Different Ages: The Mediating Role of Burnout

  • Huang, Qing;Wang, Yidan;Yuan, Ke;Liu, Huaxing
    • Safety and Health at Work / v.13 no.2 / pp.207-212 / 2022
  • Background: The public now imposes higher demands on the government than in the past, which has created the role overload faced by low-ranking government employees in China. This research investigates the relationship between role overload and health among low-ranking government employees and explores the mediating effects of burnout. Methods: It draws on a survey of 2064 low-ranking government employees by probability proportionate to size sampling in China's Shandong Province. Structural equation modeling (SEM) methods are used to analyze the data. Results: Both role overload and burnout were found to have negative effects on low-ranking government employees' health; however, the associations varied among the three age groups (less than 36, between 36 and 45, and over 45). Those over 45 reported the highest level of both physical and psychological health, while the youngest age group (less than 36) reported the lowest level of health. Role overload has a direct influence on health among government employees over 45 but not among those below 45. Burnout's mediating effects between role overload and health are significant among all age groups, but most significant among the youngest civil servants below 36. Conclusions: The findings evidenced that both role overload and burnout affect low-ranking government employees' self-reported physical and psychological health. In addition, the effect of age differences in coping with role stressors and burnout should be considered.

Retrieval Model using Subject Classification Table, User Profile, and LSI (전공분류표, 사용자 프로파일, LSI를 이용한 검색 모델)

  • Woo Seon-Mi
    • The KIPS Transactions:PartD / v.12D no.5 s.101 / pp.789-796 / 2005
  • Because existing information retrieval systems, in particular library retrieval systems, use 'exact keyword matching' against the user's query, they present the user with massive results that include irrelevant information. The user therefore spends extra effort and time extracting the relevant information from the results. This paper proposes SULRM, a retrieval model using a Subject Classification Table, User profile, and LSI (Latent Semantic Indexing), to provide more relevant results. Within the results of keyword-based retrieval, SULRM applies a document filtering technique to classified data and a document ranking technique to non-classified data. The filtering technique uses the Subject Classification Table, and the ranking technique uses the user profile and LSI. We performed experiments on the performance of the filtering technique, the user profile updating method, and the document ranking technique, using the retrieval results of our university's digital library system. When many documents are retrieved, the proposed techniques can provide the user with data filtered and ranked according to the user's subject and preferences.

Development of a Ranking System for Tourist Destination Using BERT-based Semantic Search (BERT 기반 의미론적 검색을 활용한 관광지 순위 시스템 개발)

  • KangWoo Lee;MyeongSeon Kim;Soon Goo Hong;SuGyeong Roh
    • Journal of Korea Society of Industrial Information Systems / v.29 no.4 / pp.91-103 / 2024
  • A tourist destination ranking system was designed that employs semantic search to extract information with reasonable accuracy. To this end, the process involves collecting data, preprocessing text reviews of tourist spots, and embedding the corpus and queries with SBERT. We calculate the similarity between data points, filter out those below a specified threshold, and then rank the remaining tourist destinations using a count-based algorithm to align them semantically with the query. To assess the efficacy of the ranking algorithm, experiments were conducted with four queries. Furthermore, 58,175 sentences were directly labeled to ascertain their semantic relevance to the third query, 'crowdedness'. Notably, human-labeled data for crowdedness showed similar results. Despite challenges including threshold optimization and imbalanced data, this study shows that semantic search is a powerful method for understanding user intent and recommending tourist destinations with less time and cost.
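
The filter-then-count step described above can be sketched as follows. Hand-written 2-dimensional vectors stand in for SBERT embeddings so that the example runs without a model; cosine similarity, the threshold value, and the destination names are assumptions.

```python
from collections import Counter
from math import sqrt


def cosine(u, v):
    """Cosine similarity between two non-zero vectors of equal length."""
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v)))


def rank_destinations(query_vec, sentences, threshold):
    """Count, per destination, the review sentences at or above the threshold."""
    counts = Counter(
        destination
        for destination, vec in sentences
        if cosine(query_vec, vec) >= threshold
    )
    return counts.most_common()


# Hypothetical (destination, sentence embedding) pairs.
sentences = [
    ("Beach", [1.0, 0.0]),
    ("Beach", [0.9, 0.1]),
    ("Temple", [0.8, 0.6]),
    ("Temple", [0.0, 1.0]),
    ("Market", [0.1, 0.9]),
]
query_vec = [1.0, 0.0]

ranking = rank_destinations(query_vec, sentences, threshold=0.7)
```

Destinations with no sentence above the threshold ("Market" here) drop out of the ranking entirely, which is why the choice of threshold matters as much as the abstract suggests.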

Neighbor Caching for P2P Applications in Multi-hop Wireless Ad Hoc Networks (멀티 홉 무선 애드혹 네트워크에서 P2P 응용을 위한 이웃 캐싱)

  • 조준호;오승택;김재명;이형호;이준원
    • Journal of KIISE:Information Networking / v.30 no.5 / pp.631-640 / 2003
  • Because of multi-hop wireless communication, P2P applications in ad hoc networks suffer from poor performance. We propose a neighbor caching strategy to overcome this shortcoming and show that it is more efficient than self caching, in which each node stores data only in its own cache. With neighbor caching, a node can extend its caching storage instantaneously by borrowing storage from idle neighbors, and so overcome multi-hop wireless communication with a data source far away from itself. We also present ranking-based prediction, which selects the most appropriate neighbor in which to store data. A node that uses ranking-based prediction can select the neighbor that is most likely to keep data for a long time and can avoid caching low-ranked data. Therefore, ranking-based prediction improves the throughput of neighbor caching. In the simulation results, we observe that neighbor caching performs better as the network size grows, as the idle time lengthens, and as the cache size shrinks. We also show that ranking-based prediction is an adaptive algorithm that adjusts how often data is moved to the neighbor, making neighbor caching flexible according to the idleness of nodes.