• Title/Summary/Keyword: Feature Value Similarity

Search Result 79, Processing Time 0.028 seconds

Similar Contents Recommendation Model Based On Contents Meta Data Using Language Model (언어모델을 활용한 콘텐츠 메타 데이터 기반 유사 콘텐츠 추천 모델)

  • Donghwan Kim
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.1
    • /
    • pp.27-40
    • /
    • 2023
  • With the increase in the spread of smart devices and the impact of COVID-19, the consumption of media contents through smart devices has significantly increased. Along with this trend, the amount of media contents viewed through OTT platforms is increasing, that makes contents recommendations on these platforms more important. Previous contents-based recommendation researches have mostly utilized metadata that describes the characteristics of the contents, with a shortage of researches that utilize the contents' own descriptive metadata. In this paper, various text data including titles and synopses that describe the contents were used to recommend similar contents. KLUE-RoBERTa-large, a Korean language model with excellent performance, was used to train the model on the text data. A dataset of over 20,000 contents metadata including titles, synopses, composite genres, directors, actors, and hash tags information was used as training data. To enter the various text features into the language model, the features were concatenated using special tokens that indicate each feature. The test set was designed to promote the relative and objective nature of the model's similarity classification ability by using the three contents comparison method and applying multiple inspections to label the test set. Genres classification and hash tag classification prediction tasks were used to fine-tune the embeddings for the contents meta text data. As a result, the hash tag classification model showed an accuracy of over 90% based on the similarity test set, which was more than 9% better than the baseline language model. Through hash tag classification training, it was found that the language model's ability to classify similar contents was improved, which demonstrated the value of using a language model for the contents-based filtering.

Recognition of Partially Occluded Binary Objects using Elastic Deformation Energy Measure (탄성변형에너지 측도를 이용한 부분적으로 가려진 이진 객체의 인식)

  • Moon, Young-In;Koo, Ja-Young
    • Journal of the Korea Society of Computer and Information
    • /
    • v.19 no.10
    • /
    • pp.63-70
    • /
    • 2014
  • Process of recognizing objects in binary images consists of image segmentation and pattern matching. If binary objects in the image are assumed to be separated, global features such as area, length of perimeter, or the ratio of the two can be used to recognize the objects in the image. However, if such an assumption is not valid, the global features can not be used but local features such as points or line segments should be used to recognize the objects. In this paper points with large curvature along the perimeter are chosen to be the feature points, and pairs of points selected from them are used as local features. Similarity of two local features are defined using elastic deformation energy for making the lengths and angles between gradient vectors at the end points same. Neighbour support value is defined and used for robust recognition of partially occluded binary objects. An experiment on Kimia-25 data showed that the proposed algorithm runs 4.5 times faster than the maximum clique algorithm with same recognition rate.

Classification and Stand Characteristics of Subalpine Forest Vegetation at Hyangjeukbong and Jungbong in Mt. Deogyusan (덕유산 향적봉 및 중봉 아고산대의 산림식생유형분류와 임분 특성)

  • Han, Sang Hak;Han, Sim Hee;Yun, Chung Weon
    • Journal of Korean Society of Forest Science
    • /
    • v.105 no.1
    • /
    • pp.48-62
    • /
    • 2016
  • This study was conducted to classify forest vegetation structure and stand feature of Mt. Deogyusan National Park from Hyangjeukbong to Jungbong, 48 plots were surveyed. The type classification of the vegetation structure was performed with Z-M phytosociological method. As a result, Quercus mongolica community group was classified into the Picea jezoensis community, Carpinus cordata community and Tilia amurensis community in community unit. P. jezoensis community was subdivided into Deutzia glabrata group and Viburnum opulus var. calvescens group in group unit. D. glabrata group was subdivided into Acer mandshuricum subgroup and Ribes mandshuricum subgroup and V. opulus var. calvescens group was subdivided into Hemerocallis dumortieri subgroup and Prunus padus subgroup in subgroup unit. In the result of estimating the importance value, it constituted Q. mongolica (23.9%), Abies koreana (14.7%), Taxus cuspidata (10.2%), P. jezoensis (8.2%) and Betula ermanii (7.4%) in tree layer. It constituted Acer komarovii (18.6%), Acer pseudosieboldianum (18.4%) and Q. mongolica (8.9%) in subtree layer. It constituted Rhododendron schlippenbachii (20.7%), A. pseudosieboldianum (17.4%) and Symplocos chinensis (8.5%) in shrub layer. Indicator species analysis of vegetation unit 1 was consisted of Hydrangea serrata, Fraxinus mandshurica and D. glabrata that species prefer moist valley in subalpine or rocks. In the results of analyzing the species diversity, vegetation unit 1, 4 and 5 represented that there were different and complex local distributions. As in the similarity between the vegetation units, the vegetation units 1, 2, 3 and 4 represented high with 0.5 or above. It represented that there wasn't no differences on composition species in vegetation units.

A Design and Implementation of a Content_Based Image Retrieval System using Color Space and Keywords (칼라공간과 키워드를 이용한 내용기반 화상검색 시스템 설계 및 구현)

  • Kim, Cheol-Ueon;Choi, Ki-Ho
    • The Transactions of the Korea Information Processing Society
    • /
    • v.4 no.6
    • /
    • pp.1418-1432
    • /
    • 1997
  • Most general content_based image retrieval techniques use color and texture as retrieval indices. In color techniques, color histogram and color pair based color retrieval techniques suffer from a lack of spatial information and text. And This paper describes the design and implementation of content_based image retrieval system using color space and keywords. The preprocessor for image retrieval has used the coordinate system of the existing HSI(Hue, Saturation, Intensity) and preformed to split One image into chromatic region and achromatic region respectively, It is necessary to normalize the size of image for 200*N or N*200 and to convert true colors into 256 color. Two color histograms for background and object are used in order to decide on color selection in the color space. Spatial information is obtained using a maximum entropy discretization. It is possible to choose the class, color, shape, location and size of image by using keyword. An input color is limited by 15 kinds keyword of chromatic and achromatic colors of the Korea Industrial Standards. Image retrieval method is used as the key of retrieval properties in the similarity. The weight values of color space ${\alpha}(%)and\;keyword\;{\beta}(%)$ can be chosen by the user in inputting the query words, controlling the values according to the properties of image_contents. The result of retrieval in the test using extracted feature such as color space and keyword to the query image are lower that those of weight value. In the case of weight value, the average of te measuring parameters shows approximate Precision(0.858), Recall(0.936), RT(1), MT(0). The above results have proved higher retrieval effects than the content_based image retrieval by using color space of keywords.

  • PDF

Incorporating Social Relationship discovered from User's Behavior into Collaborative Filtering (사용자 행동 기반의 사회적 관계를 결합한 사용자 협업적 여과 방법)

  • Thay, Setha;Ha, Inay;Jo, Geun-Sik
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.2
    • /
    • pp.1-20
    • /
    • 2013
  • Nowadays, social network is a huge communication platform for providing people to connect with one another and to bring users together to share common interests, experiences, and their daily activities. Users spend hours per day in maintaining personal information and interacting with other people via posting, commenting, messaging, games, social events, and applications. Due to the growth of user's distributed information in social network, there is a great potential to utilize the social data to enhance the quality of recommender system. There are some researches focusing on social network analysis that investigate how social network can be used in recommendation domain. Among these researches, we are interested in taking advantages of the interaction between a user and others in social network that can be determined and known as social relationship. Furthermore, mostly user's decisions before purchasing some products depend on suggestion of people who have either the same preferences or closer relationship. For this reason, we believe that user's relationship in social network can provide an effective way to increase the quality in prediction user's interests of recommender system. Therefore, social relationship between users encountered from social network is a common factor to improve the way of predicting user's preferences in the conventional approach. Recommender system is dramatically increasing in popularity and currently being used by many e-commerce sites such as Amazon.com, Last.fm, eBay.com, etc. Collaborative filtering (CF) method is one of the essential and powerful techniques in recommender system for suggesting the appropriate items to user by learning user's preferences. CF method focuses on user data and generates automatic prediction about user's interests by gathering information from users who share similar background and preferences. Specifically, the intension of CF method is to find users who have similar preferences and to suggest target user items that were mostly preferred by those nearest neighbor users. There are two basic units that need to be considered by CF method, the user and the item. Each user needs to provide his rating value on items i.e. movies, products, books, etc to indicate their interests on those items. In addition, CF uses the user-rating matrix to find a group of users who have similar rating with target user. Then, it predicts unknown rating value for items that target user has not rated. Currently, CF has been successfully implemented in both information filtering and e-commerce applications. However, it remains some important challenges such as cold start, data sparsity, and scalability reflected on quality and accuracy of prediction. In order to overcome these challenges, many researchers have proposed various kinds of CF method such as hybrid CF, trust-based CF, social network-based CF, etc. In the purpose of improving the recommendation performance and prediction accuracy of standard CF, in this paper we propose a method which integrates traditional CF technique with social relationship between users discovered from user's behavior in social network i.e. Facebook. We identify user's relationship from behavior of user such as posts and comments interacted with friends in Facebook. We believe that social relationship implicitly inferred from user's behavior can be likely applied to compensate the limitation of conventional approach. Therefore, we extract posts and comments of each user by using Facebook Graph API and calculate feature score among each term to obtain feature vector for computing similarity of user. Then, we combine the result with similarity value computed using traditional CF technique. Finally, our system provides a list of recommended items according to neighbor users who have the biggest total similarity value to the target user. In order to verify and evaluate our proposed method we have performed an experiment on data collected from our Movies Rating System. Prediction accuracy evaluation is conducted to demonstrate how much our algorithm gives the correctness of recommendation to user in terms of MAE. Then, the evaluation of performance is made to show the effectiveness of our method in terms of precision, recall, and F1-measure. Evaluation on coverage is also included in our experiment to see the ability of generating recommendation. The experimental results show that our proposed method outperform and more accurate in suggesting items to users with better performance. The effectiveness of user's behavior in social network particularly shows the significant improvement by up to 6% on recommendation accuracy. Moreover, experiment of recommendation performance shows that incorporating social relationship observed from user's behavior into CF is beneficial and useful to generate recommendation with 7% improvement of performance compared with benchmark methods. Finally, we confirm that interaction between users in social network is able to enhance the accuracy and give better recommendation in conventional approach.

A Novel Video Copy Detection Method based on Statistical Analysis (통계적 분석 기반 불법 복제 비디오 영상 감식 방법)

  • Cho, Hye-Jeong;Kim, Ji-Eun;Sohn, Chae-Bong;Chung, Kwang-Sue;Oh, Seoung-Jun
    • Journal of Broadcast Engineering
    • /
    • v.14 no.6
    • /
    • pp.661-675
    • /
    • 2009
  • The carelessly and illegally copied contents are raising serious social problem as internet and multimedia technologies are advancing. Therefore, development of video copy detection system must be settled without delay. In this paper, we propose the hierarchical video copy detection method that estimates similarity using statistical characteristics between original video and manipulated(transformed) copy video. We rank according to luminance value of video to be robust to spacial transformation, and choose similar videos categorized as candidate segments in huge amount of database to reduce processing time and complexity. The copy videos generally insert black area in the edge of the image, so we remove rig black area and decide copy or not by using statistical characteristics of original video and copied video with center part of frame that contains important information of video. Experiment results show that the proposed method has similar keyframe accuracy to reference method, but we use less memory to save feature information than reference's, because the number of keyframes is less 61% than that of reference's. Also, the proposed method detects if the video is copied or not efficiently despite expansive spatial transformations such as blurring, contrast change, zoom in, zoom out, aspect ratio change, and caption insertion.

Community Analysis of Oribatid Mites(Acari: Oribatida) in Namsan and Kwangreung Coniferous Forests (남산과 광릉 침엽수림의 날개응애 군집분석)

  • 박홍현;이준호
    • Korean journal of applied entomology
    • /
    • v.39 no.1
    • /
    • pp.31-41
    • /
    • 2000
  • Community analysis of oribatid mites was conducted in Namsan and Kwangreung coniferous forests which have been received by different degrees of environmental pressures through urbanization processes. Oribatid mites were sampled in the litter and soil layer of study sites from May 1993 to October 1994. Although two sites have been under similar weather condition, seasonal changes in oribatid mites density did not show a synchronized pattern. Density in spring and summer showed stable pattern with low fluctuations, but unstable pattern in autumn between 1993 and 1994. And these patterns were highly correlated with precipitation. The density and species number were higher in the litter layer than in the soil layer and showed no typical seasonal changes. The dominant species were Scheloribates latipes (1 l.78%), Pergalumna altera (8.92%), Eohypochthonius crassisetiger (7.58%), Scheloribates sp. (6.89%) and Suctobelbella yezoensis (5.04%) in Namsan, and Ceratozetes japonicus (25.72%), Punctoribates punctum (14.15%), Trichogalumna nipponica (10.96%) and Ramusella sengbuschi (5.08%) in Kwangreung. The number of species with high constany were 10 and 18 in Namsan and Kwangreung, respectively. Namsan showed the feature of urban forests. In analysis of species diversity, species richness was significantly higher in Kwangreung than in Namsan, while shannon (H') and evenness index (J') were higher in Namsan than in Kwangreung. The values of shannon index (H') in Namsan and Kwangreugn were 2.83 and 2.62, respectively and evenness index (J') were 0.78 and 0.67, respectively. The value of similarity index between two sites was 0.68.

  • PDF

Color-related Query Processing for Intelligent E-Commerce Search (지능형 검색엔진을 위한 색상 질의 처리 방안)

  • Hong, Jung A;Koo, Kyo Jung;Cha, Ji Won;Seo, Ah Jeong;Yeo, Un Yeong;Kim, Jong Woo
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.1
    • /
    • pp.109-125
    • /
    • 2019
  • As interest on intelligent search engines increases, various studies have been conducted to extract and utilize the features related to products intelligencely. In particular, when users search for goods in e-commerce search engines, the 'color' of a product is an important feature that describes the product. Therefore, it is necessary to deal with the synonyms of color terms in order to produce accurate results to user's color-related queries. Previous studies have suggested dictionary-based approach to process synonyms for color features. However, the dictionary-based approach has a limitation that it cannot handle unregistered color-related terms in user queries. In order to overcome the limitation of the conventional methods, this research proposes a model which extracts RGB values from an internet search engine in real time, and outputs similar color names based on designated color information. At first, a color term dictionary was constructed which includes color names and R, G, B values of each color from Korean color standard digital palette program and the Wikipedia color list for the basic color search. The dictionary has been made more robust by adding 138 color names converted from English color names to foreign words in Korean, and with corresponding RGB values. Therefore, the fininal color dictionary includes a total of 671 color names and corresponding RGB values. The method proposed in this research starts by searching for a specific color which a user searched for. Then, the presence of the searched color in the built-in color dictionary is checked. If there exists the color in the dictionary, the RGB values of the color in the dictioanry are used as reference values of the retrieved color. If the searched color does not exist in the dictionary, the top-5 Google image search results of the searched color are crawled and average RGB values are extracted in certain middle area of each image. To extract the RGB values in images, a variety of different ways was attempted since there are limits to simply obtain the average of the RGB values of the center area of images. As a result, clustering RGB values in image's certain area and making average value of the cluster with the highest density as the reference values showed the best performance. Based on the reference RGB values of the searched color, the RGB values of all the colors in the color dictionary constructed aforetime are compared. Then a color list is created with colors within the range of ${\pm}50$ for each R value, G value, and B value. Finally, using the Euclidean distance between the above results and the reference RGB values of the searched color, the color with the highest similarity from up to five colors becomes the final outcome. In order to evaluate the usefulness of the proposed method, we performed an experiment. In the experiment, 300 color names and corresponding color RGB values by the questionnaires were obtained. They are used to compare the RGB values obtained from four different methods including the proposed method. The average euclidean distance of CIE-Lab using our method was about 13.85, which showed a relatively low distance compared to 3088 for the case using synonym dictionary only and 30.38 for the case using the dictionary with Korean synonym website WordNet. The case which didn't use clustering method of the proposed method showed 13.88 of average euclidean distance, which implies the DBSCAN clustering of the proposed method can reduce the Euclidean distance. This research suggests a new color synonym processing method based on RGB values that combines the dictionary method with the real time synonym processing method for new color names. This method enables to get rid of the limit of the dictionary-based approach which is a conventional synonym processing method. This research can contribute to improve the intelligence of e-commerce search systems especially on the color searching feature.

The Discourse of Capitalist Society on East Asian Pop Culture: A TV Series of Superhero Animation (대중문화에 재현된 동아시아 자본주의 사회의 담론 : 슈퍼히어로 애니메이션 <타이거 앤 버니>를 중심으로)

  • Woo, Ji-Woon;Noh, Kwang-Woo;Kwon, Jae-Woong
    • Cartoon and Animation Studies
    • /
    • s.37
    • /
    • pp.45-82
    • /
    • 2014
  • Comics and cartoons of superheroes in the West have adopted various semiotic systems and other art-forms, including their politico-socio-economic condition, and made parody of other popular texts, as well. Based on the idea of the development of superhero genre, this article focuses on how East Asian popular texts appropriate and reconstruct the genre, which was once considered the realization of American idea, by analyzing a series of TV animation (Japan, Sunrise,2011). Through the feature of parody with intertextuality, provides East Asian value and sensibility of characters as corporation-centered modern humans in capitalist society. This animation has similarity and difference, compared to that of Western superhero cartoons. It satires Western capitalist society and emphasizes Eastern family-oriented value. The performances of superheroes on TV represent the satire on Western style individualism and estimation through each one's achievement. It metaphorically criticizes the situation in which modern human falls into dependency on capital and media, and the capitalistic system in which public good is used for the method of private profit. emphasizes East Asian value of human and society, the cooperative relation for the success and maintenance of community by combining members of state and society through familial sensibility. Tiger functions as a spiritual leader in the group of superheroes who have been obsessed with competition for their own private purpose rather than public cause, Bunny and other colleagues are gradually influenced by Tiger's familial communicative style. emphasizes community-centered view and self-sacrificing sensibility as an international citizen to solve social pathology of modern world.