Search | Korea Science

Character-based Subtitle Generation by Learning of Multimodal Concept Hierarchy from Cartoon Videos (멀티모달 개념계층모델을 이용한 만화비디오 컨텐츠 학습을 통한 등장인물 기반 비디오 자막 생성)

Kim, Kyung-Min;Ha, Jung-Woo;Lee, Beom-Jin;Zhang, Byoung-Tak
- Journal of KIISE
- /
- v.42 no.4
- /
- pp.451-458
- /
- 2015
Previous multimodal learning methods focus on problem-solving aspects, such as image and video search and tagging, rather than on knowledge acquisition via content modeling. In this paper, we propose the Multimodal Concept Hierarchy (MuCH), which is a content modeling method that uses a cartoon video dataset and a character-based subtitle generation method from the learned model. The MuCH model has a multimodal hypernetwork layer, in which the patterns of the words and image patches are represented, and a concept layer, in which each concept variable is represented by a probability distribution of the words and the image patches. The model can learn the characteristics of the characters as concepts from the video subtitles and scene images by using a Bayesian learning method and can also generate character-based subtitles from the learned model if text queries are provided. As an experiment, the MuCH model learned concepts from 'Pororo' cartoon videos with a total of 268 minutes in length and generated character-based subtitles. Finally, we compare the results with those of other multimodal learning models. The Experimental results indicate that given the same text query, our model generates more accurate and more character-specific subtitles than other models.
https://doi.org/10.5626/JOK.2015.42.4.451 인용 KSCI

Ontology Modeling and Rule-based Reasoning for Automatic Classification of Personal Media (미디어 영상 자동 분류를 위한 온톨로지 모델링 및 규칙 기반 추론)

Park, Hyun-Kyu;So, Chi-Seung;Park, Young-Tack
- Journal of KIISE
- /
- v.43 no.3
- /
- pp.370-379
- /
- 2016
Recently personal media were produced in a variety of ways as a lot of smart devices have been spread and services using these data have been desired. Therefore, research has been actively conducted for the media analysis and recognition technology and we can recognize the meaningful object from the media. The system using the media ontology has the disadvantage that can't classify the media appearing in the video because of the use of a video title, tags, and script information. In this paper, we propose a system to automatically classify video using the objects shown in the media data. To do this, we use a description logic-based reasoning and a rule-based inference for event processing which may vary in order. Description logic-based reasoning system proposed in this paper represents the relation of the objects in the media as activity ontology. We describe how to another rule-based reasoning system defines an event according to the order of the inference activity and order based reasoning system automatically classify the appropriate event to the category. To evaluate the efficiency of the proposed approach, we conducted an experiment using the media data classified as a valid category by the analysis of the Youtube video.
https://doi.org/10.5626/JOK.2016.43.3.370 인용 KSCI

ECoMOT : An Efficient Content-based Multimedia Information Retrieval System Using Moving Objects' Trajectories in Video Data (ECoMOT : 비디오 데이터내의 이동체의 제적을 이용한 효율적인 내용 기반 멀티미디어 정보검색 시스템)

Shim Choon-Bo;Chang Jae-Woo;Shin Yong-Won;Park Byung-Rae
- The KIPS Transactions:PartB
- /
- v.12B no.1 s.97
- /
- pp.47-56
- /
- 2005
A moving object has a various features that its spatial location, shape, and size are changed as time goes. In addition, the moving object has both temporal feature and spatial feature. It is one of the highly interested feature information in video data. In this paper, we propose an efficient content-based multimedia information retrieval system, so tailed ECoMOT which enables user to retrieve video data by using a trajectory information of moving objects in video data. The ECoMOT includes several novel techniques to achieve content-based retrieval using moving objects' trajectories : (1) Muitiple trajectory modeling technique to model the multiple trajectories composed of several moving objects; (2) Multiple similar trajectory retrieval technique to retrieve more similar trajectories by measuring similarity between a given two trajectories composed of several moving objects; (3) Superimposed signature-based trajectory indexing technique to effectively search corresponding trajectories from a large trajectory databases; (4) convenient trajectory extraction, query generation, and retrieval interface based on graphic user interface
https://doi.org/10.3745/KIPSTB.2005.12B.1.047 인용 PDF KSCI

A Proposal of Multimedia Retrieval System and XML Meta-data Modeling Techniques (XML 메타데이터 모델링기법과 멀티미디어 검색시스템의 제안)

윤미희;조동욱
- Proceedings of the Korea Contents Association Conference
- /
- 2003.05a
- /
- pp.393-398
- /
- 2003
Video which contains the multiple data such as text, images, audio and motion of objects is typical multimedia data. Multimedia retrieval system using XML is essential for efficient rep. of multimedia data. Therefore, multimedia retrieval system for retrieval and structural understanding is needed to retrieve the multimedia data. This Paper Proposes the multimedia retrieval system based on XML Meta-data modeling techniques.
PDF

Particle System Graphics Library for Generating Special Effects

Kim Eung-Kon
- International Journal of Contents
- /
- v.2 no.2
- /
- pp.1-5
- /
- 2006
The modeling and animation of natural phenomena have received much attention from the computer graphics community. Synthetic of natural phenomena are required for such diverse applications as flight simulators, special effects, video games and other virtual realty. In special effects industry there is a high demand to convincingly mimic the appearance and behavior of natural phenomena such as smoke, waterfall, rain, and fire. Particle systems are methods adequate for modeling fuzzy objects of natural phenomena. This paper presents particle system API(Application Program Interfaces) for generating special effects in virtual reality applications. The API are a set of functions that allow C++ programs to simulate the dynamics of particles for special effects in interactive and non-interactive graphics applications, not for scientific simulation.
PDF

Balancing of Digital VCR Head Drum (디지털 VCR 헤드 드럼의 밸런싱 연구)

조여욱;이진구
- Journal of the Korean Society for Precision Engineering
- /
- v.15 no.2
- /
- pp.61-67
- /
- 1998
Dynamic stability in rotation of the head drum of digital VCR is very important due to the nature of high rotation speed and small angular inertia. Therefore special considerations on reducing the unbalance and assuring the stability are required the design and manufacturing process. In this paper, newly developed digital head drum is introduced. And advanced methods in analyzing and reducing the unbalance is suggested. LDV(Laser Doppler Vibrometer) was used as a measurement system verifying our modeling and new method for balancing. Experiments show that the theoretical data estimated by modeling of shaft bending caused by unbalance mass and the measured data are almost identical. The deflection was reduced to 30％ by applying the suggested balancing method.
PDF

A Study on 3D Culture Character Modeling Tools for Establishment (창업을 위한 3D문화캐릭터 모델링 도구에 관한 연구)

Park, Hea-sook
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2017.07a
- /
- pp.372-373
- /
- 2017
본 논문에서는 도래한 4차산업사회 속에서 문화콘텐츠인 캐릭터를 무료 3D모델링 도구를 이용하여 디자인하고 3D 프린팅 기술을 접목하여 개인맞춤형 캐릭터를 제작 및 판매 하고자 하는 사업에 대한 로드맵을 제시해보고자 한다. 이를 위해 첫 단계로서 3D 모델링 도구의 전반적인 특성을 살펴보고 3D 캐릭터 모델링 도구들 중에서 무료로 사용할 수 있고 사용하기 편한 도구들을 정리하여 제시해보고자 한다. 제작하고자 하는 캐릭터의 특성에 맞는 도구를 선택하고 학습하여 성공적인 창업을 할 수 있도록 방향을 제시하고자 한다.
PDF

User Review Mining: An Approach for Software Requirements Evolution

Lee, Jee Young
- International journal of advanced smart convergence
- /
- v.9 no.4
- /
- pp.124-131
- /
- 2020
As users of internet-based software applications increase, functional and non-functional problems for software applications are quickly exposed to user reviews. These user reviews are an important source of information for software improvement. User review mining has become an important topic of intelligent software engineering. This study proposes a user review mining method for software improvement. User review data collected by crawling on the app review page is analyzed to check user satisfaction. It analyzes the sentiment of positive and negative that users feel with a machine learning method. And it analyzes user requirement issues through topic analysis based on structural topic modeling. The user review mining process proposed in this study conducted a case study with the a non-face-to-face video conferencing app. Software improvement through user review mining contributes to the user lock-in effect and extending the life cycle of the software. The results of this study will contribute to providing insight on improvement not only for developers, but also for service operators and marketing.
https://doi.org/10.7236/IJASC.2020.9.4.124 인용 PDF KSCI

Deep Learning-Based Occupancy Detection and Visualization for Architecture and Urban Data - Towards Augmented Reality and GIS Integration for Improved Safety and Emergency Response Modeling - (건물 내 재실자 감지 및 시각화를 위한 딥러닝 모델 - 증강현실 및 GIS 통합을 통한 안전 및 비상 대응 개선모델 프로토타이핑 -)

Shin, Dongyoun
- Journal of KIBIM
- /
- v.13 no.2
- /
- pp.29-36
- /
- 2023
This study explores the potential of utilizing video-based data analysis and machine learning techniques to estimate the number of occupants within a building. The research methodology involves developing a sophisticated counting system capable of detecting and tracking individuals' entry and exit patterns. The proposed method demonstrates promising results in various scenarios; however, it also identifies the need for improvements in camera performance and external environmental conditions, such as lighting. The study emphasizes the significance of incorporating machine learning in architectural and urban planning applications, offering valuable insights for the field. In conclusion, the research calls for further investigation to address the limitations and enhance the system's accuracy, ultimately contributing to the development of a more robust and reliable solution for building occupancy estimation.
https://doi.org/10.13161/kibim.2023.13.2.029 인용 PDF

Improving Efficiency of Timeslot Assignment for Non-realtime Data in a DVB-RCS Return Link: Modeling and Algorithm

Lee, Ki-Dong;Cho, Yong-Hoon;Lee, Ho-Jin;Oh, Deock-Gil
- ETRI Journal
- /
- v.25 no.4
- /
- pp.211-218
- /
- 2003
This paper presents a dynamic resource allocation algorithm with multi-frequency time-division multiple access for the return link of interactive satellite multimedia networks such as digital video broadcasting return channel via satellite systems. The proposed timeslot assignment algorithm, called the very efficient dynamic timeslot assignment (VEDTA) algorithm, gives an optimal assignment plan within a very short period. The optimality and computational efficiency of this algorithm demonstrate that it will be useful in field applications.
PDF

Search Result 309, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)