Search | Korea Science

Story-based Information Retrieval (스토리 기반의 정보 검색 연구)

You, Eun-Soon;Park, Seung-Bo
- Journal of Intelligence and Information Systems
- /
- v.19 no.4
- /
- pp.81-96
- /
- 2013
Video information retrieval has become a very important issue because of the explosive increase in video data from Web content development. Meanwhile, content-based video analysis using visual features has been the main source for video information retrieval and browsing. Content in video can be represented with content-based analysis techniques, which can extract various features from audio-visual data such as frames, shots, colors, texture, or shape. Moreover, similarity between videos can be measured through content-based analysis. However, a movie that is one of typical types of video data is organized by story as well as audio-visual data. This causes a semantic gap between significant information recognized by people and information resulting from content-based analysis, when content-based video analysis using only audio-visual data of low level is applied to information retrieval of movie. The reason for this semantic gap is that the story line for a movie is high level information, with relationships in the content that changes as the movie progresses. Information retrieval related to the story line of a movie cannot be executed by only content-based analysis techniques. A formal model is needed, which can determine relationships among movie contents, or track meaning changes, in order to accurately retrieve the story information. Recently, story-based video analysis techniques have emerged using a social network concept for story information retrieval. These approaches represent a story by using the relationships between characters in a movie, but these approaches have problems. First, they do not express dynamic changes in relationships between characters according to story development. Second, they miss profound information, such as emotions indicating the identities and psychological states of the characters. Emotion is essential to understanding a character's motivation, conflict, and resolution. Third, they do not take account of events and background that contribute to the story. As a result, this paper reviews the importance and weaknesses of previous video analysis methods ranging from content-based approaches to story analysis based on social network. Also, we suggest necessary elements, such as character, background, and events, based on narrative structures introduced in the literature. We extract characters' emotional words from the script of the movie Pretty Woman by using the hierarchical attribute of WordNet, which is an extensive English thesaurus. WordNet offers relationships between words (e.g., synonyms, hypernyms, hyponyms, antonyms). We present a method to visualize the emotional pattern of a character over time. Second, a character's inner nature must be predetermined in order to model a character arc that can depict the character's growth and development. To this end, we analyze the amount of the character's dialogue in the script and track the character's inner nature using social network concepts, such as in-degree (incoming links) and out-degree (outgoing links). Additionally, we propose a method that can track a character's inner nature by tracing indices such as degree, in-degree, and out-degree of the character network in a movie through its progression. Finally, the spatial background where characters meet and where events take place is an important element in the story. We take advantage of the movie script to extracting significant spatial background and suggest a scene map describing spatial arrangements and distances in the movie. Important places where main characters first meet or where they stay during long periods of time can be extracted through this scene map. In view of the aforementioned three elements (character, event, background), we extract a variety of information related to the story and evaluate the performance of the proposed method. We can track story information extracted over time and detect a change in the character's emotion or inner nature, spatial movement, and conflicts and resolutions in the story.
https://doi.org/10.13088/jiis.2013.19.4.081 인용 PDF KSCI

Automatic gasometer reading system using selective optical character recognition (관심 문자열 인식 기술을 이용한 가스계량기 자동 검침 시스템)

Lee, Kyohyuk;Kim, Taeyeon;Kim, Wooju
- Journal of Intelligence and Information Systems
- /
- v.26 no.2
- /
- pp.1-25
- /
- 2020
In this paper, we suggest an application system architecture which provides accurate, fast and efficient automatic gasometer reading function. The system captures gasometer image using mobile device camera, transmits the image to a cloud server on top of private LTE network, and analyzes the image to extract character information of device ID and gas usage amount by selective optical character recognition based on deep learning technology. In general, there are many types of character in an image and optical character recognition technology extracts all character information in an image. But some applications need to ignore non-of-interest types of character and only have to focus on some specific types of characters. For an example of the application, automatic gasometer reading system only need to extract device ID and gas usage amount character information from gasometer images to send bill to users. Non-of-interest character strings, such as device type, manufacturer, manufacturing date, specification and etc., are not valuable information to the application. Thus, the application have to analyze point of interest region and specific types of characters to extract valuable information only. We adopted CNN (Convolutional Neural Network) based object detection and CRNN (Convolutional Recurrent Neural Network) technology for selective optical character recognition which only analyze point of interest region for selective character information extraction. We build up 3 neural networks for the application system. The first is a convolutional neural network which detects point of interest region of gas usage amount and device ID information character strings, the second is another convolutional neural network which transforms spatial information of point of interest region to spatial sequential feature vectors, and the third is bi-directional long short term memory network which converts spatial sequential information to character strings using time-series analysis mapping from feature vectors to character strings. In this research, point of interest character strings are device ID and gas usage amount. Device ID consists of 12 arabic character strings and gas usage amount consists of 4 ~ 5 arabic character strings. All system components are implemented in Amazon Web Service Cloud with Intel Zeon E5-2686 v4 CPU and NVidia TESLA V100 GPU. The system architecture adopts master-lave processing structure for efficient and fast parallel processing coping with about 700,000 requests per day. Mobile device captures gasometer image and transmits to master process in AWS cloud. Master process runs on Intel Zeon CPU and pushes reading request from mobile device to an input queue with FIFO (First In First Out) structure. Slave process consists of 3 types of deep neural networks which conduct character recognition process and runs on NVidia GPU module. Slave process is always polling the input queue to get recognition request. If there are some requests from master process in the input queue, slave process converts the image in the input queue to device ID character string, gas usage amount character string and position information of the strings, returns the information to output queue, and switch to idle mode to poll the input queue. Master process gets final information form the output queue and delivers the information to the mobile device. We used total 27,120 gasometer images for training, validation and testing of 3 types of deep neural network. 22,985 images were used for training and validation, 4,135 images were used for testing. We randomly splitted 22,985 images with 8:2 ratio for training and validation respectively for each training epoch. 4,135 test image were categorized into 5 types (Normal, noise, reflex, scale and slant). Normal data is clean image data, noise means image with noise signal, relfex means image with light reflection in gasometer region, scale means images with small object size due to long-distance capturing and slant means images which is not horizontally flat. Final character string recognition accuracies for device ID and gas usage amount of normal data are 0.960 and 0.864 respectively.
https://doi.org/10.13088/jiis.2020.26.2.001 인용 PDF KSCI

A New Flash Memory Package Structure with Intelligent Buffer System and Performance Evaluation (버퍼 시스템을 내장한 새로운 플래쉬 메모리 패키지 구조 및 성능 평가)

Lee Jung-Hoon;Kim Shin-Dug
- Journal of KIISE:Computer Systems and Theory
- /
- v.32 no.2
- /
- pp.75-84
- /
- 2005
This research is to design a high performance NAND-type flash memory package with a smart buffer cache that enhances the exploitation of spatial and temporal locality. The proposed buffer structure in a NAND flash memory package, called as a smart buffer cache, consists of three parts, i.e., a fully-associative victim buffer with a small block size, a fully-associative spatial buffer with a large block size, and a dynamic fetching unit. This new NAND-type flash memory package can achieve dramatically high performance and low power consumption comparing with any conventional NAND-type flash memory. Our results show that the NAND flash memory package with a smart buffer cache can reduce the miss ratio by around 70% and the average memory access time by around 67%, over the conventional NAND flash memory configuration. Also, the average miss ratio and average memory access time of the package module with smart buffer for a given buffer space (e.g., 3KB) can achieve better performance than package modules with a conventional direct-mapped buffer with eight times(e.g., 32KB) as much space and a fully-associative configuration with twice as much space(e.g., 8KB)
PDF KSCI

Dynamic Hand Gesture Recognition Using CNN Model and FMM Neural Networks (CNN 모델과 FMM 신경망을 이용한 동적 수신호 인식 기법)

Kim, Ho-Joon
- Journal of Intelligence and Information Systems
- /
- v.16 no.2
- /
- pp.95-108
- /
- 2010
In this paper, we present a hybrid neural network model for dynamic hand gesture recognition. The model consists of two modules, feature extraction module and pattern classification module. We first propose a modified CNN(convolutional Neural Network) a pattern recognition model for the feature extraction module. Then we introduce a weighted fuzzy min-max(WFMM) neural network for the pattern classification module. The data representation proposed in this research is a spatiotemporal template which is based on the motion information of the target object. To minimize the influence caused by the spatial and temporal variation of the feature points, we extend the receptive field of the CNN model to a three-dimensional structure. We discuss the learning capability of the WFMM neural networks in which the weight concept is added to represent the frequency factor in training pattern set. The model can overcome the performance degradation which may be caused by the hyperbox contraction process of conventional FMM neural networks. From the experimental results of human action recognition and dynamic hand gesture recognition for remote-control electric home appliances, the validity of the proposed models is discussed.
PDF KSCI

A Hybrid Semantic-Geometric Approach for Clutter-Resistant Floorplan Generation from Building Point Clouds

Kim, Seongyong;Yajima, Yosuke;Park, Jisoo;Chen, Jingdao;Cho, Yong K.
- International conference on construction engineering and project management
- /
- 2022.06a
- /
- pp.792-799
- /
- 2022
Building Information Modeling (BIM) technology is a key component of modern construction engineering and project management workflows. As-is BIM models that represent the spatial reality of a project site can offer crucial information to stakeholders for construction progress monitoring, error checking, and building maintenance purposes. Geometric methods for automatically converting raw scan data into BIM models (Scan-to-BIM) often fail to make use of higher-level semantic information in the data. Whereas, semantic segmentation methods only output labels at the point level without creating object level models that is necessary for BIM. To address these issues, this research proposes a hybrid semantic-geometric approach for clutter-resistant floorplan generation from laser-scanned building point clouds. The input point clouds are first pre-processed by normalizing the coordinate system and removing outliers. Then, a semantic segmentation network based on PointNet++ is used to label each point as ceiling, floor, wall, door, stair, and clutter. The clutter points are removed whereas the wall, door, and stair points are used for 2D floorplan generation. A region-growing segmentation algorithm paired with geometric reasoning rules is applied to group the points together into individual building elements. Finally, a 2-fold Random Sample Consensus (RANSAC) algorithm is applied to parameterize the building elements into 2D lines which are used to create the output floorplan. The proposed method is evaluated using the metrics of precision, recall, Intersection-over-Union (IOU), Betti error, and warping error.
PDF

GAP Estimation on Arterial Road via Vehicle Labeling of Drone Image (드론 영상의 차량 레이블링을 통한 간선도로 차간간격(GAP) 산정)

Jin, Yu-Jin;Bae, Sang-Hoon
- The Journal of The Korea Institute of Intelligent Transport Systems
- /
- v.16 no.6
- /
- pp.90-100
- /
- 2017
The purpose of this study is to detect and label the vehicles using the drone images as a way to overcome the limitation of the existing point and section detection system and vehicle gap estimation on Arterial road. In order to select the appropriate time zone, position, and altitude for the acquisition of the drone image data, the final image data was acquired by shooting under various conditions. The vehicle was detected by applying mixed Gaussian, image binarization and morphology among various image analysis techniques, and the vehicle was labeled by applying Kalman filter. As a result of the labeling rate analysis, it was confirmed that the vehicle labeling rate is 65% by detecting 185 out of 285 vehicles. The gap was calculated by pixel unitization, and the results were verified through comparison and analysis with Daum maps. As a result, the gap error was less than 5m and the mean error was 1.67m with the preceding vehicle and 1.1m with the following vehicle. The gaps estimated in this study can be used as the density of the urban roads and the criteria for judging the service level.
https://doi.org/10.12815/kits.2017.16.6.90 인용 PDF KSCI

Optimal Berth and Crane Scheduling Using Constraint Satisfaction Search and Heuristic Repair (제약만족 탐색과 휴리스틱 교정기법을 이용한 최적 선석 및 크레인 일정계획)

류광렬;김갑환;백영수;황준하;박영만
- Journal of Intelligence and Information Systems
- /
- v.6 no.2
- /
- pp.1-14
- /
- 2000
The berth and crane scheduling problem in a container terminal encompasses the whole process of assigning berth to each ship, determining the duration of berthing, assigning container cranes to each ship, and determining the specific start and end time of each crane service, for all the ships scheduled to be arriving at the terminal during a certain scheduling horizon. This problem is basically a constraint satisfaction problem in which cranes and berths should be assigned in such a way that all the spatial and temporal constraints are satisfied without any interference. However, it is also an optimization problem because the requested arrival and departure time should be met for as many of the scheduled ships as possible, while the operation cost of the terminal should be minimized. In this paper, we present an effective and efficient approach to solving this type of problem, which combines constrain satisfaction search and heuristic repair. We first employ a constraint satisfaction search to find a feasib1e solution. Then, the feasible solution is modified to a more optimal one by iteratively applying our heuristic repair operations within the framework of constraint satisfaction search. Experimental results with a real data from Pusan East Container Terminal showed that our approach can derive a schedule of satisfactory quality in a very short time.
PDF

Plans to Improve Smart Village and Its Challenges (스마트 빌리지, 그 계획과 도전)

Eom, Seong-Jun;Kim, Sang-Bum;Cho, Suk-Yeong;An, Phil-Gyun
- Journal of Agricultural Extension & Community Development
- /
- v.27 no.4
- /
- pp.173-184
- /
- 2020
Is the Fourth Industrial Revolution a revolution for cities only? Through the Fourth Industrial Revolution, Korea has entered quickly in the influence area of intelligent information technology such as IoT, AI, Big data, Cloud, ICT, Digital twin. However, as the information gap between the rural zone and the urban zone worsens, a policy was needed to reduce such a gap. Therefore, this research analyzed EU's smart village project, and investigated the problem and improvement of the actual smart village through the interview and field study with the person in charge of the actual smart village project in Korea. Based on the analytic result, 5 plans were deduced to improve Korea's smart village project. First, make the realistic adjustment of project period to assure the sustainability of smart village; second, make the new establishment of the department in charge of smart village project; third, construct the system of integrating and cooperating the policy that can embrace all the rural zone and the urban zone; the fourth, expand the application area of customized ICT technology according to the new rural policy environment; and finally introduce the residents' capacity development project through the rural guidance project.
https://doi.org/10.12653/jecd.2020.27.4.0173 인용 PDF KSCI

Image Restoration Filter using Combined Weight in Mixed Noise Environment (복합잡음 환경에서 결합가중치를 이용한 영상복원 필터)

Cheon, Bong-Won;Kim, Nam-Ho
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2021.05a
- /
- pp.210-212
- /
- 2021
In modern society, various digital equipment are being distributed due to the influence of the 4th industrial revolution, and they are used in a wide range of fields such as automated processes, intelligent CCTV, medical industry, robots, and drones. Accordingly, the importance of the preprocessing process in a system operating based on an image is increasing, and an algorithm for effectively reconstructing an image is drawing attention. In this paper, we propose a filter algorithm based on a combined weight value to reconstruct an image in a complex noise environment. The proposed algorithm calculates the weight according to the spatial distance and the weight according to the difference between the pixel values for the input image and the pixel values inside the filtering mask, respectively. The final output was filtered by applying the join weights calculated based on the two weights to the mask. In order to verify the performance of the proposed algorithm, we simulated it by comparing it with the existing filter algorithm.
PDF

Application and Evaluation of ITS Map Datum and Location Referencing System for ITS User Services (ITS서비스를 위한 Map Datum 및 위치참조체계 모델의 적용 및 평가)

최기주;이광섭
- Journal of Korean Society of Transportation
- /
- v.17 no.2
- /
- pp.55-68
- /
- 1999
Many ITS services require map databases in digital form to meet desired needs. Due to the dynamic nature of ITS and the sheer diversity of applications, the design and development of spatial databases to meet those needs pose a major challenge to both the public and private sectors. This challenge is further complicated by the necessity to transfer locationally referenced information between different kinds of databases and spatial data handling systems so that ITS products will work seamlessly across the region and nation. The Purpose of this paper is to develop the framework-models commonly to reference locations in the various applications and systems-the ITS Map Datum and LRS(Location Referencing System). The ITS Map Datum consists of the around control points which are the prime intersections (nodes) of the nationwide road network In this study, the major points have been determined along wish link-node modeling procedure. LRS, defined as a system for determining the position (location) of an entity relative to other entities or to some external frame of reference, has also been set up using CSOM type method. The method has been implemented using ArcView GIS software over the Kangnam and Seocho districts in the city of Seoul, showing that the implemented LRS scheme can be used successfully elsewhere. With the proper advent of the K.ITS architecture and services, the procedure can be used to improve the data sharing and to inter operate among systems, enhancing the efficiency both in terms of money and time.
PDF

Search Result 152, Processing Time 0.023 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)