• Title/Summary/Keyword: database structuring (데이터베이스 구조화)

Search Result 339, Processing Time 0.027 seconds

Improving Bidirectional LSTM-CRF model Of Sequence Tagging by using Ontology knowledge based feature (온톨로지 지식 기반 특성치를 활용한 Bidirectional LSTM-CRF 모델의 시퀀스 태깅 성능 향상에 관한 연구)

  • Jin, Seunghee;Jang, Heewon;Kim, Wooju
    • Journal of Intelligence and Information Systems / v.24 no.1 / pp.253-266 / 2018
  • This paper proposes applying a sequence tagging methodology to improve the performance of NER (Named Entity Recognition) used in QA systems. To retrieve the correct answers stored in a database, the user's query must be converted into a database language such as SQL (Structured Query Language) so that the computer can interpret it. This requires identifying the class or data name in the database to which the query refers. The existing method, which looks up the words of the query in the database to recognize objects, cannot distinguish homonyms or multi-word phrases because it does not consider the context of the user's query. When there are multiple search results, all of them are returned, so the query admits many interpretations and the time complexity of the computation grows. To overcome these problems, this study reflects the contextual meaning of the query using a Bidirectional LSTM-CRF. We also address a weakness of neural network models, their inability to identify untrained words, by using an ontology-knowledge-based feature. Experiments were conducted on an ontology knowledge base of the music domain, and the performance was evaluated. To evaluate the proposed L-Bidirectional LSTM-CRF accurately, we replaced words in the learned queries with untrained words and tested whether words that are in the database but were not seen during training are correctly identified. The results show that the model recognizes objects in context and recognizes untrained words without re-training the L-Bidirectional LSTM-CRF model, and that overall object recognition performance improves.
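
The ontology-knowledge-based feature described in this abstract can be pictured as a per-token lookup against the knowledge base, whose result is fed to the tagger alongside the word itself. The sketch below is only an illustration under stated assumptions: the lexicon entries, `MAX_SPAN`, and the function name `ontology_features` are hypothetical and are not taken from the paper, and the Bidirectional LSTM-CRF itself is not shown.

```python
from typing import Dict, List

# Hypothetical music-domain lexicon (assumption): surface form -> ontology class.
ONTOLOGY: Dict[str, str] = {
    "yesterday": "Song",
    "the beatles": "Artist",
    "abbey road": "Album",
}
MAX_SPAN = 3  # longest lexicon entry, in tokens (assumption)


def ontology_features(tokens: List[str]) -> List[str]:
    """Return one BIO-style ontology feature per token, using longest match."""
    feats = ["O"] * len(tokens)
    i = 0
    while i < len(tokens):
        matched = False
        # Try the longest candidate phrase first, then shorter ones.
        for n in range(min(MAX_SPAN, len(tokens) - i), 0, -1):
            phrase = " ".join(t.lower() for t in tokens[i:i + n])
            cls = ONTOLOGY.get(phrase)
            if cls:
                feats[i] = f"B-{cls}"
                for j in range(i + 1, i + n):
                    feats[j] = f"I-{cls}"
                i += n
                matched = True
                break
        if not matched:
            i += 1
    return feats
```

Because the feature comes from the lexicon rather than from trained word embeddings, adding a new entry to `ONTOLOGY` changes the feature for a previously unseen word without any re-training, which is the property the abstract emphasizes.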

Design and Implementation of MongoDB-based Unstructured Log Processing System over Cloud Computing Environment (클라우드 환경에서 MongoDB 기반의 비정형 로그 처리 시스템 설계 및 구현)

  • Kim, Myoungjin;Han, Seungho;Cui, Yun;Lee, Hanku
    • Journal of Internet Computing and Services / v.14 no.6 / pp.71-84 / 2013
  • Log data, which record the multitude of information created when operating computer systems, are utilized in many processes, from carrying out computer system inspection and process optimization to providing customized user optimization. In this paper, we propose a MongoDB-based unstructured log processing system in a cloud environment for processing the massive amount of log data of banks. Most of the log data generated during banking operations come from handling a client's business. Therefore, in order to gather, store, categorize, and analyze the log data generated while processing the client's business, a separate log data processing system needs to be established. However, the realization of flexible storage expansion functions for processing a massive amount of unstructured log data and executing a considerable number of functions to categorize and analyze the stored unstructured log data is difficult in existing computer environments. Thus, in this study, we use cloud computing technology to realize a cloud-based log data processing system for processing unstructured log data that are difficult to process using the existing computing infrastructure's analysis tools and management system. The proposed system uses the IaaS (Infrastructure as a Service) cloud environment to provide a flexible expansion of computing resources and includes the ability to flexibly expand resources such as storage space and memory under conditions such as extended storage or rapid increase in log data. Moreover, to overcome the processing limits of the existing analysis tool when a real-time analysis of the aggregated unstructured log data is required, the proposed system includes a Hadoop-based analysis module for quick and reliable parallel-distributed processing of the massive amount of log data. 
Furthermore, because HDFS (Hadoop Distributed File System) stores data by replicating the blocks of the aggregated log data, the proposed system offers automatic restore functions that allow it to keep operating after recovering from a malfunction. Finally, by establishing a distributed database using the NoSQL-based MongoDB, the proposed system provides methods of effectively processing unstructured log data. Relational databases such as MySQL have complex schemas that are inappropriate for processing unstructured log data. Further, the strict schemas of relational databases make it difficult to add nodes when rapidly growing data must be distributed across multiple nodes. NoSQL does not provide the complex computations that relational databases may provide, but it can easily expand the database through node dispersion when the amount of data increases rapidly; it is a non-relational database with an appropriate structure for processing unstructured data. NoSQL data models are usually classified as key-value, column-oriented, and document-oriented types. Of these, MongoDB, the representative document-oriented database with a schema-free structure, is used in the proposed system. MongoDB was adopted because its flexible schema makes it easy to process unstructured log data, it facilitates flexible node expansion when the amount of data is rapidly increasing, and it provides an Auto-Sharding function that automatically expands storage. The proposed system is composed of a log collector module, a log graph generator module, a MongoDB module, a Hadoop-based analysis module, and a MySQL module.
When the log data generated over the entire client business process of each bank are sent to the cloud server, the log collector module collects and classifies data according to the type of log data and distributes it to the MongoDB module and the MySQL module. The log graph generator module generates the results of the log analysis of the MongoDB module, Hadoop-based analysis module, and the MySQL module per analysis time and type of the aggregated log data, and provides them to the user through a web interface. Log data that require a real-time log data analysis are stored in the MySQL module and provided real-time by the log graph generator module. The aggregated log data per unit time are stored in the MongoDB module and plotted in a graph according to the user's various analysis conditions. The aggregated log data in the MongoDB module are parallel-distributed and processed by the Hadoop-based analysis module. A comparative evaluation is carried out against a log data processing system that uses only MySQL for inserting log data and estimating query performance; this evaluation proves the proposed system's superiority. Moreover, an optimal chunk size is confirmed through the log data insert performance evaluation of MongoDB for various chunk sizes.
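
The division of labor between the log collector, the MySQL module, and the MongoDB module can be sketched as follows. This is a minimal illustration, not the authors' implementation: plain Python lists stand in for the two stores, and `REALTIME_TYPES`, the record fields `ts` and `type`, and both function names are assumptions made for the example.

```python
from collections import defaultdict
from typing import Any, Dict, List, Tuple

# Hypothetical log types that need real-time analysis (assumption).
REALTIME_TYPES = {"transaction", "login"}


def route_logs(records: List[Dict[str, Any]]) -> Dict[str, List[Dict[str, Any]]]:
    """Classify records by type and pick a destination store for each one."""
    routed: Dict[str, List[Dict[str, Any]]] = {"mysql": [], "mongodb": []}
    for rec in records:
        target = "mysql" if rec.get("type") in REALTIME_TYPES else "mongodb"
        routed[target].append(rec)
    return routed


def aggregate_per_unit_time(
    records: List[Dict[str, Any]], unit_seconds: int = 60
) -> Dict[Tuple[int, str], int]:
    """Count records per (time bucket, log type); 'ts' is seconds since epoch."""
    counts: Dict[Tuple[int, str], int] = defaultdict(int)
    for rec in records:
        bucket = rec["ts"] - rec["ts"] % unit_seconds  # start of the bucket
        counts[(bucket, rec["type"])] += 1
    return dict(counts)
```

In a real deployment the two lists would be replaced by writes through the respective database drivers, and the aggregation step would run on the Hadoop-based analysis module rather than in a single process.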

Antipodal Structuralization Strategy of Character Appearing in <Vagabond>: Based on Psychological Functions of MBTI Personality Types Theory (<배가본드>에 나타난 캐릭터의 대척적 구조화 전략: MBTI 성격유형론의 심리기능에 근거하여)

  • Yang, Se-Hyeok;Kim, Dae-Gwon
    • Cartoon and Animation Studies / s.31 / pp.117-152 / 2013
  • <Vagabond> is a comic series based on an original novel by Yoshikawa Eiji. Since serialization began in 1998, this epic has recorded cumulative sales of more than 54 million copies, averaging 1.7 million copies per volume, and 34 volumes have been published to date. <Vagabond> lets its narrative follow naturally from the personalities of its characters, in keeping with the author's rule that 'people should be described as what they are'. As a result, its characterization is highly distinctive. This study focuses on the fact that the numerous characters in <Vagabond>, despite their strong personalities, maintain a structural balance because the character composition is built on antipodal relationships. To analyze these relationships, the study used MBTI personality type theory, a psychological testing tool, as its analytic frame. First, the study inferred personality patterns as MBTI temperamental characteristics, and then analyzed the antipodal character composition based on the combination of cognition and judgment, which are presumably the most important functions. From this, the study identified the following three structures. (1) The antipodal relationship between Musashi, the main character, and Kojiro, a mirror character, is central to the work. (2) The antipodal relationship between their fosterers and the characters playing the mentor's role extends the character attributes of Musashi and Kojiro. (3) The Yoshioka family is also set in an antipodal composition, exchanging influences with Musashi and Kojiro. From this, the study concluded that the contrasting pairs of characters in <Vagabond> are set up as if to reach a dialectic synthesis.
As such, the antipodal structuralization of the character composition in <Vagabond> is deemed to differentiate the inner sides of numerous unique characters, thereby making it possible to describe them in depth. Finally, the following common context is found: comics and animation works that succeed both critically and commercially focus on their characters. This is probably because their consumers take a relatively strong interest in the characters, as characters in comics or animation are differentiated from those in novels or movies. Accordingly, the results of this characterization analysis are expected to serve as a reference in content production if they are compiled into a database.

A Research on Applicability of Drone Photogrammetry for Dam Safety Inspection (드론 Photogrammetry 기반 댐 시설물 안전점검 적용성 연구)

  • DongSoon Park;Jin-Il Yu;Hojun You
    • Journal of the Korea institute for structural maintenance and inspection / v.27 no.5 / pp.30-39 / 2023
  • Large dams, which are critical infrastructures for disaster prevention, are exposed to various risks such as aging, floods, and earthquakes. Better dam safety inspection and diagnosis using digital transformation technologies are needed. Traditional visual inspection methods by human inspectors have several limitations, including many inaccessible areas, danger of working at heights, and know-how based subjective inspections. In this study, drone photogrammetry was performed on two large dams to evaluate the applicability of digital data-based dam safety inspection and propose a data management methodology for continuous use. High-quality 3D digital models with GSD (ground sampling distance) within 2.5 cm/pixel were generated by flat double grid missions and manual photography methods, despite reservoir water surface and electromagnetic interferences, and severe altitude differences ranging from 42 m to 99.9 m of dam heights. Geometry profiles of the as-built conditions were easily extracted from the generated 3D mesh models, orthomosaic images, and digital surface models. The effectiveness of monitoring dam deformation by photogrammetry was confirmed. Cracks and deterioration of dam concrete structures, such as spillways and intake towers, were detected and visualized efficiently using the digital 3D models. This can be used for safe inspection of inaccessible areas and avoiding risky tasks at heights. Furthermore, a methodology for mapping the inspection result onto the 3D digital model and structuring a relational database for managing deterioration information history was proposed. As a result of measuring the labor and time required for safety inspection at the SYG Dam spillway, the drone photogrammetry method was found to have a 48% productivity improvement effect compared to the conventional manpower visual inspection method. 
The drone photogrammetry-based dam safety inspection is considered very effective in improving work productivity and data reliability.
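
The GSD figure quoted in this abstract depends on flight height and camera geometry through the standard nadir-image relation GSD = (sensor width × flight height) / (focal length × image width). The sketch below works through that arithmetic. The camera parameters in the usage note are hypothetical and are not taken from the study; they only show how a target of 2.5 cm/pixel constrains the distance to the dam surface.

```python
def gsd_cm_per_pixel(sensor_width_mm: float, focal_length_mm: float,
                     image_width_px: int, distance_m: float) -> float:
    """Ground sampling distance in cm/pixel for a camera facing the surface."""
    # mm/mm cancels; distance in metres times 100 gives centimetres.
    return (sensor_width_mm * distance_m * 100.0) / (focal_length_mm * image_width_px)


def max_distance_m(sensor_width_mm: float, focal_length_mm: float,
                   image_width_px: int, target_gsd_cm: float) -> float:
    """Largest camera-to-surface distance that still meets the target GSD."""
    return (target_gsd_cm * focal_length_mm * image_width_px) / (sensor_width_mm * 100.0)
```

For example, with an assumed 13.2 mm wide sensor, 8.8 mm focal length, and 5472-pixel image width, `gsd_cm_per_pixel(13.2, 8.8, 5472, 90.0)` is roughly 2.47 cm/pixel, and `max_distance_m(13.2, 8.8, 5472, 2.5)` gives about 91.2 m.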

MLP-based 3D Geotechnical Layer Mapping Using Borehole Database in Seoul, South Korea (MLP 기반의 서울시 3차원 지반공간모델링 연구)

  • Ji, Yoonsoo;Kim, Han-Saem;Lee, Moon-Gyo;Cho, Hyung-Ik;Sun, Chang-Guk
    • Journal of the Korean Geotechnical Society / v.37 no.5 / pp.47-63 / 2021
  • Recently, demand has been growing for three-dimensional (3D) underground maps from the perspective of digital twins, and for their linked use. However, the vastness of national geotechnical survey data and the uncertainty in applying geostatistical techniques pose challenges in modeling underground regional geotechnical characteristics. In this study, an optimal learning model based on a multi-layer perceptron (MLP) was constructed for 3D subsurface lithological and geotechnical classification in Seoul, South Korea. First, the geotechnical layer and 3D spatial coordinates of each borehole dataset in the Seoul area were compiled into a geotechnical database in a standardized format, and data pre-processing for machine learning, such as correcting missing values and normalization, was performed. An optimal fitting model was designed through hyperparameter optimization of the MLP model and model performance evaluation, such as precision and accuracy tests. Then, a 3D grid network locally assigning geotechnical layer classification was constructed by applying the MLP-based best-fitting model to each unit lattice. The constructed 3D geotechnical layer map was evaluated by comparing it with the results of a geostatistical interpolation technique and the topsoil properties of the geological map.

Digital Documentation and Short-term Monitoring on Original Rampart Wall of the Gyejoksanseong Fortress in Daejeon, Korea (대전 계족산성 원형성벽의 디지털기록화 및 단기모니터링 연구)

  • Kim, Sung Han;Lee, Chan Hee;Jo, Young Hoon
    • Economic and Environmental Geology / v.52 no.2 / pp.169-188 / 2019
  • This study carried out unmanned aerial photography and terrestrial laser scanning to establish a digital database of the original wall of the Gyejoksanseong fortress, and measured ground control points to ensure continuity of monitoring. It also performed precise naked-eye examination, unmanned aerial photogrammetry, endoscopy, total station surveying, and handy measurement to examine the structural stability of the original walls. Ground control points were chosen where a clear field of view could be secured, and 3 points were selected around each of the south and north walls. For the right side of the south original wall, aerial photogrammetry was conducted using drones, and a deviation analysis of 3-dimensional digital models was performed for short-term monitoring. As a result, the two original wall models matched almost entirely within 5 mm, and no difference indicating displacement of stones was found, except for partial deviation. Regular monitoring of areas with structural deformation, such as bulging, weak zones, and fracture zones, through precise naked-eye examination and high-resolution photo data revealed no distinct change. Endoscopic observation of the inner foundation showed that the filling stones of the original walls remain, while most of the filling soil has been lost. Total station measurements focused on points with structural deformation showed that the maximum displacements of the north and south walls were somewhat high, at 6.6 mm and 3.8 mm, respectively, while the final displacements were relatively stable, below 2.9 mm and 1.4 mm, respectively. Handy measurement also did not reveal clear structural deformation, with displacements below 0.82 mm at all points. Even though the displacement monitoring results for the original walls are stable, structural stability cannot be assured, because ramparts are prone to sudden brittle fracture.
Therefore, it is necessary to conduct conservational scientific diagnosis, precise monitoring, and structural analysis based on the 3-dimensional figuration information obtained in this research.

Making Cache-Conscious CCMR-trees for Main Memory Indexing (주기억 데이타베이스 인덱싱을 위한 CCMR-트리)

  • 윤석우;김경창
    • Journal of KIISE:Databases / v.30 no.6 / pp.651-665 / 2003
  • Reducing cache misses has emerged as the most important issue for today's main memory databases, as CPU speeds have been increasing at 60% per year while memory speeds have increased at only 10% per year. Recent research has demonstrated that cache-conscious index structures such as the CR-tree outperform the R-tree variants. However, because the CR-tree uses a lossy compression scheme, its search performance can be poorer than that of the original R-tree. In this paper, we propose an alternative cache-conscious version of the R-tree, which we call the MR-tree. The MR-tree propagates node splits upward only if one of the internal nodes on the insertion path has empty room. Thus, the internal nodes of the MR-tree are almost 100% full. If there is no empty room on the insertion path, a newly created leaf simply becomes a child of the split leaf. The height of the MR-tree therefore grows according to the order in which objects are inserted, and the HeightBalance algorithm is executed when unbalanced heights of child nodes are detected. Additionally, we propose the CCMR-tree in order to build a more cache-conscious MR-tree. Our experimental and analytical study shows that the two-dimensional MR-tree performs search up to 2.4 times faster than the ordinary R-tree while maintaining slightly better update performance and using similar memory space.
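
To see why a lossy compression scheme can hurt search, consider a sketch of the kind of bounding-rectangle quantization associated with the CR-tree: each rectangle is stored relative to a reference rectangle using a few bits per coordinate, rounding outward so that the stored rectangle always encloses the true one. The 4-bit width, the function names, and the sample coordinates are assumptions made for this illustration; the abstract itself does not specify the scheme.

```python
import math
from typing import Tuple

Rect = Tuple[float, float, float, float]  # (xmin, ymin, xmax, ymax)


def quantize(mbr: Rect, ref: Rect, bits: int = 4) -> Tuple[int, int, int, int]:
    """Quantize mbr relative to ref; assumes mbr lies inside a non-degenerate ref."""
    levels = (1 << bits) - 1
    w, h = ref[2] - ref[0], ref[3] - ref[1]
    return (
        math.floor((mbr[0] - ref[0]) * levels / w),  # lower bounds round down
        math.floor((mbr[1] - ref[1]) * levels / h),
        math.ceil((mbr[2] - ref[0]) * levels / w),   # upper bounds round up
        math.ceil((mbr[3] - ref[1]) * levels / h),
    )


def dequantize(q: Tuple[int, int, int, int], ref: Rect, bits: int = 4) -> Rect:
    """Recover the (enlarged) rectangle that the quantized values represent."""
    levels = (1 << bits) - 1
    w, h = ref[2] - ref[0], ref[3] - ref[1]
    return (ref[0] + q[0] * w / levels, ref[1] + q[1] * h / levels,
            ref[0] + q[2] * w / levels, ref[1] + q[3] * h / levels)


def intersects(a: Rect, b: Rect) -> bool:
    """True when two axis-aligned rectangles overlap or touch."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]
```

With `ref = (0, 0, 100, 100)` and `mbr = (12, 30, 40, 55)`, the dequantized rectangle is larger than the original, so a query such as `(7, 27, 10, 29)` overlaps the stored rectangle without overlapping the true one. Such false hits cost extra node visits, which is the search penalty the abstract attributes to lossy compression.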

GIS spatial D/B formation of geothermal data and Distribution of Heat Flow of Korea (한국의 지열자료 GIS 공간 D/B 구축과 지열류량 분포)

  • Kim, Hyoung-Chan;Lee, Young-Min;Park, Jeong-Min
    • The Korean Society for New and Renewable Energy: Conference Proceedings (한국신재생에너지학회:학술대회논문집) / 2006.06a / pp.459-460 / 2006
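  • Heat flow measurements for South Korea currently comprise data measured and collected at a total of 363 sites. This total includes 35 data points from Mizutani et al. (1970), Jang et al. (1970), and Seo (1976). Heat flow data measured since 1989 number 217 (Lim et al., 1989; Lim et al., 1996; Lim and Kim, 1997; Yeom et al., 1997). All of them were measured directly, but the data reported since 1989 contained some errors, which were corrected and supplemented in this study. The 35 older data points are heat flow values already published in papers, and no rock samples exist for them. For the 217 data points from 1989 through 2004 and the 111 data points added in 2005, rock samples are available. Because different measuring instruments were used and errors were possible, mutual calibration was needed, so rocks around the boreholes were newly collected and remeasured with new equipment for correction. The heat flow database consists of each record's serial number, unique identifier (Sn.), longitude and latitude coordinates, rock thermal conductivity, thermal gradient, and heat flow. The spatial database of heat flow data has point geometry and was prepared in shapefile format, which is compatible with a wide range of software. In addition, thermal properties such as thermal diffusivity, porosity, density, and specific heat, which are essential inputs for designing heating and cooling systems that use the thermal properties of shallow soil and rock (that is, heat pump systems), were added to the GIS spatial database. Excluding 4 continental shelf data points, a heat flow distribution map of the southern Korean Peninsula, that is, South Korea, was prepared and analyzed from 359 heat flow data points (Fig. 1). Heat flow anomalies appear around Asan Bay and in the Boryeong, Yuseong, Jinan, Uljin, Pohang, and Busan areas, as well as in Pocheon, Sokcho, Chungju, and Suanbo. Hot springs have generally long existed or been newly developed around these anomalies. Data from boreholes used for hot springs were excluded, but data from boreholes not directly affected by hot spring use were included. Even so, areas near hot springs show relatively high heat flow values because convection caused by hot spring pumping has raised temperatures in the surrounding area. Meanwhile, the newly added data revealed a changed heat flow distribution in the southeastern part of the Korean Peninsula. High heat flow is observed near the Osaek hot spring area in northern Gangwon, and NNE-SSW-trending heat flow anomalies are developed around the Miryang, Moryang, and Dongnae faults, which run in the same direction as the Yangsan fault, one of Korea's major faults. This indicates that heat flow is related to geological structure. In particular, because geothermal water can circulate to great depths around these fault zones, this convection is judged to carry high ground temperatures up to near the surface.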
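
The database fields listed in this abstract are tied together by the conductive heat flow relation: heat flow is the product of thermal conductivity and thermal gradient. The sketch below shows one possible record layout and that calculation. The class name, field names, units, and sample values are assumptions made for illustration and do not come from the database itself.

```python
from dataclasses import dataclass


@dataclass
class HeatFlowRecord:
    """One point record; field names and units are hypothetical."""
    serial_no: int
    sn: str                      # unique identifier (Sn.)
    longitude: float             # decimal degrees
    latitude: float              # decimal degrees
    conductivity_w_mk: float     # thermal conductivity, W/(m*K)
    gradient_c_per_km: float     # thermal gradient, degrees C per km

    @property
    def heat_flow_mw_m2(self) -> float:
        """Conductive heat flow in mW/m^2 (conductivity times gradient)."""
        # 1 degree C/km equals 1e-3 K/m, so W/(m*K) times C/km yields mW/m^2.
        return self.conductivity_w_mk * self.gradient_c_per_km
```

For instance, a record with an assumed conductivity of 3.0 W/(m·K) and a gradient of 25 °C/km yields a heat flow of 75 mW/m².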

3D GIS Network Modeling of Indoor Building Space Using CAD Plans (CAD 도면을 이용한 건축물 내부 공간의 3차원 GIS 네트워크 모델링)

  • Kang Jung A;Yom Jee-Hong;Lee Dong-Cheon
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography / v.23 no.4 / pp.375-384 / 2005
  • Three-dimensional urban models are increasingly applied for purposes such as city planning, telecommunication cell planning, traffic analysis, environmental monitoring, and disaster management. In recent years, CAD and GIS technologies have been merging to find optimal solutions for three-dimensional modeling of urban buildings. These solutions include modeling of the interior building space as well as visualization of its exterior shape. Research and development in this area has been carried out by scientists and engineers from Computer Graphics, CAD, and GIS. Computer Graphics and CAD have focused on precise and efficient visualization, whereas GIS has emphasized topology and spatial analysis. Complementary research is required for an effective model that serves both visualization and spatial analysis. This study presents an efficient way of using the CAD plans included in building register documents to reconstruct the internal space of buildings. Topological information was built in the geospatial database and merged with the geometric information of the CAD plans, as well as other attribute data from the building register. The GIS network modeling method introduced in this study is expected to enable effective 3-dimensional spatial analysis of building interiors, which are growing in complexity and size.
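
The topological side of such an indoor model can be pictured as a graph in which rooms, corridors, and stairs are nodes and passable connections are edges, so that spatial analysis reduces to graph queries. The sketch below is a generic illustration under that assumption. The node names and both functions are hypothetical, and the study's actual data model is not reproduced here.

```python
from collections import deque
from typing import Dict, Iterable, List, Optional, Tuple


def build_adjacency(edges: Iterable[Tuple[str, str]]) -> Dict[str, List[str]]:
    """Build an undirected adjacency list from (space, space) connections."""
    adjacency: Dict[str, List[str]] = {}
    for a, b in edges:
        adjacency.setdefault(a, []).append(b)
        adjacency.setdefault(b, []).append(a)
    return adjacency


def shortest_route(adjacency: Dict[str, List[str]],
                   start: str, goal: str) -> Optional[List[str]]:
    """Breadth-first search: route with the fewest hops, or None if unreachable."""
    if start == goal:
        return [start]
    previous: Dict[str, Optional[str]] = {start: None}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for nxt in adjacency.get(node, ()):
            if nxt in previous:
                continue
            previous[nxt] = node
            if nxt == goal:
                # Walk the predecessor links back to the start.
                path = [goal]
                while previous[path[-1]] is not None:
                    path.append(previous[path[-1]])
                return path[::-1]
            queue.append(nxt)
    return None
```

Given connections such as Room101 to Corridor1, Corridor1 to Stair, Stair to Corridor2, and Corridor2 to Room201, `shortest_route` returns the sequence of spaces to traverse between two rooms on different floors. Edge weights taken from the CAD geometry could replace hop counts if metric distances were needed.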

The Strategy of Characterizing Space that uses Anti-House as a Metaphor for Character's Self-Defense Mechanism - Focusing on the TV Series <Evangelion> and Its Theater Version - (캐릭터의 자아방어기제를 은유하는 '안티돔' 공간의 성격화 전략 - <에반게리온>의 TV 시리즈와 극장판 를 중심으로 -)

  • Yang, Se-Hyeok;Ryu, Beom-Yeol
    • Cartoon and Animation Studies / s.41 / pp.75-106 / 2015
  • Animations characterize space as a strategy to show the inner conflicts of characters effectively and to highlight the theme. During inner conflict, characters unconsciously use defense mechanisms to protect their egos from the fear that comes from deficiency, and because self-defense mechanisms are self-deceptive, reality is distorted and conflicts intensify. This study focuses on the concept of the anti-house, the space where conflicts intensify, analyzes animations to identify aspects of inner conflict, and interprets the characteristics of space used as a metaphoric structural frame. It also aims to reveal how the defense mechanism, which intensifies the characters' inner conflict, is characterized as an anti-house. The analysis was done mainly with the TV series <Evangelion> and its theater version, because the characters suffer serious deficiency from broken homes and have a psychological quality of closed boundaries, symbolized as the 'A.T. field'. In particular, the core character, 'Shinji Ikari', shows how a character uses a compulsive self-defense mechanism to deal with inner conflict and, as a result, goes through ego collapse and then introspection. This experience is the core of the whole plot. Through the analysis, the relationship between the character's self-defense mechanism and the space of the anti-house (which expands to the anti-city) was inferred. The space is made up of three axes: the x-axis of horizontal space, the y-axis of vertical space, and, in the sense that the whole space has no exit, the z-axis of deeper contradictory space. This study started from the judgment that <Evangelion> is the most suitable work for analyzing the metaphorical relationship between the self-defense mechanism and the anti-house. There were limitations, however, as typical characteristics of Japanese animation, namely pedantic composition and openness to broad interpretation, hindered clear verification.
Hopefully, this limitation will be overcome by follow-up studies. This study is expected to show the importance of space in interpreting the text of animations and to serve as a database for other creative works.