• Title/Summary/Keyword: 공간정보추출언어

Search Result 50, Processing Time 0.025 seconds

Visualization analysis using R Shiny (R의 Shiny를 이용한 시각화 분석 활용 사례)

  • Na, Jonghwa;Hwang, Eunji
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.6
    • /
    • pp.1279-1290
    • /
    • 2017
  • R's {shiny} package provides an environment for creating web applications with only R scripts. Shiny does not require knowledge of a separate web programming language and its development is very easy and straightforward. In addition, Shiny has a variety of extensibility, and its functions are expanding day by day. Therefore, the presentation of high-quality results is an excellent tool for R-based analysts. In this paper, we present actual cases of large data analysis using Shiny. First, geological anomaly zone is extracted by analyzing topographical data expressed in the form of contour lines by analysis related to spatial data. Next, we will construct a model to predict major diseases by 16 cities and provinces nationwide using weather, environment, and social media information. In this process, we want to show that Shiny is very effective for data visualization and analysis.

Target Word Selection Disambiguation using Untagged Text Data in English-Korean Machine Translation (영한 기계 번역에서 미가공 텍스트 데이터를 이용한 대역어 선택 중의성 해소)

  • Kim Yu-Seop;Chang Jeong-Ho
    • The KIPS Transactions:PartB
    • /
    • v.11B no.6
    • /
    • pp.749-758
    • /
    • 2004
  • In this paper, we propose a new method utilizing only raw corpus without additional human effort for disambiguation of target word selection in English-Korean machine translation. We use two data-driven techniques; one is the Latent Semantic Analysis(LSA) and the other the Probabilistic Latent Semantic Analysis(PLSA). These two techniques can represent complex semantic structures in given contexts like text passages. We construct linguistic semantic knowledge by using the two techniques and use the knowledge for target word selection in English-Korean machine translation. For target word selection, we utilize a grammatical relationship stored in a dictionary. We use k- nearest neighbor learning algorithm for the resolution of data sparseness Problem in target word selection and estimate the distance between instances based on these models. In experiments, we use TREC data of AP news for construction of latent semantic space and Wail Street Journal corpus for evaluation of target word selection. Through the Latent Semantic Analysis methods, the accuracy of target word selection has improved over 10% and PLSA has showed better accuracy than LSA method. finally we have showed the relatedness between the accuracy and two important factors ; one is dimensionality of latent space and k value of k-NT learning by using correlation calculation.

Transportation Network Data Generation from the Topological Geographic Database (GIS위상구조자료로부터 교통망자료의 추출에 관한 연구)

  • 최기주
    • Spatial Information Research
    • /
    • v.2 no.2
    • /
    • pp.147-163
    • /
    • 1994
  • This paper presents three methods of generating the transportation network data out of the topological geographic database in the hope that the conversion of the geographic database file containing the topology to the conventional node-link type trans¬portation network file may facilitate the integration between transportation planning mod¬els and GIS by alleviating the inherent problems of both computing environments. One way of the proposed conversion method is to use the conversion software that allows the bi-directional conversion between the UTPS (Urban Transportation Planning System) type transportation planning model and GIS. The other two methods of data structure conversion approach directly transform the GIS's user-level topology into the transportation network data topology, and have been introduced with codes programmed with FORTRAN and AML (Arc Macro Language) of ARC/INFO. If used successfully, any approach would not only improve the efficiency of transportation planning process and the associated decision-making activities in it, but enhance the productivity of trans¬portation planning agencies.

  • PDF

Metadata Schema Design for Integrated Registry of B2B Business Processes (기업간 비즈니스 프로세스의 통합적 등록저장을 위한 메타데이터 스키마 설계)

  • Kim, Jong-Woo;Kim, Hyoung-Do
    • The Journal of Society for e-Business Studies
    • /
    • v.12 no.2
    • /
    • pp.195-217
    • /
    • 2007
  • B2B registries provide spaces to register and retrieve information which is necessary to support B2B transactions among business partners or potential business partners. Business process specifications are one of important contents in B2B registries, and there is high complexity of representation due to complex and dynamic characteristics of business processes. Also, currently there exist several competing specification frameworks such as ebXML BPSS, WSBPEL, BPMN, and so on. This paper proposes a metadata schema to register business process specifications which are represented by different specification frameworks. The proposed schema has extensibility to register business process specifications which are represented by various different specification frameworks. Also, it extends reuse level from whole business specification processes to their components. To show the usefulness of the proposed schema, this paper demonstrates metadata extraction from business process specifications which are represented by two representative XML-based business process specification languages, ebXML BPSS and WSBPEL.

  • PDF

A Road Luminance Measurement Application based on Android (안드로이드 기반의 도로 밝기 측정 어플리케이션 구현)

  • Choi, Young-Hwan;Kim, Hongrae;Hong, Min
    • Journal of Internet Computing and Services
    • /
    • v.16 no.2
    • /
    • pp.49-55
    • /
    • 2015
  • According to the statistics of traffic accidents over recent 5 years, traffic accidents during the night times happened more than the day times. There are various causes to occur traffic accidents and the one of the major causes is inappropriate or missing street lights that make driver's sight confused and causes the traffic accidents. In this paper, with smartphones, we designed and implemented a lane luminance measurement application which stores the information of driver's location, driving, and lane luminance into database in real time to figure out the inappropriate street light facilities and the area that does not have any street lights. This application is implemented under Native C/C++ environment using android NDK and it improves the operation speed than code written in Java or other languages. To measure the luminance of road, the input image with RGB color space is converted to image with YCbCr color space and Y value returns the luminance of road. The application detects the road lane and calculates the road lane luminance into the database sever. Also this application receives the road video image using smart phone's camera and improves the computational cost by allocating the ROI(Region of interest) of input images. The ROI of image is converted to Grayscale image and then applied the canny edge detector to extract the outline of lanes. After that, we applied hough line transform method to achieve the candidated lane group. The both sides of lane is selected by lane detection algorithm that utilizes the gradient of candidated lanes. When the both lanes of road are detected, we set up a triangle area with a height 20 pixels down from intersection of lanes and the luminance of road is estimated from this triangle area. Y value is calculated from the extracted each R, G, B value of pixels in the triangle. The average Y value of pixels is ranged between from 0 to 100 value to inform a luminance of road and each pixel values are represented with color between black and green. We store car location using smartphone's GPS sensor into the database server after analyzing the road lane video image with luminance of road about 60 meters ahead by wireless communication every 10 minutes. We expect that those collected road luminance information can warn drivers about safe driving or effectively improve the renovation plans of road luminance management.

A Statistical Prediction Model of Speakers' Intentions in a Goal-Oriented Dialogue (목적지향 대화에서 화자 의도의 통계적 예측 모델)

  • Kim, Dong-Hyun;Kim, Hark-Soo;Seo, Jung-Yun
    • Journal of KIISE:Software and Applications
    • /
    • v.35 no.9
    • /
    • pp.554-561
    • /
    • 2008
  • Prediction technique of user's intention can be used as a post-processing method for reducing the search space of an automatic speech recognizer. Prediction technique of system's intention can be used as a pre-processing method for generating a flexible sentence. To satisfy these practical needs, we propose a statistical model to predict speakers' intentions that are generalized into pairs of a speech act and a concept sequence. Contrary to the previous model using simple n-gram statistic of speech acts, the proposed model represents a dialogue history of a current utterance to a feature set with various linguistic levels (i.e. n-grams of speech act and a concept sequence pairs, clue words, and state information of a domain frame). Then, the proposed model predicts the intention of the next utterance by using the feature set as inputs of CRFs (Conditional Random Fields). In the experiment in a schedule management domain, The proposed model showed the precision of 76.25% on prediction of user's speech act and the precision of 64.21% on prediction of user's concept sequence. The proposed model also showed the precision of 88.11% on prediction of system's speech act and the Precision of 87.19% on prediction of system's concept sequence. In addition, the proposed model showed 29.32% higher average precision than the previous model.

Characteristics of Meteorological and Marine Environments for the Red Tide Occurrence of Mid-South Sea in Korea (한국 남해중부해역의 적조발생에 관한 기상 및 해양환경 특성)

  • 윤홍주;김승철;박일흠
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.7 no.4
    • /
    • pp.845-852
    • /
    • 2003
  • This study deals with the relationship between the red tide occurrence and the meteorological and marine factors, the prediction of areas where the red tide is likely to occur based on the information, and the satellite monitoring for the red tide in mid-South Sea of Korea. From 1990 to 2001, the red tide was observed every year and the number of occurrences increased as well. The red tide mostly occurred in July, August, and September. The most important meteorological factor governing the mechanisms of the increase in the number of red tide occurrences is found to be a heavy precipitation. It was found that the favorable marine environmental conditions for the red tide formation are some of marine factors such as the warm water temperature, the low salinity, the high suspended solid, the low phosphorus, and the low nitrogen. The necessary conditions for the red tide occurrence are found to be the heavy precipitation (23.4-54.5 mm) for 2∼4 days, the warm temperature $(24.6∼25.9^{\circ}C)$, proper sunshine (2∼10.3 h), and light winds (2∼4.6 m/s & SW) for the day in red tide occurrence. It was possible to monitor the spatial distributions and concentration of the red tide using the satellite images. It was found that the likely areas for red tide occurrence in August 2000 were Yosu - Dolsan coast, Gamak bay, Namhae coast, Marado coast, Goheung coast, and Deukryang bay.

Analyzing Self-Introduction Letter of Freshmen at Korea National College of Agricultural and Fisheries by Using Semantic Network Analysis : Based on TF-IDF Analysis (언어네트워크분석을 활용한 한국농수산대학 신입생 자기소개서 분석 - TF-IDF 분석을 기초로 -)

  • Joo, J.S.;Lee, S.Y.;Kim, J.S.;Kim, S.H.;Park, N.B.
    • Journal of Practical Agriculture & Fisheries Research
    • /
    • v.23 no.1
    • /
    • pp.89-104
    • /
    • 2021
  • Based on the TF-IDF weighted value that evaluates the importance of words that play a key role, the semantic network analysis(SNA) was conducted on the self-introduction letter of freshman at Korea National College of Agriculture and Fisheries(KNCAF) in 2020. The top three words calculated by TF-IDF weights were agriculture, mathematics, study (Q. 1), clubs, plants, friends (Q. 2), friends, clubs, opinions, (Q. 3), mushrooms, insects, and fathers (Q. 4). In the relationship between words, the words with high betweenness centrality are reason, high school, attending (Q. 1), garbage, high school, school (Q. 2), importance, misunderstanding, completion (Q.3), processing, feed, and farmhouse (Q. 4). The words with high degree centrality are high school, inquiry, grades (Q. 1), garbage, cleanup, class time (Q. 2), opinion, meetings, volunteer activities (Q.3), processing, space, and practice (Q. 4). The combination of words with high frequency of simultaneous appearances, that is, high correlation, appeared as 'certification - acquisition', 'problem - solution', 'science - life', and 'misunderstanding - concession'. In cluster analysis, the number of clusters obtained by the height of cluster dendrogram was 2(Q.1), 4(Q.2, 4) and 5(Q. 3). At this time, the cohesion in Cluster was high and the heterogeneity between Clusters was clearly shown.

A study on application of fractal structure on graphic design (그래픽 디자인에 있어서 프랙탈 구조의 활용 가능성 연구)

  • Moon, Chul
    • Archives of design research
    • /
    • v.17 no.1
    • /
    • pp.211-220
    • /
    • 2004
  • The Chaos theory of complexity and Fractal theory which became a prominent figure as a new paradigm of natural science should be understood not as whole, and not into separate elements of nature. Fractal Dimensions are used to measure the complexity of objects. We now have ways of measuring things that were traditionally meaningless or impossible to measure. They are capable of describing many irregularly shaped objects including man and nature. It is compatible method of application to express complexity of nature in the dimension of non-fixed number by placing our point of view to lean toward non-linear, diverse, endless time, and complexity when we look at our world. Fractal Dimension allows us to measure the complexity of an object. Having a wide application of fractal geometry and Chaos theory to the art field is the territory of imagination where art and science encounter each other and yet there has not been much research in this area. The formative word has been extracted in this study by analyzing objective data to grasp formative principle and geometric characteristic of (this)distinct figures of Fractals. With this form of research, it is not so much about fractal in mathematics, but the concept of self-similarity and recursiveness, randomness, devices expressed from unspeakable space, and the formative similarity to graphic design are focused in this study. The fractal figures have characteristics in which the structure doesn't change the nature of things of the figure even in the process if repeated infinitely many times, the limit of the process produces is fractal. Almost all fractals are at least partially self-similar. This means that a part of the fractal is identical to the entire fractal itself even if there is an enlargement to infinitesimal. This means any part has all the information to recompose as whole. Based on this scene, the research is intended to examine possibility of analysis of fractals in geometric characteristics in plasticity toward forms in graphic design. As a result, a beautiful proportion appears in graphic design with calculation of mathematic. It should be an appropriate equation to express nature since the fractal dimension allows us to measure the complexity of an object and the Fractla geometry should pick out high addition in value of peculiarity and characteristics in the complex of art and science. At the stage where the necessity of accepting this demand and adapting ourselves to the change is gathering strength is very significant in this research.

  • PDF

Value and Prosect of individual diary as research materials : Based on the "The 12th May Diaries Collection" (개인 일기의 연구 자료로서의 가치와 전망 "5월12일 일기컬렉션"을 중심으로)

  • Choi, Hyo Jin;Yim, Jin Hee
    • The Korean Journal of Archival Studies
    • /
    • no.46
    • /
    • pp.95-152
    • /
    • 2015
  • "Archives of Everyday Life" refers to an organization or facility which collects, appraises, selects and preserves the document from the memory of individuals, groups, or a society through categorizing and classifying lives and cultures of ordinary people. The document includes materials such as diaries, autobiography, letters, and notes. It also covers any digital files or hypertext like posts from blogs and online communities, or photos uploaded on Social Network Services. Many research fields including the Records Management Studies has continuously claimed the necessity of collection and preservation of ordinary people's records on daily life produced every moment. Especially diary is a written record reflecting the facts experienced by an individual and his self-examination. Its originality, individuality and uniqueness are considered truly valuable as a document regardless of the era. Lately many diaries have been discovered and presented to the historical research communities, and diverse researchers in human and social studies have embarked more in-depth research on diaries, their authors, and social background of the time. Furthermore, researchers from linguistics, educational studies, and psychology analyze linguistic behaviors, status of cultural assimilation, and emotional or psychological changes of an author. In this study, we are conducting a metastudy from various research on diaries in order to reaffirm the value of "The 12th May Diaries Collection" as everyday life archives. "The 12th May Diaries Collection" consists of diaries produced and donated directly by citizens on the 12th May every year. It was only 2013 when Digital Archiving Institute in Univ. of Myungji organized the first "Annual call for the 12th May". Now more than 2,000 items were collected including hand writing diaries, digital documents, photos, audio and video files, etc. The age of participants also varies from children to senior citizens. In this study, quantitative analysis will be made on the diaries collected as well as more profound discoveries on the detailed contents of each item. It is not difficult to see stories about family and friends, school life, concerns over career path, daily life and feelings of citizens ranging all different generations, regions, and professions. Based on keyword and descriptors of each item, more comprehensive examination will be further made. Additionally this study will also provide suggestions to examine future research opportunities of these diaries for different fields such as linguistics, educational studies, historical studies or humanities considering diverse formats and contents of diaries. Finally this study will also discuss necessary tasks and challenges for "the 12th May Diaries Collection" to be continuously collected and preserved as Everyday Life Archives.