• Title/Summary/Keyword: 이용분야별

Search Result 16,007, Processing Time 0.052 seconds

Current Status and Perspectives in Varietal Improvement of Rice Cultivars for High-Quality and Value-Added Products (쌀 품질 고급화 및 고부가가치화를 위한 육종현황과 전망)

  • 최해춘
    • KOREAN JOURNAL OF CROP SCIENCE
    • /
    • v.47
    • /
    • pp.15-32
    • /
    • 2002
  • The endeavors enhancing the grain quality of high-yielding japonica rice were steadily continued during 1980s-1990s along with the self-sufficiency of rice production and the increasing demands of high-quality rices. During this time, considerably great progress and success was obtained in development of high-quality japonica cultivars and quality evaluation techniques including the elucidation of interrelationship between the physicochemical properties of rice grain and the physical or palatability components of cooked rice. In 1990s, some high-quality japonica rice cultivars and special rices adaptable for food processing such as large kernel, chalky endosperm, aromatic and colored rices were developed and its objective preference and utility was also examined by a palatability meter, rapid-visco analyzer and texture analyzer, Recently, new special rices such as extremely low-amylose dull or opaque non-glutinous endosperm mutants were developed. Also, a high-lysine rice variety was developed for higher nutritional utility. The water uptake rate and the maximum water absorption ratio showed significantly negative correlations with the K/Mg ratio and alkali digestion value(ADV) of milled rice. The rice materials showing the higher amount of hot water absorption exhibited the larger volume expansion of cooked rice. The harder rices with lower moisture content revealed the higher rate of water uptake at twenty minutes after soaking and the higher ratio of maximum water uptake under the room temperature condition. These water uptake characteristics were not associated with the protein and amylose contents of milled rice and the palatability of cooked rice. The water/rice ratio (in w/w basis) for optimum cooking was averaged to 1.52 in dry milled rices (12% wet basis) with varietal range from 1.45 to 1.61 and the expansion ratio of milled rice after proper boiling was average to 2.63(in v/v basis). The major physicochemical components of rice grain associated with the palatability of cooked rice were examined using japonica rice materials showing narrow varietal variation in grain size and shape, alkali digestibility, gel consistency, amylose and protein contents, but considerable difference in appearance and texture of cooked rice. The glossiness or gross palatability score of cooked rice were closely associated with the peak, hot paste and consistency viscosities of viscosities with year difference. The high-quality rice variety "IIpumbyeo" showed less portion of amylose on the outer layer of milled rice grain and less and slower change in iodine blue value of extracted paste during twenty minutes of boiling. This highly palatable rice also exhibited very fine net structure in outer layer and fine-spongy and well-swollen shape of gelatinized starch granules in inner layer and core of cooked rice kernel compared with the poor palatable rice through image of scanning electronic microscope. Gross sensory score of cooked rice could be estimated by multiple linear regression formula, deduced from relationship between rice quality components mentioned above and eating quality of cooked rice, with high probability of determination. The $\alpha$-amylose-iodine method was adopted for checking the varietal difference in retrogradation of cooked rice. The rice cultivars revealing the relatively slow retrogradation in aged cooked rice were IIpumbyeo, Chucheongyeo, Sasanishiki, Jinbubyeo and Koshihikari. A Tonsil-type rice, Taebaegbyeo, and a japonica cultivar, Seomjinbyeo, showed the relatively fast deterioration of cooked rice. Generally, the better rice cultivars in eating quality of cooked rice showed less retrogradation and much sponginess in cooled cooked rice. Also, the rice varieties exhibiting less retrogradation in cooled cooked rice revealed higher hot viscosity and lower cool viscosity of rice flour in amylogram. The sponginess of cooled cooked rice was closely associated with magnesium content and volume expansion of cooked rice. The hardness-changed ratio of cooked rice by cooling was negatively correlated with solids amount extracted during boiling and volume expansion of cooked rice. The major physicochemical properties of rice grain closely related to the palatability of cooked rice may be directly or indirectly associated with the retrogradation characteristics of cooked rice. The softer gel consistency and lower amylose content in milled rice revealed the higher ratio of popped rice and larger bulk density of popping. The stronger hardness of rice grain showed relatively higher ratio of popping and the more chalky or less translucent rice exhibited the lower ratio of intact popped brown rice. The potassium and magnesium contents of milled rice were negatively associated with gross score of noodle making mixed with wheat flour in half and the better rice for noodle making revealed relatively less amount of solid extraction during boiling. The more volume expansion of batters for making brown rice bread resulted the better loaf formation and more springiness in rice breed. The higher protein rices produced relatively the more moist white rice bread. The springiness of rice bread was also significantly correlated with high amylose content and hard gel consistency. The completely chalky and large grain rices showed better suitability far fermentation and brewing. The glutinous rice were classified into nine different varietal groups based on various physicochemical and structural characteristics of endosperm. There was some close associations among these grain properties and large varietal difference in suitability to various traditional food processing. Our breeding efforts on improvement of rice quality for high palatability and processing utility or value-adding products in the future should focus on not only continuous enhancement of marketing and eating qualities but also the diversification in morphological, physicochemical and nutritional characteristics of rice grain suitable for processing various value-added rice foods.ice foods.

Ultrasonographic study on the masseter muscle thickness of adult Korean (한국인 성인의 교근 두께에 관한 초음파검사적 연구)

  • Cha, Bong-Kuen;Park, In-Woo;Lee, Yeun-Hee
    • The korean journal of orthodontics
    • /
    • v.31 no.2 s.85
    • /
    • pp.225-236
    • /
    • 2001
  • It is widely accepted that the shape and structure of bone are closely related to the activity of attached muscle. Numerous clinical and animal experimental studies indicated the significant effects of masticatory muscle function on maxillofacial morphology. Recently, the development of ultrasonography has spread throughout different fields of medicine. In the clinical examinations, ultrasonography is a convenient, inexpensive technique to apply with accurate and reliable results. The aim of this study is to assess the thickness of the masseter muscle and its correlation to maxillofacial skeleton by examining 35 male and 15 female dental students at Kangnung National University. The masseter muscle thickness of the subjects were measured by ultrasonographic scanning with a 7.5MHz linear probe, and their maxillofacial morphology were investigated by lateral cephalometric radiographs. The relationship between the masseter muscle thickness and maxillofacial morphology of normal adult was statistically analyzed, and the following results were obtained. 1. The average thickness of male masseter muscle was 13.8${\pm}$1.71mm in the relaxed state and 14.8${\pm}$1.77mm at maximal clenching state, while that of female was 11.6${\pm}$1.58mm and 12.4${\pm}$1.47mm, respectively. Ethnic difference in thickness of the masseter muscle and maxillofacial skeleton was found when the results of many researchers were compared with those of this study. 2. The thickness of the masseter muscle in both sexes increased significantly at maximal clenching state than in relaxed state(P<0.05). 3. The masseter muscle thickness of male was greater than that of female both in the relaxed state and maximal clenching states(P<0.05). 4. In males, the thickness of the masseter muscle was negatively correlated with the mandibular plane angle and positively correlated with the mandibular ramus height and anterior cranial base length(P<0.05). It may suggest that the male with thicker masseter muscle has smaller facial divergence. 5. No significant correlation was found between the masseter muscle thickness and maxillofacial morphology in females(P<0.05). Therefore, these data suggest that ultrasonography can add valuable information to the conventional examinations of masseter muscle function.

  • PDF

A Comparative Study of Subset Construction Methods in OSEM Algorithms using Simulated Projection Data of Compton Camera (모사된 컴프턴 카메라 투사데이터의 재구성을 위한 OSEM 알고리즘의 부분집합 구성법 비교 연구)

  • Kim, Soo-Mee;Lee, Jae-Sung;Lee, Mi-No;Lee, Ju-Hahn;Kim, Joong-Hyun;Kim, Chan-Hyeong;Lee, Chun-Sik;Lee, Dong-Soo;Lee, Soo-Jin
    • Nuclear Medicine and Molecular Imaging
    • /
    • v.41 no.3
    • /
    • pp.234-240
    • /
    • 2007
  • Purpose: In this study we propose a block-iterative method for reconstructing Compton scattered data. This study shows that the well-known expectation maximization (EM) approach along with its accelerated version based on the ordered subsets principle can be applied to the problem of image reconstruction for Compton camera. This study also compares several methods of constructing subsets for optimal performance of our algorithms. Materials and Methods: Three reconstruction algorithms were implemented; simple backprojection (SBP), EM, and ordered subset EM (OSEM). For OSEM, the projection data were grouped into subsets in a predefined order. Three different schemes for choosing nonoverlapping subsets were considered; scatter angle-based subsets, detector position-based subsets, and both scatter angle- and detector position-based subsets. EM and OSEM with 16 subsets were performed with 64 and 4 iterations, respectively. The performance of each algorithm was evaluated in terms of computation time and normalized mean-squared error. Results: Both EM and OSEM clearly outperformed SBP in all aspects of accuracy. The OSEM with 16 subsets and 4 iterations, which is equivalent to the standard EM with 64 iterations, was approximately 14 times faster in computation time than the standard EM. In OSEM, all of the three schemes for choosing subsets yielded similar results in computation time as well as normalized mean-squared error. Conclusion: Our results show that the OSEM algorithm, which have proven useful in emission tomography, can also be applied to the problem of image reconstruction for Compton camera. With properly chosen subset construction methods and moderate numbers of subsets, our OSEM algorithm significantly improves the computational efficiency while keeping the original quality of the standard EM reconstruction. The OSEM algorithm with scatter angle- and detector position-based subsets is most available.

An Analysis of the Specialist's Preference for the Model of Park-Based Mixed-Use Districts in Securing Urban Parks and Green Spaces Via Private Development (민간개발 주도형 도시공원.녹지 확보를 위한 공원복합용도지구 모형에 대한 전문가 선호도 분석)

  • Lee, Jeung-Eun;Cho, Se-Hwan
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.39 no.6
    • /
    • pp.1-11
    • /
    • 2011
  • The research was aimed to verify the feasibility of the model of Park-Based Mixed-Use Districts(PBMUD) around urban large park to secure private-based urban parks through the revision of the urban zoning system. The PBMUD is a type of urban zoning district in which park-oriented land use is mixed with the urban land uses of residents, advertising, business, culture, education and research. The PBMUD, delineated from and based on a new paradigm of landscape urbanism, is a new urban strategy to secure urban parks and to cultivate urban regeneration around parks and green spaces to enhance the quality of the urban landscape and to ameliorate urban environmental disasters like climate change. This study performed a questionnaire survey and analysis after a review of literature related to PBMUD. The study looked for specialists in the fields of urban planning and landscape architecture such as officials, researchers and engineers to respond to the questionnaire, which asked about degree of preference. The conclusions of this study were as follows. Firstly, specialists prefer the PBMUD at 79.3% for to 20.7% against ratio, indicating the feasibility of the model of PBMUD. The second, the most preferable reasons for the model, were the possibility of securing park space around urban parks and green spaces that assures access to park and communication with each area. The third, the main reason for non-preference for the model, was a lack of understanding of PBMUD added to the problems of unprofitable laws and regulations related to urban planning and development. These proposed a revision of the related laws and regulations such as the laws for planning and use of national land, laws for architecture etc. The fourth, the most preferred type of PBMUD, was cultural use mixed with park use in every kind of mix of land use. The degree of preference was lower in the order of use of commercial, residential, business, and education(research) when mixed with park use. The number of mixed-use amenities with in the park was found to be an indicator determining preference. The greater the number, the lower was preference frequencies, especially when related to research and business use. The fifth, the preference frequencies of the more than 70% among the respondents to the mixed-use ratio between park use and the others, was in a ratio of 60% park use and 40% other urban use. These research results will help to launch new future research subjects on the revision of zoning regulations in the laws for the planning and uses of national land and architectural law as well as criteria and indicators of subdivision planning as related to a PBMUD model.

Shielding for Critical Organs and Radiation Exposure Dose Distribution in Patients with High Energy Radiotherapy (고 에너지 방사선치료에서 환자의 피폭선량 분포와 생식선의 차폐)

  • Chu, Sung-Sil;Suh, Chang-Ok;Kim, Gwi-Eon
    • Journal of Radiation Protection and Research
    • /
    • v.27 no.1
    • /
    • pp.1-10
    • /
    • 2002
  • High energy photon beams from medical linear accelerators produce large scattered radiation by various components of the treatment head, collimator and walls or objects in the treatment room including the patient. These scattered radiation do not provide therapeutic dose and are considered a hazard from the radiation safety perspective. Scattered dose of therapeutic high energy radiation beams are contributed significant unwanted dose to the patient. ICRP take the position that a dose of 500mGy may cause abortion at any stage of pregnancy and that radiation detriment to the fetus includes risk of mental retardation with a possible threshold in the dose response relationship around 100 mGy for the gestational period. The ICRP principle of as low as reasonably achievable (ALARA) was recommended for protection of occupation upon the linear no-threshold dose response hypothesis for cancer induction. We suggest this ALARA principle be applied to the fetus and testicle in therapeutic treatment. Radiation dose outside a photon treatment filed is mostly due to scattered photons. This scattered dose is a function of the distance from the beam edge, treatment geometry, primary photon energy, and depth in the patient. The need for effective shielding of the fetus and testicle is reinforced when young patients ate treated with external beam radiation therapy and then shielding designed to reduce the scattered photon dose to normal organs have to considered. Irradiation was performed in phantom using high energy photon beams produced by a Varian 2100C/D medical linear accelerator (Varian Oncology Systems, Palo Alto, CA) located at the Yonsei Cancer Center. The composite phantom used was comprised of a commercially available anthropomorphic Rando phantom (Phantom Laboratory Inc., Salem, YN) and a rectangular solid polystyrene phantom of dimensions $30cm{\times}30cm{\times}20cm$. the anthropomorphic Rando phantom represents an average man made from tissue equivalent materials that is transected into transverse 36 slices of 2.5cm thickness. Photon dose was measured using a Capintec PR-06C ionization chamber with Capintec 192 electrometer (Capintec Inc., Ramsey, NJ), TLD( VICTOREEN 5000. LiF) and film dosimetry V-Omat, Kodak). In case of fetus, the dosimeter was placed at a depth of loom in this phantom at 100cm source to axis distance and located centrally 15cm from the inferior edge of the $30cm{\times}30cm^2$ x-ray beam irradiating the Rando phantom chest wall. A acryl bridge of size $40cm{\times}40cm^2$ and a clear space of about 20 cm was fabricated and placed on top of the rectangular polystyrene phantom representing the abdomen of the patient. The leaf pot for testicle shielding was made as various shape, sizes, thickness and supporting stand. The scattered photon with and without shielding were measured at the representative position of the fetus and testicle. Measurement of radiation scattered dose outside fields and critical organs, like fetus position and testicle region, from chest or pelvic irradiation by large fie]d of high energy radiation beam was performed using an ionization chamber and film dosimetry. The scattered doses outside field were measured 5 - 10% of maximum doses in fields and exponentially decrease from field margins. The scattered photon dose received the fetus and testicle from thorax field irradiation was measured about 1 mGy/Gy of photon treatment dose. Shielding construction to reduce this scattered dose was investigated using lead sheet and blocks. Lead pot shield for testicle reduced the scatter dose under 10 mGy when photon beam of 60 Gy was irradiated in abdomen region. The scattered photon dose is reduced when the lead shield was used while the no significant reduction of scattered photon dose was observed and 2-3 mm lead sheets refuted the skin dose under 80% and almost electron contamination. The results indicate that it was possible to improve shielding to reduce scattered photon for fetus and testicle when a young patients were treated with a high energy photon beam.

The Validation Study of the Questionnaire for Sasang Constitution Classification (the 2nd edition revised in 1995) - In the field of profile analysis (사상체질분류검사지(四象體質分類檢査紙)(QSCC)II에 대(對)한 타당화(妥當化) 연구(硏究) -각(各) 체질집단(體質集團)의 군집별(群集別) Profile 분석(分析)을 중심(中心)으로-)

  • Lee, Jung-Chan;Go, Byeong-Hui;Song, Il-Byeong
    • Journal of Sasang Constitutional Medicine
    • /
    • v.8 no.1
    • /
    • pp.247-294
    • /
    • 1996
  • By means of the statistical data which has been collected with newly revised QSCC made use of the outpatient group examined at Kyung-Hee Medical Center and an open ordinary person group, the author proceeded statistical analysis for the validation study of the revised questionnaire itself. First, check the accurate discrimination rate by performing discriminant analysis on the statistical data of the patient group. And next, sought T-score by applying the norms gained in process of standadization of the open ordinary person group to the Sasang scale score of the outpatient group and investigated the distinctive feature between the subpopulations which was devided in the process of multivarite cluster analysis. The result was summarized as follows ; 1. The validity of the questionnaire was established through the fact that the accurate discrimination rate the ratio between predicted group and actual group was figured out 70.08%. 2. At the profile analysis the response to the relevant scale showed notable upward tendency in each constitutional group and therefore it seems to be pertinent in the field of constitutional discrimination. 3. In the observation of the power of expression through the profile analysis of each constitutional group the Soyang group demonstrated the most remarkable outcome, the Soeum group was the most inferior and the Taieum group revealed a sort of dual property. 4. What is called the group of seceder out of three subpopulation of each constitutional group distinguished definitely from the contrasted groups at the point of the distinctive profile feature and the content is like following description. (1) The seceder group of Soyang-in showed considerably passive disposition differently from general character of ordinary Soyang group and an appearance attracting the attention is that they demonstrated comparatively higher response at Soeum scale (2) The seceder group of Taieum-in gained low scores in general that informed the passive disposition of the group and the other way of the general property of Taieum group which showed accompanied ascension in Taiyang-Taieum scales they demonstrated sharply declined score at Taiyang scale (3) The seceder group of Soeum-in demonstrated distinctive property similar to the profile feature of Soyang group and it notifies that the passive property of Soeum group was diluted for the most part. According to the above result, the validity of newly revised questionnaire has been proven successfully and the property of seceder groups could be noticed to some degree through the profile analysis on the course of this study. The result of this study is expected to use as a research materials to produce next edition of the questionnaire and it is regarded that further inquisition about the difference between the seceder group and the contrasted group is required for the promotion of the questionnaire as it refered several times in the contents of the main discourse.

  • PDF

Study on 3D Printer Suitable for Character Merchandise Production Training (캐릭터 상품 제작 교육에 적합한 3D프린터 연구)

  • Kwon, Dong-Hyun
    • Cartoon and Animation Studies
    • /
    • s.41
    • /
    • pp.455-486
    • /
    • 2015
  • The 3D printing technology, which started from the patent registration in 1986, was a technology that did not attract attention other than from some companies, due to the lack of awareness at the time. However, today, as expiring patents are appearing after the passage of 20 years, the price of 3D printers have decreased to the level of allowing purchase by individuals and the technology is attracting attention from industries, in addition to the general public, such as by naturally accepting 3D and to share 3D data, based on the generalization of online information exchange and improvement of computer performance. The production capability of 3D printers, which is based on digital data enabling digital transmission and revision and supplementation or production manufacturing not requiring molding, may provide a groundbreaking change to the process of manufacturing, and may attain the same effect in the character merchandise sector. Using a 3D printer is becoming a necessity in various figure merchandise productions which are in the forefront of the kidult culture that is recently gaining attention, and when predicting the demand by the industrial sites related to such character merchandise and when considering the more inexpensive price due to the expiration of patents and sharing of technology, expanding opportunities and sectors of employment and cultivating manpower that are able to engage in further creative work seems as a must, by introducing education courses cultivating manpower that can utilize 3D printers at the education field. However, there are limits in the information that can be obtained when seeking to introduce 3D printers in school education. Because the press or information media only mentions general information, such as the growth of the industrial size or prosperous future value of 3D printers, the research level of the academic world also remains at the level of organizing contents in an introductory level, such as by analyzing data on industrial size, analyzing the applicable scope in the industry, or introducing the printing technology. Such lack of information gives rise to problems at the education site. There would be no choice but to incur temporal and opportunity expenses, since the technology would only be able to be used after going through trials and errors, by first introducing the technology without examining the actual information, such as through comparing the strengths and weaknesses. In particular, if an expensive equipment introduced does not suit the features of school education, the loss costs would be significant. This research targeted general users without a technology-related basis, instead of specialists. By comparing the strengths and weaknesses and analyzing the problems and matters requiring notice upon use, pursuant to the representative technologies, instead of merely introducing the 3D printer technology as had been done previously, this research sought to explain the types of features that a 3D printer should have, in particular, when required in education relating to the development of figure merchandise as an optional cultural contents at cartoon-related departments, and sought to provide information that can be of practical help when seeking to provide education using 3D printers in the future. In the main body, the technologies were explained by making a classification based on a new perspective, such as the buttress method, types of materials, two-dimensional printing method, and three-dimensional printing method. The reason for selecting such different classification method was to easily allow mutual comparison of the practical problems upon use. In conclusion, the most suitable 3D printer was selected as the printer in the FDM method, which is comparatively cheap and requires low repair and maintenance cost and low materials expenses, although rather insufficient in the quality of outputs, and a recommendation was made, in addition, to select an entity that is supportive in providing technical support.

Twitter Issue Tracking System by Topic Modeling Techniques (토픽 모델링을 이용한 트위터 이슈 트래킹 시스템)

  • Bae, Jung-Hwan;Han, Nam-Gi;Song, Min
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.2
    • /
    • pp.109-122
    • /
    • 2014
  • People are nowadays creating a tremendous amount of data on Social Network Service (SNS). In particular, the incorporation of SNS into mobile devices has resulted in massive amounts of data generation, thereby greatly influencing society. This is an unmatched phenomenon in history, and now we live in the Age of Big Data. SNS Data is defined as a condition of Big Data where the amount of data (volume), data input and output speeds (velocity), and the variety of data types (variety) are satisfied. If someone intends to discover the trend of an issue in SNS Big Data, this information can be used as a new important source for the creation of new values because this information covers the whole of society. In this study, a Twitter Issue Tracking System (TITS) is designed and established to meet the needs of analyzing SNS Big Data. TITS extracts issues from Twitter texts and visualizes them on the web. The proposed system provides the following four functions: (1) Provide the topic keyword set that corresponds to daily ranking; (2) Visualize the daily time series graph of a topic for the duration of a month; (3) Provide the importance of a topic through a treemap based on the score system and frequency; (4) Visualize the daily time-series graph of keywords by searching the keyword; The present study analyzes the Big Data generated by SNS in real time. SNS Big Data analysis requires various natural language processing techniques, including the removal of stop words, and noun extraction for processing various unrefined forms of unstructured data. In addition, such analysis requires the latest big data technology to process rapidly a large amount of real-time data, such as the Hadoop distributed system or NoSQL, which is an alternative to relational database. We built TITS based on Hadoop to optimize the processing of big data because Hadoop is designed to scale up from single node computing to thousands of machines. Furthermore, we use MongoDB, which is classified as a NoSQL database. In addition, MongoDB is an open source platform, document-oriented database that provides high performance, high availability, and automatic scaling. Unlike existing relational database, there are no schema or tables with MongoDB, and its most important goal is that of data accessibility and data processing performance. In the Age of Big Data, the visualization of Big Data is more attractive to the Big Data community because it helps analysts to examine such data easily and clearly. Therefore, TITS uses the d3.js library as a visualization tool. This library is designed for the purpose of creating Data Driven Documents that bind document object model (DOM) and any data; the interaction between data is easy and useful for managing real-time data stream with smooth animation. In addition, TITS uses a bootstrap made of pre-configured plug-in style sheets and JavaScript libraries to build a web system. The TITS Graphical User Interface (GUI) is designed using these libraries, and it is capable of detecting issues on Twitter in an easy and intuitive manner. The proposed work demonstrates the superiority of our issue detection techniques by matching detected issues with corresponding online news articles. The contributions of the present study are threefold. First, we suggest an alternative approach to real-time big data analysis, which has become an extremely important issue. Second, we apply a topic modeling technique that is used in various research areas, including Library and Information Science (LIS). Based on this, we can confirm the utility of storytelling and time series analysis. Third, we develop a web-based system, and make the system available for the real-time discovery of topics. The present study conducted experiments with nearly 150 million tweets in Korea during March 2013.

Design of Client-Server Model For Effective Processing and Utilization of Bigdata (빅데이터의 효과적인 처리 및 활용을 위한 클라이언트-서버 모델 설계)

  • Park, Dae Seo;Kim, Hwa Jong
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.4
    • /
    • pp.109-122
    • /
    • 2016
  • Recently, big data analysis has developed into a field of interest to individuals and non-experts as well as companies and professionals. Accordingly, it is utilized for marketing and social problem solving by analyzing the data currently opened or collected directly. In Korea, various companies and individuals are challenging big data analysis, but it is difficult from the initial stage of analysis due to limitation of big data disclosure and collection difficulties. Nowadays, the system improvement for big data activation and big data disclosure services are variously carried out in Korea and abroad, and services for opening public data such as domestic government 3.0 (data.go.kr) are mainly implemented. In addition to the efforts made by the government, services that share data held by corporations or individuals are running, but it is difficult to find useful data because of the lack of shared data. In addition, big data traffic problems can occur because it is necessary to download and examine the entire data in order to grasp the attributes and simple information about the shared data. Therefore, We need for a new system for big data processing and utilization. First, big data pre-analysis technology is needed as a way to solve big data sharing problem. Pre-analysis is a concept proposed in this paper in order to solve the problem of sharing big data, and it means to provide users with the results generated by pre-analyzing the data in advance. Through preliminary analysis, it is possible to improve the usability of big data by providing information that can grasp the properties and characteristics of big data when the data user searches for big data. In addition, by sharing the summary data or sample data generated through the pre-analysis, it is possible to solve the security problem that may occur when the original data is disclosed, thereby enabling the big data sharing between the data provider and the data user. Second, it is necessary to quickly generate appropriate preprocessing results according to the level of disclosure or network status of raw data and to provide the results to users through big data distribution processing using spark. Third, in order to solve the problem of big traffic, the system monitors the traffic of the network in real time. When preprocessing the data requested by the user, preprocessing to a size available in the current network and transmitting it to the user is required so that no big traffic occurs. In this paper, we present various data sizes according to the level of disclosure through pre - analysis. This method is expected to show a low traffic volume when compared with the conventional method of sharing only raw data in a large number of systems. In this paper, we describe how to solve problems that occur when big data is released and used, and to help facilitate sharing and analysis. The client-server model uses SPARK for fast analysis and processing of user requests. Server Agent and a Client Agent, each of which is deployed on the Server and Client side. The Server Agent is a necessary agent for the data provider and performs preliminary analysis of big data to generate Data Descriptor with information of Sample Data, Summary Data, and Raw Data. In addition, it performs fast and efficient big data preprocessing through big data distribution processing and continuously monitors network traffic. The Client Agent is an agent placed on the data user side. It can search the big data through the Data Descriptor which is the result of the pre-analysis and can quickly search the data. The desired data can be requested from the server to download the big data according to the level of disclosure. It separates the Server Agent and the client agent when the data provider publishes the data for data to be used by the user. In particular, we focus on the Big Data Sharing, Distributed Big Data Processing, Big Traffic problem, and construct the detailed module of the client - server model and present the design method of each module. The system designed on the basis of the proposed model, the user who acquires the data analyzes the data in the desired direction or preprocesses the new data. By analyzing the newly processed data through the server agent, the data user changes its role as the data provider. The data provider can also obtain useful statistical information from the Data Descriptor of the data it discloses and become a data user to perform new analysis using the sample data. In this way, raw data is processed and processed big data is utilized by the user, thereby forming a natural shared environment. The role of data provider and data user is not distinguished, and provides an ideal shared service that enables everyone to be a provider and a user. The client-server model solves the problem of sharing big data and provides a free sharing environment to securely big data disclosure and provides an ideal shared service to easily find big data.

Analysis of Twitter for 2012 South Korea Presidential Election by Text Mining Techniques (텍스트 마이닝을 이용한 2012년 한국대선 관련 트위터 분석)

  • Bae, Jung-Hwan;Son, Ji-Eun;Song, Min
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.3
    • /
    • pp.141-156
    • /
    • 2013
  • Social media is a representative form of the Web 2.0 that shapes the change of a user's information behavior by allowing users to produce their own contents without any expert skills. In particular, as a new communication medium, it has a profound impact on the social change by enabling users to communicate with the masses and acquaintances their opinions and thoughts. Social media data plays a significant role in an emerging Big Data arena. A variety of research areas such as social network analysis, opinion mining, and so on, therefore, have paid attention to discover meaningful information from vast amounts of data buried in social media. Social media has recently become main foci to the field of Information Retrieval and Text Mining because not only it produces massive unstructured textual data in real-time but also it serves as an influential channel for opinion leading. But most of the previous studies have adopted broad-brush and limited approaches. These approaches have made it difficult to find and analyze new information. To overcome these limitations, we developed a real-time Twitter trend mining system to capture the trend in real-time processing big stream datasets of Twitter. The system offers the functions of term co-occurrence retrieval, visualization of Twitter users by query, similarity calculation between two users, topic modeling to keep track of changes of topical trend, and mention-based user network analysis. In addition, we conducted a case study on the 2012 Korean presidential election. We collected 1,737,969 tweets which contain candidates' name and election on Twitter in Korea (http://www.twitter.com/) for one month in 2012 (October 1 to October 31). The case study shows that the system provides useful information and detects the trend of society effectively. The system also retrieves the list of terms co-occurred by given query terms. We compare the results of term co-occurrence retrieval by giving influential candidates' name, 'Geun Hae Park', 'Jae In Moon', and 'Chul Su Ahn' as query terms. General terms which are related to presidential election such as 'Presidential Election', 'Proclamation in Support', Public opinion poll' appear frequently. Also the results show specific terms that differentiate each candidate's feature such as 'Park Jung Hee' and 'Yuk Young Su' from the query 'Guen Hae Park', 'a single candidacy agreement' and 'Time of voting extension' from the query 'Jae In Moon' and 'a single candidacy agreement' and 'down contract' from the query 'Chul Su Ahn'. Our system not only extracts 10 topics along with related terms but also shows topics' dynamic changes over time by employing the multinomial Latent Dirichlet Allocation technique. Each topic can show one of two types of patterns-Rising tendency and Falling tendencydepending on the change of the probability distribution. To determine the relationship between topic trends in Twitter and social issues in the real world, we compare topic trends with related news articles. We are able to identify that Twitter can track the issue faster than the other media, newspapers. The user network in Twitter is different from those of other social media because of distinctive characteristics of making relationships in Twitter. Twitter users can make their relationships by exchanging mentions. We visualize and analyze mention based networks of 136,754 users. We put three candidates' name as query terms-Geun Hae Park', 'Jae In Moon', and 'Chul Su Ahn'. The results show that Twitter users mention all candidates' name regardless of their political tendencies. This case study discloses that Twitter could be an effective tool to detect and predict dynamic changes of social issues, and mention-based user networks could show different aspects of user behavior as a unique network that is uniquely found in Twitter.