• Title/Summary/Keyword: 데이터 정제

Search Result 469, Processing Time 0.034 seconds

The Framework of Research Network and Performance Evaluation on Personal Information Security: Social Network Analysis Perspective (개인정보보호 분야의 연구자 네트워크와 성과 평가 프레임워크: 소셜 네트워크 분석을 중심으로)

  • Kim, Minsu;Choi, Jaewon;Kim, Hyun Jin
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.1
    • /
    • pp.177-193
    • /
    • 2014
  • Over the past decade, there has been a rapid diffusion of electronic commerce and a rising number of interconnected networks, resulting in an escalation of security threats and privacy concerns. Electronic commerce has a built-in trade-off between the necessity of providing at least some personal information to consummate an online transaction, and the risk of negative consequences from providing such information. More recently, the frequent disclosure of private information has raised concerns about privacy and its impacts. This has motivated researchers in various fields to explore information privacy issues to address these concerns. Accordingly, the necessity for information privacy policies and technologies for collecting and storing data, and information privacy research in various fields such as medicine, computer science, business, and statistics has increased. The occurrence of various information security accidents have made finding experts in the information security field an important issue. Objective measures for finding such experts are required, as it is currently rather subjective. Based on social network analysis, this paper focused on a framework to evaluate the process of finding experts in the information security field. We collected data from the National Discovery for Science Leaders (NDSL) database, initially collecting about 2000 papers covering the period between 2005 and 2013. Outliers and the data of irrelevant papers were dropped, leaving 784 papers to test the suggested hypotheses. The co-authorship network data for co-author relationship, publisher, affiliation, and so on were analyzed using social network measures including centrality and structural hole. The results of our model estimation are as follows. With the exception of Hypothesis 3, which deals with the relationship between eigenvector centrality and performance, all of our hypotheses were supported. In line with our hypothesis, degree centrality (H1) was supported with its positive influence on the researchers' publishing performance (p<0.001). This finding indicates that as the degree of cooperation increased, the more the publishing performance of researchers increased. In addition, closeness centrality (H2) was also positively associated with researchers' publishing performance (p<0.001), suggesting that, as the efficiency of information acquisition increased, the more the researchers' publishing performance increased. This paper identified the difference in publishing performance among researchers. The analysis can be used to identify core experts and evaluate their performance in the information privacy research field. The co-authorship network for information privacy can aid in understanding the deep relationships among researchers. In addition, extracting characteristics of publishers and affiliations, this paper suggested an understanding of the social network measures and their potential for finding experts in the information privacy field. Social concerns about securing the objectivity of experts have increased, because experts in the information privacy field frequently participate in political consultation, and business education support and evaluation. In terms of practical implications, this research suggests an objective framework for experts in the information privacy field, and is useful for people who are in charge of managing research human resources. This study has some limitations, providing opportunities and suggestions for future research. Presenting the difference in information diffusion according to media and proximity presents difficulties for the generalization of the theory due to the small sample size. Therefore, further studies could consider an increased sample size and media diversity, the difference in information diffusion according to the media type, and information proximity could be explored in more detail. Moreover, previous network research has commonly observed a causal relationship between the independent and dependent variable (Kadushin, 2012). In this study, degree centrality as an independent variable might have causal relationship with performance as a dependent variable. However, in the case of network analysis research, network indices could be computed after the network relationship is created. An annual analysis could help mitigate this limitation.

Accuracy of 5-axis precision milling for guided surgical template (가이드 수술용 템플릿을 위한 5축 정밀가공공정의 정확성에 관한 연구)

  • Park, Ji-Man;Yi, Tae-Kyoung;Jung, Je-Kyo;Kim, Yong;Park, Eun-Jin;Han, Chong-Hyun;Koak, Jai-Young;Kim, Seong-Kyun;Heo, Seong-Joo
    • The Journal of Korean Academy of Prosthodontics
    • /
    • v.48 no.4
    • /
    • pp.294-300
    • /
    • 2010
  • Purpose: The template-guided implant surgery offers several advantages over the traditional approach. The purpose of this study was to evaluate the accuracy of coordinate synchronization procedure with 5-axis milling machine for surgical template fabrication by means of reverse engineering through universal CAD software. Materials and methods: The study was performed on ten edentulous models with imbedded gutta percha stoppings which were hidden under silicon gingival form. The platform for synchordination was formed on the bottom side of models and these casts were imaged in Cone beam CT. Vectors of stoppings were extracted and transferred to those of planned implant on virtual planning software. Depth of milling process was set to the level of one half of stoppings and the coordinate of the data was synchronized to the model image. Synchronization of milling coordinate was done by the conversion process for the platform for the synchordination located on the bottom of the model. The models were fixed on the synchordination plate of 5-axis milling machine and drilling was done as the planned vector and depth based on the synchronized data with twist drill of the same diameter as GP stopping. For the 3D rendering and image merging, the impression tray was set on the conbeam CT and pre- and post- CT acquiring was done with the model fixed on the impression body. The accuracy analysis was done with Solidworks (Dassault systems, Concord, USA) by measuring vector of stopping’s top and bottom centers of experimental model through merging and reverse engineering the planned and post-drilling CT image. Correlations among the parameters were tested by means of Pearson correlation coefficient and calculated with SPSS (release 14.0, SPSS Inc. Chicago, USA) ($\alpha$ = 0.05). Results: Due to the declination, GP remnant on upper half of stoppings was observed for every drilled bores. The deviation between planned image and drilled bore that was reverse engineered was 0.31 (0.15 - 0.42) mm at the entrance, 0.36 (0.24 - 0.51) mm at the apex, and angular deviation was 1.62 (0.54 - 2.27)$^{\circ}$. There was positive correlation between the deviation at the entrance and that at the apex (Pearson Correlation Coefficient = 0.904, P = .013). Conclusion: The coordinate synchronization 5-axis milling procedure has adequate accuracy for the production of the guided surgical template.

Development of Biologically Active Compounds from Edible Plant Sources XXII. Triterpenoids from the Aerial Parts of Sajabalssuk (Artemisia princeps PAMPANINI) (식용식물자원으로부터 활성물질의 탐색-XXII. 사자발쑥(Artemisia princeps PAMPANINI)의 지상부로부터 Triterpenoid의 분리)

  • Bang, Myun-Ho;Cho, Jin-Gyeong;Song, Myoung-Chong;Lee, Dae-Young;Han, Min-Woo;Chung, Hae-Gon;Jeong, Tae-Sook;Lee, Kyung-Tae;Choi, Myung-Sook;Baek, Nam-In
    • Applied Biological Chemistry
    • /
    • v.51 no.3
    • /
    • pp.223-227
    • /
    • 2008
  • The aerial parts of Sajabalssuk (Artemisia princeps PAMPANINI, Sajabalssuk) was extracted with 80% aqueous MeOH, and the concentrated extract was partitioned with EtOAc, n-BuOH and $H_2O$, successively. From the EtOAc fraction, three cycloartane-type triterpnoids and one ursane-type triterpenoid were isolated through the repeated silica gel, ODS and Sephadex LH-20 column chromatographies. From the results of physico-chemical data including NMR, MS and IR, the chemical structures of the triterpenoids were determined as wrightial (1), wrightial acetate (2), 27-norcycloart-20(21)-ene-25-al-3${\beta}$-ol acetate (3) and ursolic acid (4). No report has been found for isolation of compound 3 in the literature so far, and compounds 1, 2 and 3 were the first to be isolated from Sajabalssuk (Artemisia princeps PAMPANINI, Sajabalssuk). Also, compound 1 showed Acyl-CoA:Cholesterol acyltransferase (hACAT-1) and hACAT-2 inhibitory activity with the $IC_{50}$ values of 33.0 and 45.0 ${\mu}g/ml$, respectively. Compounds 2 and 3 inhibited hACAT-1 activity with the $IC_{50}$ values of 12.0 and 16.0 ${\mu}g/ml$, respectively.

Impact of Sulfur Dioxide Impurity on Process Design of $CO_2$ Offshore Geological Storage: Evaluation of Physical Property Models and Optimization of Binary Parameter (이산화황 불순물이 이산화탄소 해양 지중저장 공정설계에 미치는 영향 평가: 상태량 모델의 비교 분석 및 이성분 매개변수 최적화)

  • Huh, Cheol;Kang, Seong-Gil;Cho, Mang-Ik
    • Journal of the Korean Society for Marine Environment & Energy
    • /
    • v.13 no.3
    • /
    • pp.187-197
    • /
    • 2010
  • Carbon dioxide Capture and Storage(CCS) is regarded as one of the most promising options to response climate change. CCS is a three-stage process consisting of the capture of carbon dioxide($CO_2$), the transport of $CO_2$ to a storage location, and the long term isolation of $CO_2$ from the atmosphere for the purpose of carbon emission mitigation. Up to now, process design for this $CO_2$ marine geological storage has been carried out mainly on pure $CO_2$. Unfortunately the $CO_2$ mixture captured from the power plants and steel making plants contains many impurities such as $N_2$, $O_2$, Ar, $H_2O$, $SO_2$, $H_2S$. A small amount of impurities can change the thermodynamic properties and then significantly affect the compression, purification, transport and injection processes. In order to design a reliable $CO_2$ marine geological storage system, it is necessary to analyze the impact of these impurities on the whole CCS process at initial design stage. The purpose of the present paper is to compare and analyse the relevant physical property models including BWRS, PR, PRBM, RKS and SRK equations of state, and NRTL-RK model which are crucial numerical process simulation tools. To evaluate the predictive accuracy of the equation of the state for $CO_2-SO_2$ mixture, we compared numerical calculation results with reference experimental data. In addition, optimum binary parameter to consider the interaction of $CO_2$ and $SO_2$ molecules was suggested based on the mean absolute percent error. In conclusion, we suggest the most reliable physical property model with optimized binary parameter in designing the $CO_2-SO_2$ mixture marine geological storage process.

$CO_2$ Transport for CCS Application in Republic of Korea (이산화탄소 포집 및 저장 실용화를 위한 대한민국에서의 이산화탄소 수송)

  • Huh, Cheol;Kang, Seong-Gil;Cho, Mang-Ik
    • Journal of the Korean Society for Marine Environment & Energy
    • /
    • v.13 no.1
    • /
    • pp.18-29
    • /
    • 2010
  • Offshore subsurface storage of $CO_2$ is regarded as one of the most promising options to response severe climate change. Marine geological storage of $CO_2$ is to capture $CO_2$ from major point sources, to transport to the storage sites and to store $CO_2$ into the offshore subsurface geological structure such as the depleted gas reservoir and deep sea saline aquifer. Since 2005, we have developed relevant technologies for marine geological storage of $CO_2$. Those technologies include possible storage site surveys and basic designs for $CO_2$ transport and storage processes. To design a reliable $CO_2$ marine geological storage system, we devised a hypothetical scenario and used a numerical simulation tool to study its detailed processes. The process of transport $CO_2$ from the onshore capture sites to the offshore storage sites can be simulated with a thermodynamic equation of state. Before going to main calculation of process design, we compared and analyzed the relevant equation of states. To evaluate the predictive accuracies of the examined equation of states, we compare the results of numerical calculations with experimental reference data. Up to now, process design for this $CO_2$ marine geological storage has been carried out mainly on pure $CO_2$. Unfortunately the captured $CO_2$ mixture contains many impurities such as $N_2$, $O_2$, Ar, $H_{2}O$, $SO_{\chi}$, $H_{2}S$. A small amount of impurities can change the thermodynamic properties and then significantly affect the compression, purification and transport processes. This paper analyzes the major design parameters that are useful for constructing onshore and offshore $CO_2$ transport systems. On the basis of a parametric study of the hypothetical scenario, we suggest relevant variation ranges for the design parameters, particularly the flow rate, diameter, temperature, and pressure.

Ginsenosides from the fruits of Panax ginseng and their cytotoxic effects on human cancer cell lines (인삼(Panax ginseng) 열매로부터 분리한 ginsenoside의 동정 및 암세포독성 효과)

  • Gwag, Jung Eun;Lee, Yeong-Geun;Hwang-Bo, Jeon;Kim, Hyoung-Geun;Oh, Seon Min;Lee, Dae Young;Baek, Nam-In
    • Journal of Applied Biological Chemistry
    • /
    • v.61 no.4
    • /
    • pp.371-377
    • /
    • 2018
  • The fruits of Panax ginseng were extracted with 80% aqueous MeOH and the concentrates were partitioned into EtOAc, n-BuOH, and $H_2O$ fractions. The repeated $SiO_2$ and octadecyl $SiO_2$ column chromatographies for the EtOAc fraction led to isolation of five ginsenosides. The chemical structures of these compounds were determined as ginsenoside F1 (1), ginsenoside F2 (2), ginsenoside F3 (3), ginsenoside Ia (4), notoginsenoside Fe (5) based on spectroscopic analyses including nuclear magnetic resonance, MS, and infrared. Compounds 2-5 were isolated for the first time from the fruits of P. ginseng in this study. All isolated compounds were evaluated for cytotoxic activities against human cancer cell lines such as HCT-116, SK-OV-3, human cervix adenocarcinoma (HeLa), HepG2, and SK-MEL-5. Among them compounds 2, 4, and 5 showed significant cytotoxicity on cancer cells. Compound 2 exhibited cytotoxicity on SK-MEL-5, HepG2, and HeLa cells with $IC_{50}$ values of 82.8, 86.8, and $78.3{\mu}M$, respectively. Compound 4 showed cytotoxicity on HCT-116, SK-MEL-5, SK-OV-3, HepG2, and HeLa cells with $IC_{50}$ values of 24.5, 25.4, 26.3, 22.0, and $24.9{\mu}M$, respectively. Compound 5 did on SK-MEL-5 cell with $IC_{50}$ value of $81.7{\mu}M$. The cytotoxicity of ginsenoside 2, 4, and 5 isolated from the fruits of Panax ginseng showed strong inhibition effect against on cancer cells, all of which have a glucopyranosyl moiety on C-3.

Aesthetic Experience of Streetscape in Syarosu-gil as Urban Commercial Alleyway (도심 골목상권으로서 샤로수길 가로 경관의 미적 경험)

  • Lim, Hansol;Pae, Jeong-Hann
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.49 no.5
    • /
    • pp.125-137
    • /
    • 2021
  • How can we explain the phenomenon of small, old alleyways in the city becoming rising commercial places attracting people from an aesthetic perspective? This research discusses distinctive aesthetic experiences of urban commercial alleyways, which are located on inner roads and consist of small-scale stores and explore the specific aspects of Sharosu-gil, located in Gwanak-gu, Seoul. The aesthetic experience of urban commercial alleyways is generated by the contrast with the refined urban fabric along main roads in terms of space, the gap between the old and the new, and the antagonism between the known and the less known. The approach to Sharosu-gil consists of the high-rise buildings along the main road built in the 2000s, then encountering low-rise buildings on inside roads built from the late 1970s to the present. Therefore, it is judged that the site has sufficient conditions to generate the aesthetic experience as an urban commercial alleyway. As a result of analyzing the street improvement projects, first, the official announcement of the name 'Sharosu-gil' was interpreted as an escape from the place specificity and garnered the acquisition of the characteristics of an alternative. Secondly, the improvement project for old-established signboards was interpreted as harmony between the new and the old and the loss of temporality. Thirdly, in the pedestrian priority road project, the pavement was interpreted as a reinforcement of the identity as an alleyway and the visualization of the area. Since the reality of urban commercial alleyways depends on the user's visiting, it is necessary to interpret alleyways from the perspective of the senses and aesthetics, not just from social phenomena or capital logic perspective. The study will cast implications for relevant schemes and data-driven research.

Exploring the Trend of Korean Creative Dance by Analyzing Research Topics : Application of Text Mining (연구주제 분석을 통한 한국창작무용 경향 탐색 : 텍스트 마이닝의 적용)

  • Yoo, Ji-Young;Kim, Woo-Kyung
    • Journal of Korea Entertainment Industry Association
    • /
    • v.14 no.6
    • /
    • pp.53-60
    • /
    • 2020
  • The study is based on the assumption that the trend of phenomena and trends in research are contextually consistent. Therefore the purpose of this study is to explore the trend of dance through the subject analysis of the Korean creative dance study by utilizing text mining. Thus, 1,291 words were analyzed in the 616 journal title, which were established on the paper search website. The collection, refining and analysis of the data were all R 3.6.0 SW. According to the study, keywords representing the times were frequently used before the 2000s, but Korean creative dance research types were also found in terms of education and physical training. Second, the frequency of keywords related to the dance troupe's performance was high after the 2000s, but it was confirmed that Choi Seung-hee was still in an important position in the study of Korean creative dance. Third, an analysis of the overall research subjects of the Korean creative dance study showed that the research on 'Art of Choi Seung-hee in the modern era' was the highest proportion. Fourth, the Hot Topics, which are rising as of 2000, appeared as 'the performance activities of the National Dance Company' and 'the choreography expression and utilization of traditional dance'. However, since the recent trend of the National Dance Company's performance is advocating 'modernization based on tradition', it has been confirmed that the trend of Korean creative dance since the 2000s has been focused on the use of traditional dance motifs. Fifth, the Cold Topic, which has been falling as of 2000, has been shown to be a study of 'dancing expressions by age'. It was judged that interest in research also decreased due to the tendency to mix various dance styles after the establishment of the genre of Korean creative dance.

Development of Information Extraction System from Multi Source Unstructured Documents for Knowledge Base Expansion (지식베이스 확장을 위한 멀티소스 비정형 문서에서의 정보 추출 시스템의 개발)

  • Choi, Hyunseung;Kim, Mintae;Kim, Wooju;Shin, Dongwook;Lee, Yong Hun
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.4
    • /
    • pp.111-136
    • /
    • 2018
  • In this paper, we propose a methodology to extract answer information about queries from various types of unstructured documents collected from multi-sources existing on web in order to expand knowledge base. The proposed methodology is divided into the following steps. 1) Collect relevant documents from Wikipedia, Naver encyclopedia, and Naver news sources for "subject-predicate" separated queries and classify the proper documents. 2) Determine whether the sentence is suitable for extracting information and derive the confidence. 3) Based on the predicate feature, extract the information in the proper sentence and derive the overall confidence of the information extraction result. In order to evaluate the performance of the information extraction system, we selected 400 queries from the artificial intelligence speaker of SK-Telecom. Compared with the baseline model, it is confirmed that it shows higher performance index than the existing model. The contribution of this study is that we develop a sequence tagging model based on bi-directional LSTM-CRF using the predicate feature of the query, with this we developed a robust model that can maintain high recall performance even in various types of unstructured documents collected from multiple sources. The problem of information extraction for knowledge base extension should take into account heterogeneous characteristics of source-specific document types. The proposed methodology proved to extract information effectively from various types of unstructured documents compared to the baseline model. There is a limitation in previous research that the performance is poor when extracting information about the document type that is different from the training data. In addition, this study can prevent unnecessary information extraction attempts from the documents that do not include the answer information through the process for predicting the suitability of information extraction of documents and sentences before the information extraction step. It is meaningful that we provided a method that precision performance can be maintained even in actual web environment. The information extraction problem for the knowledge base expansion has the characteristic that it can not guarantee whether the document includes the correct answer because it is aimed at the unstructured document existing in the real web. When the question answering is performed on a real web, previous machine reading comprehension studies has a limitation that it shows a low level of precision because it frequently attempts to extract an answer even in a document in which there is no correct answer. The policy that predicts the suitability of document and sentence information extraction is meaningful in that it contributes to maintaining the performance of information extraction even in real web environment. The limitations of this study and future research directions are as follows. First, it is a problem related to data preprocessing. In this study, the unit of knowledge extraction is classified through the morphological analysis based on the open source Konlpy python package, and the information extraction result can be improperly performed because morphological analysis is not performed properly. To enhance the performance of information extraction results, it is necessary to develop an advanced morpheme analyzer. Second, it is a problem of entity ambiguity. The information extraction system of this study can not distinguish the same name that has different intention. If several people with the same name appear in the news, the system may not extract information about the intended query. In future research, it is necessary to take measures to identify the person with the same name. Third, it is a problem of evaluation query data. In this study, we selected 400 of user queries collected from SK Telecom 's interactive artificial intelligent speaker to evaluate the performance of the information extraction system. n this study, we developed evaluation data set using 800 documents (400 questions * 7 articles per question (1 Wikipedia, 3 Naver encyclopedia, 3 Naver news) by judging whether a correct answer is included or not. To ensure the external validity of the study, it is desirable to use more queries to determine the performance of the system. This is a costly activity that must be done manually. Future research needs to evaluate the system for more queries. It is also necessary to develop a Korean benchmark data set of information extraction system for queries from multi-source web documents to build an environment that can evaluate the results more objectively.