• 제목/요약/키워드: Large-scale database

검색결과 298건 처리시간 0.028초

영역 질의의 효과적인 처리를 위한 궤적 인덱싱 (Trajectory Indexing for Efficient Processing of Range Queries)

  • 차창일;김상욱;원정임
    • 정보처리학회논문지D
    • /
    • 제16D권4호
    • /
    • pp.487-496
    • /
    • 2009
  • 본 연구에서는 대용량 궤적 데이터베이스에서 영역 질의를 효과적으로 처리하기 위한 인덱싱 기법에 대하여 논의한다. 먼저, 기존 인덱싱기법의 문제점을 지적하고, 이러한 문제점을 해결하는 새로운 기법을 제안한다. 제안된 기법에서는 우선 시간 차원을 다수의 시간 구간으로 분할하고, 인덱싱의 대상이 되는 전체 라인 세그먼트들을 시간 구간별로 구분한다. 각 시간 구간에 속하는 라인 세그먼트들에 대하여 별도의 인덱스를 구축한다. 또한, 디스크에서 관리되는 과거 시간 구간에 대한 인덱스들과는 달리 최근 시간 구간에 대한 인덱스는 메인 메모리상에 관리함으로써 삽입과 검색의 성능을 크게 개선할 수 있다. 각 시간 구간에 속하는 라인 세그먼트들은 다음과 같은 방식으로 인덱스를 구축한다. 먼저, 2D-트리를 이용하여 전체 공간 차원을 유사한 수의 라인 세그먼트들이 배정되도록 다수의 셀들로 분할한다. 또한, 분할된 각 셀마다 시공간 차원 (x, y, t)에 대한 별도의 3차원 $R^*$-트리를 두어 보다 상세한 인덱싱을 지원한다. 이와 같은 다양한 전략을 이용함으로써 기존 기법의 문제점들을 해결 할 수 있다. 다양한 실험을 통하여 제안된 기법의 우수성을 정량적으로 검증한다. 실험 결과에 의하면, 기존 기법에 비하여 작은 인덱스 구조를 갖으면서도 검색 성능면에서 3$\sim$10배까지의 성능 향상 효과를 갖는 것으로 나타났다.

퍼지논리연산을 이용한 토지피복환경 변화분석: 안면도 사례연구 (Change Detection of Land Cover Environment using Fuzzy Logic Operation : A Case Study of Anmyeon-do)

  • 장동호;지광훈;이현영
    • 대한원격탐사학회지
    • /
    • 제18권6호
    • /
    • pp.305-317
    • /
    • 2002
  • 본 연구에서는 안면도의 토지피복변화 분석을 위해 원격탐사 및 GIS 기법을 이용하여 지표경관의 변화를 탐지하였다. 변화지역 추출은 위성영상과 현장답사를 통하여 확인하였고, 지표경관변화와 관련된 GIS 기반의 다양한 공간정보를 구축하였다. 공간통합 방법으로 퍼지논리연산을 사용하였다. 분석결과 자연 및 인문·사회에 관한 주제도들 중 토지피복 변화에 가장 큰 영향을 미치는 주제도는 표고분석도, 인구밀도도, 국토이용계획도 등이다. 퍼지논리연산을 이용하여 토지피복 변화를 통합 분석한 결과 정확한 변화를 예측할 수 있었다. 즉, 안면도 지역에서 대규모 토지피복 변화가 일어날 가능성이 높은 지역들은 해안과 가까운 평지에 위치한 지역이 높은 확률로 변화하였다. 특히 경사도 5%이하, 표고 15m 이하의 구릉지로 해양과 인접해 있는 지역은 현재 진행 중인 대규모 개발에 따른 연안환경 악화의 위험성이 높으므로 이에 대한 대책강구가 시급하다. 결론적으로 본 방법은 향후 토지피복 변화 연구를 위한 효과적인 방법 중의 하나로 적용될 수 있을 것으로 기대된다.

Cohort profile: National Investigation of Birth Cohort in Korea study 2008 (NICKs-2008)

  • Kim, Ju Hee;Lee, Jung Eun;Shim, So Min;Ha, Eun Kyo;Yon, Dong Keon;Kim, Ok Hyang;Baek, Ji Hyeon;Koh, Hyun Yong;Chae, Kyu Young;Lee, Seung Won;Han, Man Yong
    • Clinical and Experimental Pediatrics
    • /
    • 제64권9호
    • /
    • pp.480-488
    • /
    • 2021
  • Background: An adequate large-scale pediatric cohort based on nationwide administrative data is lacking in Korea. Purpose: This study established the National Investigation of Birth Cohort in Korea study 2008 (NICKs-2008) based on data from a nationwide population-based health screening program and data on healthcare utilization for children. Methods: The NICKs-2008 study consisted of the Korean National Health Insurance System (NHIS) and the National Health Screening Program for Infants and Children (NHSPIC) databases comprising children born in 2008 (n=469,248) and 2009 (n=448,459) in the Republic of Korea. The NHIS database contains data on age, sex, residential area, income, healthcare utilization (International Classification of Diseases10 codes, procedure codes, and drug classification codes), and healthcare providers. The NHSPIC consists of 7 screening rounds. These screening sessions comprised physical examination, developmental screening (rounds 2-7), a general health questionnaire, and age-specific anticipatory guidance. Results: During the 10-year follow-up, 2,718 children (0.3%) died, including more boys than girls (hazard ratio, 1.145; P<0.001). A total of 848,048 children participated in at least 1 of the 7 rounds of the NHSPIC, while 96,046 participated in all 7 screening programs. A total of 823 infants (0.1%) weighed less than 1,000 g, 3,177 (0.4%) weighed 1,000-1,499 g, 37,166 (4.4%) weighed 1,500-2,499 g, 773,081 (91.4%) weighed 2,500-4,000 g, and 32,016 (5.1%) weighed over 4,000 g. There were 23,404 premature babies (5.5%) in 2008 compared to 23,368 (5.6%) in 2009. The developmental screening test indicated appropriate development in 95%-98% of children, follow-up requirements for 1%-4% of children, and recommendations for further evaluation for 1% of children. Conclusion: The NICKs-2008, which integrates data from the NHIS and NHSPIC databases, can be used to analyze disease onset prior to hospitalization based on information such as lifestyle, eating habits, and risk factors.

No benefit of hypomethylating agents compared to supportive care for higher risk myelodysplastic syndrome

  • Sohn, Sang Kyun;Moon, Joon Ho;Lee, In Hee;Ahn, Jae Sook;Kim, Hyeoung Joon;Chung, Joo Seop;Shin, Ho Jin;Park, Sung Woo;Lee, Won Sik;Lee, Sang Min;Kim, Hawk;Lee, Ho Sup;Kim, Yang Soo;Cho, Yoon Young;Bae, Sung Hwa;Lee, Ji Hyun;Kim, Sung Hyun;Song, Ik Chan;Kwon, Ji Hyun;Lee, Yoo Jin
    • The Korean journal of internal medicine
    • /
    • 제33권6호
    • /
    • pp.1194-1202
    • /
    • 2018
  • Background/Aims: This study evaluated the role of hypomethylating agents (HMA) compared to best supportive care (BSC) for patients with high or very-high (H/VH) risk myelodysplastic syndrome (MDS) according to the Revised International Prognostic Scoring System. Methods: A total of 279 H/VH risk MDS patients registered in the Korean MDS Working Party database were retrospectively analyzed. Results: HMA therapy was administered to 205 patients (73.5%), including 31 patients (11.1%) who then received allogeneic hematopoietic cell transplantation (allo-HCT), while 74 patients (26.5%) received BSC or allo-HCT without HMA. The 3-year overall survival (OS) rates were $53.1%{\pm}10.7%$ for allo-HCT with HMA, $75%{\pm}21.7%$ for allo-HCT without HMA, $17.3%{\pm}3.6%$ for HMA, and $20.8%{\pm}6.9%$ for BSC groups (p < 0.001). In the multivariate analysis, only allo-HCT was related with favorable OS (hazard ratio [HR], 0.356; p = 0.002), while very poor cytogenetic risk (HR, 5.696; p = 0.042), age ${\geq}65years$ (HR, 1.578; p = 0.022), Eastern Cooperative Oncology Group performance status (ECOG PS) 2 to 4 (HR, 2.837; p < 0.001), and transformation to acute myeloid leukemia (AML) (HR, 1.901; p = 0.001) all had an adverse effect on OS. Conclusions: For the H/VH risk group, very poor cytogenetic risk, age ${\geq}65years$, ECOG PS 2 to 4, and AML transformation were poor prognostic factors. HMA showed no benefit in terms of OS when compared to BSC. Allo-HCT was the only factor predicting a favorable long-term outcome. The use of HMA therapy did not seem to have an adverse effect on the transplantation outcomes. However, the conclusion of this study should be carefully interpreted and proven by large scale research in the future.

Identification and Validation of Circulating MicroRNA Signatures for Breast Cancer Early Detection Based on Large Scale Tissue-Derived Data

  • Yu, Xiaokang;Liang, Jinsheng;Xu, Jiarui;Li, Xingsong;Xing, Shan;Li, Huilan;Liu, Wanli;Liu, Dongdong;Xu, Jianhua;Huang, Lizhen;Du, Hongli
    • Journal of Breast Cancer
    • /
    • 제21권4호
    • /
    • pp.363-370
    • /
    • 2018
  • Purpose: Breast cancer is the most commonly occurring cancer among women worldwide, and therefore, improved approaches for its early detection are urgently needed. As microRNAs (miRNAs) are increasingly recognized as critical regulators in tumorigenesis and possess excellent stability in plasma, this study focused on using miRNAs to develop a method for identifying noninvasive biomarkers. Methods: To discover critical candidates, differential expression analysis was performed on tissue-originated miRNA profiles of 409 early breast cancer patients and 87 healthy controls from The Cancer Genome Atlas database. We selected candidates from the differentially expressed miRNAs and then evaluated every possible molecular signature formed by the candidates. The best signature was validated in independent serum samples from 113 early breast cancer patients and 47 healthy controls using reverse transcription quantitative real-time polymerase chain reaction. Results: The miRNA candidates in our method were revealed to be associated with breast cancer according to previous studies and showed potential as useful biomarkers. When validated in independent serum samples, the area under curve of the final miRNA signature (miR-21-3p, miR-21-5p, and miR-99a-5p) was 0.895. Diagnostic sensitivity and specificity were 97.9% and 73.5%, respectively. Conclusion: The present study established a novel and effective method to identify biomarkers for early breast cancer. And the method, is also suitable for other cancer types. Furthermore, a combination of three miRNAs was identified as a prospective biomarker for breast cancer early detection.

네트워크 약리학을 기반으로한 총명공진단(聰明供辰丹) 구성성분과 알츠하이머 타겟 유전자의 효능 및 작용기전 예측 (Network pharmacology-based prediction of efficacy and mechanism of Chongmyunggongjin-dan acting on Alzheimer's disease)

  • 권빛나;유수민;김동욱;오진영;장미경;박성주;배기상
    • 대한한의학회지
    • /
    • 제44권2호
    • /
    • pp.106-118
    • /
    • 2023
  • Objectives: Network pharmacology is a method of constructing and analyzing a drug-compound-target network to predict potential efficacy and mechanisms related to drug targets. In that large-scale analysis can be performed in a short time, it is considered a suitable tool to explore the function and role of herbal medicine. Thus, we investigated the potential functions and pathways of Chongmyunggongjin-dan (CMGJD) on Alzheimer's disease (AD) via network pharmacology analysis. Methods: Using public databases and PubChem database, compounds of CMGJD and their target genes were collected. The putative target genes of CMGJD and known target genes of AD were compared and found the correlation. Then, the network was constructed using Cytoscape 3.9.1. and functional enrichment analysis was conducted based on the Gene Ontology (GO) Biological process and Kyoto Encyclopedia of Genes and Genomes (KEGG) Pathways to predict the mechanisms. Results: The result showed that total 104 compounds and 1157 related genes were gathered from CMGJD. The network consisted of 1157nodes and 10034 edges. 859 genes were interacted with AD gene set, suggesting that the effects of CMGJD are closely related to AD. Target genes of CMGJD are considerably associated with various pathways including 'Positive regulation of chemokine production', 'Cellular response to toxic substance', 'Arachidonic acid metabolic process', 'PI3K-Akt signaling pathway', 'Metabolic pathways', 'IL-17 signaling pathway' and 'Neuroactive ligand-receptor interaction'. Conclusion: Through a network pharmacological method, CMGJD was predicted to have high relevance with AD by regulating inflammation. This study could be used as a basis for effects of CMGJD on AD.

중풍 후 운동 장애에 대한 『의부집성(醫部集成)』의 침구치료 고찰 (A literatual study on the acupuncture and moxibustion for hemiparesis of stroke in Euibujipsung)

  • 정동원;민인규;문상관;박성욱;정우상;박정미;고창남;조기호;배형섭;김영석
    • 대한중풍순환신경학회지
    • /
    • 제7권1호
    • /
    • pp.34-39
    • /
    • 2006
  • Objectives and methods : The Euibujipsung is the one of the huge-scale encyclopedias about Oriental Medicine. To investigate the most frequently used acupoints for hemiparesis after stroke, we used Euibujipsung CR-ROM database with several key words concerned with motor weakness (半身不遂 不遂不隨 癱瘓 中臟 中腑 風痱, etc.). Results : In the result, we found five popular acupoints (GV20, LI11, LI15, ST36 and GB39), and four meridians (Stomach, Gall bladder, Large intestine and Small intestine). We also found that the Yang meridians were cited more frequently than the Yin. Conclusion : Therefore we think that these findings can give further ideas to clinical practice and research fields for stroke rehabilitation in Oriental medicine.

  • PDF

중풍 후 언어 장애에 대한 ☐☐의부집성(醫部集成)☐☐의 침구치료 고찰 (A Literatual Study on the Acupuncture and Moxibustion for Dysarthria of Stroke in Euibujipsung)

  • 정동원;민인규;문상관;나병조;홍진우;박성욱;정우상;박정미;고창남;조기호;배형섭;김영석
    • 대한중풍순환신경학회지
    • /
    • 제8권1호
    • /
    • pp.28-33
    • /
    • 2007
  • Objectives and methods : The Euibujipsung is one of the huge-scale encyclopedias about Oriental Medicine. To search the most frequently used aupoints for dysarthria after stroke, we used Euibujipsung CD-ROM database with several chinese character keyword concerned with vernal function(語, 言, 音, 啞, 瘖, etc). Results : We found four popular acupoints(PC5, GV20, GV16, TE6), and five meridians (Governor vessel, Gall Bladder, Heart, Large Intestine and Triple Energizer). We also found that the extra meridians were used more frequently than other type of meridians. Conclusion : We think that these findings can give further ideas to clinical practice and research fields for stroke rehabilitation in Oriental medicine.

  • PDF

토픽 모델링을 이용한 트위터 이슈 트래킹 시스템 (Twitter Issue Tracking System by Topic Modeling Techniques)

  • 배정환;한남기;송민
    • 지능정보연구
    • /
    • 제20권2호
    • /
    • pp.109-122
    • /
    • 2014
  • 현재 우리는 소셜 네트워크 서비스(Social Network Service, 이하 SNS) 상에서 수많은 데이터를 만들어 내고 있다. 특히, 모바일 기기와 SNS의 결합은 과거와는 비교할 수 없는 대량의 데이터를 생성하면서 사회적으로도 큰 영향을 미치고 있다. 이렇게 방대한 SNS 데이터 안에서 사람들이 많이 이야기하는 이슈를 찾아낼 수 있다면 이 정보는 사회 전반에 걸쳐 새로운 가치 창출을 위한 중요한 원천으로 활용될 수 있다. 본 연구는 이러한 SNS 빅데이터 분석에 대한 요구에 부응하기 위해, 트위터 데이터를 활용하여 트위터 상에서 어떤 이슈가 있었는지 추출하고 이를 웹 상에서 시각화 하는 트위터이슈 트래킹 시스템 TITS(Twitter Issue Tracking System)를 설계하고 구축 하였다. TITS는 1) 일별 순위에 따른 토픽 키워드 집합 제공 2) 토픽의 한달 간 일별 시계열 그래프 시각화 3) 토픽으로서의 중요도를 점수와 빈도수에 따라 Treemap으로 제공 4) 키워드 검색을 통한 키워드의 한달 간 일별 시계열 그래프 시각화의 기능을 갖는다. 본 연구는 SNS 상에서 실시간으로 발생하는 빅데이터를 Open Source인 Hadoop과 MongoDB를 활용하여 분석하였고, 이는 빅데이터의 실시간 처리가 점점 중요해지고 있는 현재 매우 주요한 방법론을 제시한다. 둘째, 문헌정보학 분야뿐만 아니라 다양한 연구 영역에서 사용하고 있는 토픽 모델링 기법을 실제 트위터 데이터에 적용하여 스토리텔링과 시계열 분석 측면에서 유용성을 확인할 수 있었다. 셋째, 연구 실험을 바탕으로 시각화와 웹 시스템 구축을 통해 실제 사용 가능한 시스템으로 구현하였다. 이를 통해 소셜미디어에서 생성되는 사회적 트렌드를 마이닝하여 데이터 분석을 통한 의미 있는 정보를 제공하는 실제적인 방법을 제시할 수 있었다는 점에서 주요한 의의를 갖는다. 본 연구는 JSON(JavaScript Object Notation) 파일 포맷의 1억 5천만개 가량의 2013년 3월 한국어 트위터 데이터를 실험 대상으로 한다.

Delineating Transcription Factor Networks Governing Virulence of a Global Human Meningitis Fungal Pathogen, Cryptococcus neoformans

  • Jung, Kwang-Woo;Yang, Dong-Hoon;Maeng, Shinae;Lee, Kyung-Tae;So, Yee-Seul;Hong, Joohyeon;Choi, Jaeyoung;Byun, Hyo-Jeong;Kim, Hyelim;Bang, Soohyun;Song, Min-Hee;Lee, Jang-Won;Kim, Min Su;Kim, Seo-Young;Ji, Je-Hyun;Park, Goun;Kwon, Hyojeong;Cha, Sooyeon;Meyers, Gena Lee;Wang, Li Li;Jang, Jooyoung;Janbon, Guilhem;Adedoyin, Gloria;Kim, Taeyup;Averette, Anna K.;Heitman, Joseph;Cheong, Eunji;Lee, Yong-Hwan;Lee, Yin-Won;Bahn, Yong-Sun
    • 한국균학회소식:학술대회논문집
    • /
    • 한국균학회 2015년도 춘계학술대회 및 임시총회
    • /
    • pp.59-59
    • /
    • 2015
  • Cryptococcus neoformans causes life-threatening meningoencephalitis in humans, but the treatment of cryptococcosis remains challenging. To develop novel therapeutic targets and approaches, signaling cascades controlling pathogenicity of C. neoformans have been extensively studied but the underlying biological regulatory circuits remain elusive, particularly due to the presence of an evolutionarily divergent set of transcription factors (TFs) in this basidiomycetous fungus. In this study, we constructed a high-quality of 322 signature-tagged gene deletion strains for 155 putative TF genes, which were previously predicted using the DNA-binding domain TF database (http://www.transcriptionfactor.org/). We tested in vivo and in vitro phenotypic traits under 32 distinct growth conditions using 322 TF gene deletion strains. At least one phenotypic trait was exhibited by 145 out of 155 TF mutants (93%) and approximately 85% of the TFs (132/155) have been functionally characterized for the first time in this study. Through high-coverage phenome analysis, we discovered myriad novel TFs that play critical roles in growth, differentiation, virulence-factor (melanin, capsule, and urease) formation, stress responses, antifungal drug resistance, and virulence. Large-scale virulence and infectivity assays in insect (Galleria mellonella) and mouse host models identified 34 novel TFs that are critical for pathogenicity. The genotypic and phenotypic data for each TF are available in the C. neoformans TF phenome database (http://tf.cryptococcus.org). In conclusion, our phenome-based functional analysis of the C. neoformans TF mutant library provides key insights into transcriptional networks of basidiomycetous fungi and ubiquitous human fungal pathogens.

  • PDF