• Title/Summary/Keyword: Log Clustering

Search Result 73, Processing Time 0.027 seconds

Automatic Extract User Intention from Web Search Log (웹 정보 검색 이력을 이용한 사용자 의도 자동 추출)

  • Park, Kinam;Jung, Soonyoung;Suh, Taewon;Ji, Hyesung;Lee, Taemin;Lim, Heuiseok
    • The Journal of Korean Association of Computer Education
    • /
    • v.12 no.6
    • /
    • pp.21-32
    • /
    • 2009
  • This paper proposes a method to extract a user's intention automatically and implementation of intention map that support a user can appropriate search results using a user' information need accurately. It selects user intention based on searching history obtained from previous users' same queries and extracts user intentions by using clustering algorithm and user intention extraction algorithm, extracted user intentions are represented in an intention map base on a theory of knowledge representation. For the efficiency analysis of intention map, we extracted user intentions using 2,600 search history data which provided by a current domestic commercial search engine. The experimental results using the information intention map search when using general search engines represent more than satisfaction was statistically significant.

  • PDF

논제 부정 Access에 대한 Firewall의 과제와 대책

  • 변성준;서정석;최원석
    • Proceedings of the Korea Database Society Conference
    • /
    • 2000.11a
    • /
    • pp.227-238
    • /
    • 2000
  • Firewall은 다양한 부정Access의 방지책으로서 확실히 유효한 수단이지만 이 Firewall은 사용자로부터 지시된 설정을 충실히 실행하는 것으로 설정 오류, 소프트웨어의 정지, 허가된 룰을 악용한 침입 등 반드시 사용자가 바라는 작용을 무조건적 상태에서 보증해 주는 것은 아니다. 따라서 사용자는 도입 후 에도 운용시에 Access log를 감시하고 본래의 Security Policy에 반하는 행위를 매일 매일 체크하지 않으면 안될 상황에 처해 있다. 본 연구는 이러한 부정Access에 대한 이와 같은 Firewall의 현상에 대한 과제 중에서 "부정Access를 어떻게 하면 일찍, 정확히 체크할 수 있는가\ulcorner"라는 주제를 선택하여 Firewall의 한계와 그 대응책을 실제로 부정Access를 시험해 보는 것으로 검증하기로 하였다. 실험결과에서 (1)Port Scan이나 전자메일 폭탄(서비스정지공격)등은 Firewall로 방지하는 것은 불가능하거나 혹은 Checking이 곤란하다. (2)공격마다 로그 수집을 했음에도 관계없이 Firewall의 로그는 번잡하므로 단시간에 사태의 발견이 대단히 곤란하다고 하는 Firewall의 한계를 인식하였다. 그리고 그 대책으로서 우리는 체크 툴의 유효성에 착안하여 조사한 결과, 결국 무엇이 부정Access인가에 대해서는 어디까지나 이용하는 측이 판단하여 Firewall 상에 설정하지 않으면 안되지만 체크 툴은 이 부정Access 정보를 데이터베이스로서 갖고 있음으로써 '무엇이 부정Access인가'를 이용자 대신에 판단하고 툴에 따라서는 설정을 자동적으로 변경하여 부정 Access의 저지율을 향상시킨다. 이처럼 체크 툴은 Firewall의 수비능력을 보강하는 위치에 있다고 생각할 수 있다.다. 4 장에서는 3장에서 제기한 각각의 문제점에 대해 RAD 의 관점에 비추어 e-business 시스템의 단기개발을 실현하기 위한 고려사항이나 조건 해결책을 제안한다. 본 논문이 지금부터 e-business 를 시작하려고 하는 분, e-business 시스템의 개발을 시작하려고 하는 분께 단기간의 e-business 실현을 위한 하나의 지침이 된다면 다행이겠다.formable template is used to optimize the matching. Then, clustering the similar shapes by the distance between each centroid, papaya can be completely detected from the background.uage ("Association of research for algorithm of calculating machine (1992)"). As a result, conventional NN and CNN were available for interpolation of sampling data. Moreover, when nonlinear intensity is not so large under the field condition of small slope, interpolation performance of CNN was a little not so better than NN. However, when nonlinear intensity is large under the field condition of large slope, interpolation performance of CNN was relatively better than NN.콩과 자연 콩이 성분 분석에서 차이를

  • PDF

Hedging effectiveness of KOSPI200 index futures through VECM-CC-GARCH model (벡터오차수정모형과 다변량 GARCH 모형을 이용한 코스피200 선물의 헷지성과 분석)

  • Kwon, Dongan;Lee, Taewook
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.6
    • /
    • pp.1449-1466
    • /
    • 2014
  • In this paper, we consider a hedge portfolio based on futures of underlying asset. A classical way to estimate a hedge ratio for a hedge portfolio of a spot and futures is a regression analysis. However, a regression analysis is not capable of reflecting long-run equilibrium between a spot and futures and volatility clustering in the conditional variance of financial time series. In order to overcome such defects, we analyzed KOSPI200 index and futures using VECM-CC-GARCH model and computed a hedge ratio from the estimated conditional covariance-variance matrix. In real data analysis, we compared a regression and VECM-CC-GARCH models in terms of hedge effectiveness based on variance, value at risk and expected shortfall of log-returns of hedge portfolio. The empirical results show that the multivariate GARCH models significantly outperform a regression analysis and improve hedging effectiveness in the period of high volatility.

Expression profiling of cultured podocytes exposed to nephrotic plasma reveals intrinsic molecular signatures of nephrotic syndrome

  • Panigrahi, Stuti;Pardeshi, Varsha Chhotusing;Chandrasekaran, Karthikeyan;Neelakandan, Karthik;PS, Hari;Vasudevan, Anil
    • Clinical and Experimental Pediatrics
    • /
    • v.64 no.7
    • /
    • pp.355-363
    • /
    • 2021
  • Background: Nephrotic syndrome (NS) is a common renal disorder in children attributed to podocyte injury. However, children with the same diagnosis have markedly variable treatment responses, clinical courses, and outcomes, suggesting molecular heterogeneity. Purpose: This study aimed to explore the molecular responses of podocytes to nephrotic plasma to identify specific genes and signaling pathways differentiating various clinical NS groups as well as biological processes that drive injury in normal podocytes. Methods: Transcriptome profiles from immortalized human podocyte cell line exposed to the plasma of 8 subjects (steroid-sensitive nephrotic syndrome [SSNS], n=4; steroid-resistant nephrotic syndrome [SRNS], n=2; and healthy adult individuals [control], n=2) were generated using microarray analysis. Results: Unsupervised hierarchical clustering of global gene expression data was broadly correlated with the clinical classification of NS. Differential gene expression (DGE) analysis of diseased groups (SSNS or SRNS) versus healthy controls identified 105 genes (58 up-regulated, 47 down-regulated) in SSNS and 139 genes (78 up-regulated, 61 down-regulated) in SRNS with 55 common to SSNS and SRNS, while the rest were unique (50 in SSNS, 84 genes in SRNS). Pathway analysis of the significant (P≤0.05, -1≤ log2 FC ≥1) differentially expressed genes identified the transforming growth factor-β and Janus kinase-signal transducer and activator of transcription pathways to be involved in both SSNS and SRNS. DGE analysis of SSNS versus SRNS identified 2,350 genes with values of P≤0.05, and a heatmap of corresponding expression values of these genes in each subject showed clear differences in SSNS and SRNS. Conclusion: Our study observations indicate that, although podocyte injury follows similar pathways in different clinical subgroups, the pathways are modulated differently as evidenced by the heatmap. Such transcriptome profiling with a larger cohort can stratify patients into intrinsic subtypes and provide insight into the molecular mechanisms of podocyte injury.

Clinical comparison of marginal fit of ceramic inlays between digital and conventional impressions

  • Franklin Guillermo Vargas-Corral;Americo Ernesto Vargas-Corral;Miguel Angel Rodríguez Valverde;Manuel Bravo;Juan Ignacio Rosales Leal
    • The Journal of Advanced Prosthodontics
    • /
    • v.16 no.1
    • /
    • pp.57-65
    • /
    • 2024
  • PURPOSE. The aim of this stuldy was to compare the clinical marginal fit of CAD-CAM inlays obtained from intraoral digital impression or addition silicone impression techniques. MATERIALS AND METHODS. The study included 31 inlays for prosthodontics purposes of 31 patients: 15 based on intraoral digital impressions (DI group); and 16 based on a conventional impression technique (CI group). Inlays included occlusal and a non-occlusal surface. Inlays were milled in ceramic. The inlay-teeth interface was replicated by placing each inlay in its corresponding uncemented clinical preparation and taking interface impressions with silicone material from occlusal and free surfaces. Interface analysis was made using white light confocal microscopy (WLCM) (scanning area: 694 × 510 ㎛2) from the impression samples. The gap size and the inlay overextension were measured from the microscopy topographies. For analytical purposes (i.e., 95-%-confidence intervals calculations and P-value calculations), the procedure REGRESS in SUDAAN was used to account for clustering (i.e., multiple measurements). For p-value calculation, the log transformation of the dependent variables was used to normalize the distributions. RESULTS. Marginal fit values for occlusal and free surfaces were affected by the type of impression. There were no differences between surfaces (occlusal vs. free). Gap obtained for DI group was 164 ± 84 ㎛ and that for CI group was 209 ± 104 ㎛, and there were statistical differences between them (p = .041). Mean overextension values were 60 ± 59 ㎛ for DI group and 67 ± 73 ㎛ for CI group, and there were no differences between then (p = .553). CONCLUSION. Digital impression achieved inlays with higher clinical marginal fit and performed better than the conventional silicone materials.

An Interactive Approach to Categorize Questions on the Internet BBSs (인터넷 게시판 질문 분류를 위한 인터랙티브 접근방법에 관한 연구)

  • Jae-Kwang Lee;Seong-Ho Noh;Ok-Hyun Ryou
    • The Journal of Society for e-Business Studies
    • /
    • v.8 no.3
    • /
    • pp.177-195
    • /
    • 2003
  • In a traditional customer support environment, mainly call centers or service centers are responsible for receiving inquiries from their customers via telephone calls. Due to the rapid growth of Internet with its widespread acceptance and accessibility, means of communication with customers in the traditional customer support center, such as telephones, letters, and direct-visiting, have been replaced by e-mails and bulletin board systems (BBSs) using the Internet constantly. BBSs are basically question and answer systems, they require some lead time to get answer from administrator. To reduce lead time, BBSs enable remote customers or users to log on and tap into a knowledge database that is generally formatted in the form of Frequently Asked Questions (FAQs) that provide answers and solutions to the common problems. And, many different types of the questions are mixed on the BBS. It is a burden to administrator. To build FAQs and to support BBS adminstrator, a supporting tool which is to categorize questions is helpful. In this research, we suggest an interactive question categorizing methodology which consists of steps to present question using keywords, identifying keywords' affinity, computing similarity among questions, and clustering questions. This methodology allows users to interact iteratively for clear manifestation of ambiguous questions. We also developed a prototype system, IQC (interactive question categorizer) and evaluated its performance using the comparison experiments with other systems. IQC is not a general purposed system, but it produces a good result in a given specific domain.

  • PDF

Comparison between Planned and Actual Data of Block Assembly Process using Process Mining in Shipyards (조선 산업에서 프로세스 마이닝을 이용한 블록 조립 프로세스의 계획 및 실적 비교 분석)

  • Lee, Dongha;Park, Jae Hun;Bae, Hyerim
    • The Journal of Society for e-Business Studies
    • /
    • v.18 no.4
    • /
    • pp.145-167
    • /
    • 2013
  • This paper proposes a method to compare planned processes with actual processes of bock assembly operations in shipbuilding industry. Process models can be discovered using the process mining techniques both for planned and actual log data. The comparison between planned and actual process is focused in this paper. The analysis procedure consists of five steps : 1) data pre-processing, 2) definition of analysis level, 3) clustering of assembly bocks, 4) discovery of process model per cluster, and 5) comparison between planned and actual processes per cluster. In step 5, it is proposed to compare those processes by the several perspectives such as process model, task, process instance and fitness. For each perspective, we also defined comparison factors. Especially, in the fitness perspective, cross fitness is proposed and analyzed by the quantity of fitness between the discovered process model by own data and the other data(for example, the fitness of planned model to actual data, and the fitness of actual model to planned data). The effectiveness of the proposed methods was verified in a case study using planned data of block assembly planning system (BAPS) and actual data generated from block assembly monitoring system (BAMS) of a top ranked shipbuilding company in Korea.

Toward understanding learning patterns in an open online learning platform using process mining (프로세스 마이닝을 활용한 온라인 교육 오픈 플랫폼 내 학습 패턴 분석 방법 개발)

  • Taeyoung Kim;Hyomin Kim;Minsu Cho
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.2
    • /
    • pp.285-301
    • /
    • 2023
  • Due to the increasing demand and importance of non-face-to-face education, open online learning platforms are getting interests both domestically and internationally. These platforms exhibit different characteristics from online courses by universities and other educational institutions. In particular, students engaged in these platforms can receive more learner autonomy, and the development of tools to assist learning is required. From the past, researchers have attempted to utilize process mining to understand realistic study behaviors and derive learning patterns. However, it has a deficiency to employ it to the open online learning platforms. Moreover, existing research has primarily focused on the process model perspective, including process model discovery, but lacks a method for the process pattern and instance perspectives. In this study, we propose a method to identify learning patterns within an open online learning platform using process mining techniques. To achieve this, we suggest three different viewpoints, e.g., model-level, variant-level, and instance-level, to comprehend the learning patterns, and various techniques are employed, such as process discovery, conformance checking, autoencoder-based clustering, and predictive approaches. To validate this method, we collected a learning log of machine learning-related courses on a domestic open education platform. The results unveiled a spaghetti-like process model that can be differentiated into a standard learning pattern and three abnormal patterns. Furthermore, as a result of deriving a pattern classification model, our model achieved a high accuracy of 0.86 when predicting the pattern of instances based on the initial 30% of the entire flow. This study contributes to systematically analyze learners' patterns using process mining.

Association between High Diffusion-Weighted Imaging-Derived Functional Tumor Burden of Peritoneal Carcinomatosis and Overall Survival in Patients with Advanced Ovarian Carcinoma

  • He An;Jose AU Perucho;Keith WH Chiu;Edward S Hui;Mandy MY Chu;Siew Fei Ngu;Hextan YS Ngan;Elaine YP Lee
    • Korean Journal of Radiology
    • /
    • v.23 no.5
    • /
    • pp.539-547
    • /
    • 2022
  • Objective: To investigate the association between functional tumor burden of peritoneal carcinomatosis (PC) derived from diffusion-weighted imaging (DWI) and overall survival in patients with advanced ovarian carcinoma (OC). Materials and Methods: This prospective study was approved by the local research ethics committee, and informed consent was obtained. Fifty patients (mean age ± standard deviation, 57 ± 12 years) with stage III-IV OC scheduled for primary or interval debulking surgery (IDS) were recruited between June 2016 and December 2021. DWI (b values: 0, 400, and 800 s/mm2) was acquired with a 16-channel phased-array torso coil. The functional PC burden on DWI was derived based on K-means clustering to discard fat, air, and normal tissue. A score similar to the surgical peritoneal cancer index was assigned to each abdominopelvic region, with additional scores assigned to the involvement of critical sites, denoted as the functional peritoneal cancer index (fPCI). The apparent diffusion coefficient (ADC) of the largest lesion was calculated. Patients were dichotomized by immediate surgical outcome into high- and low-risk groups (with and without residual disease, respectively) with subsequent survival analysis using the Kaplan-Meier curve and log-rank test. Multivariable Cox proportional hazards regression was used to evaluate the association between DWI-derived results and overall survival. Results: Fifteen (30.0%) patients underwent primary debulking surgery, and 35 (70.0%) patients received neoadjuvant chemotherapy followed by IDS. Complete tumor debulking was achieved in 32 patients. Patients with residual disease after debulking surgery had reduced overall survival (p = 0.043). The fPCI/ADC was negatively associated with overall survival when accounted for clinicopathological information with a hazard ratio of 1.254 for high fPCI/ADC (95% confidence interval, 1.007-1.560; p = 0.043). Conclusion: A high DWI-derived functional tumor burden was associated with decreased overall survival in patients with advanced OC.

Cluster exploration of water pipe leak and complaints surveillance using a spatio-temporal statistical analysis (스캔통계량 분석을 통한 상수도 누수 및 수질 민원 발생 클러스터 탐색)

  • Juwon Lee;Eunju Kim;Sookhyun Nam;Tae-Mun Hwang
    • Journal of Korean Society of Water and Wastewater
    • /
    • v.37 no.5
    • /
    • pp.261-269
    • /
    • 2023
  • In light of recent social concerns related to issues such as water supply pipe deterioration leading to problems like leaks and degraded water quality, the significance of maintenance efforts to enhance water source quality and ensure a stable water supply has grown substantially. In this study, scan statistic was applied to analyze water quality complaints and water leakage accidents from 2015 to 2021 to present a reasonable method to identify areas requiring improvement in water management. SaTScan, a spatio-temporal statistical analysis program, and ArcGIS were used for spatial information analysis, and clusters with high relative risk (RR) were determined using the maximum log-likelihood ratio, relative risk, and Monte Carlo hypothesis test for I city, the target area. Specifically, in the case of water quality complaints, the analysis results were compared by distinguishing cases occurring before and after the onset of "red water." The period between 2015 and 2019 revealed that preceding the occurrence of red water, the leak cluster at location L2 posed a significantly higher risk (RR: 2.45) than other regions. As for water quality complaints, cluster C2 exhibited a notably elevated RR (RR: 2.21) and appeared concentrated in areas D and S, respectively. On the other hand, post-red water incidents of water quality complaints were predominantly concentrated in area S. The analysis found that the locations of complaint clusters were similar to those of red water incidents. Of these, cluster C7 exhibited a substantial RR of 4.58, signifying more than a twofold increase compared to pre-incident levels. A kernel density map analysis was performed using GIS to identify priority areas for waterworks management based on the central location of clusters and complaint cluster RR data.