• Title/Summary/Keyword: SELECT model

The Use of Reinforcement Learning and The Reference Page Selection Method to improve Web Spidering Performance (웹 탐색 성능 향상을 위한 강화학습 이용과 기준 페이지 선택 기법)

  • 이기철;이선애
    • Journal of the Korea Computer Industry Society / v.3 no.3 / pp.331-340 / 2002
  • The web is growing so large and intractable that, without an intelligent information extractor, users become increasingly helpless. Conventional web spidering techniques built for general-purpose search engines may be too slow for specialized search engines, which concentrate only on specific areas or keywords. In this paper, a new model for improving web spidering capabilities is proposed and tested experimentally. Selecting adequate reference web pages from the initial web page set relevant to a given area (or set of keywords) can be very important for reducing spidering time. Our reference web page selection method, DOPS, dynamically and orthogonally selects web pages, and it can also decide the appropriate number of reference pages using a newly defined measure. Even for a very specific area, this method performed almost at the level of experts. Considering that experts cannot work on a huge initial page set and still have difficulty deciding the optimal number of reference web pages, this method seems very promising. We also applied reinforcement learning to the web environment, and DOPS-based reinforcement learning experiments show that our method performs favorably in terms of both the number of hyperlinks and time.

Does Breast Cancer Drive the Building of Survival Probability Models among States? An Assessment of Goodness of Fit for Patient Data from SEER Registries

  • Khan, Hafiz;Saxena, Anshul;Perisetti, Abhilash;Rafiq, Aamrin;Gabbidon, Kemesha;Mende, Sarah;Lyuksyutova, Maria;Quesada, Kandi;Blakely, Summre;Torres, Tiffany;Afesse, Mahlet
    • Asian Pacific Journal of Cancer Prevention / v.17 no.12 / pp.5287-5294 / 2016
  • Background: Breast cancer is a worldwide public health concern and is the most prevalent type of cancer in women in the United States. This study concerned the best fit of statistical probability models on the basis of survival times for nine state cancer registries: California, Connecticut, Georgia, Hawaii, Iowa, Michigan, New Mexico, Utah, and Washington. Materials and Methods: A probability random sampling method was applied to select and extract records of 2,000 breast cancer patients from the Surveillance Epidemiology and End Results (SEER) database for each of the nine state cancer registries used in this study. EasyFit software was utilized to identify the best probability models by using goodness of fit tests, and to estimate parameters for various statistical probability distributions that fit survival data. Results: Statistical analysis for the summary of statistics is reported for each of the states for the years 1973 to 2012. Kolmogorov-Smirnov, Anderson-Darling, and Chi-squared goodness of fit test values were used for survival data, the highest values of goodness of fit statistics being considered indicative of the best fit survival model for each state. Conclusions: It was found that California, Connecticut, Georgia, Iowa, New Mexico, and Washington followed the Burr probability distribution, while the Dagum probability distribution gave the best fit for Michigan and Utah, and Hawaii followed the Gamma probability distribution. These findings highlight differences between states through selected sociodemographic variables and also demonstrate probability modeling differences in breast cancer survival times. The results of this study can be used to guide healthcare providers and researchers for further investigations into social and environmental factors in order to reduce the occurrence of and mortality due to breast cancer.

Static Analysis Based on Backward Control Flow Graph Generation Method Model for Program Analysis (프로그램 분석을 위한 정적분석 기반 역추적 제어흐름그래프 생성 방안 모델)

  • Park, Sunghyun;Kim, Yeonsu;Noh, Bongnam
    • Journal of the Korea Institute of Information Security & Cryptology / v.29 no.5 / pp.1039-1048 / 2019
  • Symbolic execution, an automatic search method for vulnerability verification, has improved technically over the last few years. However, it is still not practical to analyze a program using symbolic execution alone. One of the biggest reasons is the path explosion problem that occurs during program analysis: memory runs out, and symbolic execution cannot solve every path in the program. Thus, it is more practical for the analyst to construct a path for symbolic execution to a potentially vulnerable target than to solve all paths. In this paper, we propose a static-analysis-based backward CFG (Control Flow Graph) generation technique that can be used in symbolic execution for program analysis. With a backward CFG, an analyst can select potentially vulnerable points, and the backward path generated from such a point can be used for subsequent symbolic execution. We conducted experiments with Linux binaries (x86) and showed that potential vulnerability selection and backward CFG path generation were possible in a variety of binary situations.
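
The idea of walking a control flow graph backward from a chosen target can be illustrated with a minimal sketch. This is not the authors' tool: the graph, the block names (`entry`, `check_len`, `copy_buf`, and so on), and the helper functions are all hypothetical, and a plain breadth-first search over reversed edges stands in for whatever analysis the paper performs on real binaries.

```python
from collections import deque


def reverse_cfg(cfg):
    """Build the predecessor map of a forward CFG given as {block: [successors]}."""
    preds = {block: [] for block in cfg}
    for block, successors in cfg.items():
        for succ in successors:
            preds.setdefault(succ, []).append(block)
    return preds


def backward_path(cfg, entry, target):
    """Search backward from target over reversed edges.

    Returns a shortest entry -> target path as a list of block names,
    or None if target is not reachable from entry.
    """
    preds = reverse_cfg(cfg)
    next_hop = {target: None}  # block -> the block that follows it toward target
    queue = deque([target])
    while queue:
        block = queue.popleft()
        if block == entry:
            path = []
            while block is not None:
                path.append(block)
                block = next_hop[block]
            return path
        for pred in preds.get(block, []):
            if pred not in next_hop:
                next_hop[pred] = block
                queue.append(pred)
    return None


# Hypothetical toy CFG; "copy_buf" plays the role of a potentially vulnerable point.
TOY_CFG = {
    "entry": ["check_len", "error"],
    "check_len": ["copy_buf", "error"],
    "copy_buf": ["exit"],
    "error": ["exit"],
    "exit": [],
}

if __name__ == "__main__":
    print(backward_path(TOY_CFG, "entry", "copy_buf"))
```

Only the blocks on the returned path would then need to be handed to a symbolic executor, which is the motivation the abstract gives for avoiding full path exploration.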

A Study of Receptive Factors of Smartphone Service from the User's Perspective (스마트폰 서비스의 수용적 요인에 관한 연구 : 사용자 관점에서)

  • Choi, Junhyeog;Baek, Yeongtae;Han, Seungjin
    • Journal of the Korea Society of Computer and Information / v.18 no.11 / pp.181-190 / 2013
  • This study first aims to investigate, from the users' perspective, which services are the most efficient for users and which are relatively the most effective for the investment required, among the variety of services provided by smartphone manufacturers, telecommunication companies, and related corporations. In addition, it suggests important elements for shaping the future model of smartphone services. To this end, the study identifies the factors that generate users' positive or negative attitudes towards smartphone use through a questionnaire given to current smartphone users. In particular, by applying the Theory of Planned Behavior, the study analyzes the influence of users' beliefs about each kind of service, setting up Attitudinal Belief, Subjective Norms, and Control Belief, which influence attitude, at the level of the detailed services that smartphones provide. The results of this study will help smartphone manufacturers, telecommunication companies, and related corporations establish smartphone marketing strategies and select the smartphone services that will have popular appeal for their users.

Optimization of MOF-801 Synthesis Using Sequential Design of Experiments (순차적 실험계획법을 이용한 MOF-801 합성공정 최적화)

  • Lee, Min Hyung;Yoo, Kye Sang
    • Applied Chemistry for Engineering / v.32 no.6 / pp.621-626 / 2021
  • A sequential design of experiments was used to optimize the MOF-801 synthesis process. For the initial screening, a general $2^k$ factorial design was selected, followed by a central composite design, one of the response surface methods. A $2^3$ factorial design based on the molar ratios of fumaric acid, dimethylformamide (DMF), and formic acid was performed to select the more suitable response variable for the design of experiments, between the crystallinity and the BET specific surface area of MOF-801. After 8 synthesis experiments designed with MINITAB 19 software were performed, the products were characterized using XRD analysis and the nitrogen adsorption method. The crystallinity, with $R^2 = 0.999$, was found to be more suitable for the experimental method than the BET specific surface area. Based on analysis of variance (ANOVA), it was confirmed that the molar ratios of fumaric acid and formic acid were the major factors in determining the crystallinity of MOF-801. Through response optimization and a contour plot of the two factors, the optimal molar ratio of ZrOCl2·8H2O : fumaric acid : DMF : formic acid was found to be 1 : 1 : 39 : 35. To optimize the synthesis process further, a central composite design on synthesis time and temperature was performed under the identical molar ratio of precursors. The results of the 9 designed synthesis experiments were analyzed using a quadratic model equation. The maximum crystallinity of MOF-801 was thus predicted at a synthesis time and temperature of 7.8 h and 123 ℃, respectively.
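
To make the screening step concrete, the sketch below enumerates the 8 runs of a two-level, three-factor ($2^3$) full factorial design in coded units (-1 = low, +1 = high) and computes main effects. The factor names follow the abstract, but the response values are made up purely for illustration and do not come from the paper; the helper names are likewise hypothetical.

```python
from itertools import product

FACTORS = ("fumaric_acid", "DMF", "formic_acid")


def full_factorial(factors):
    """Return every combination of coded levels (-1, +1) as a list of dicts."""
    return [dict(zip(factors, levels))
            for levels in product((-1, 1), repeat=len(factors))]


def main_effect(runs, responses, factor):
    """Mean response at the high level minus mean response at the low level."""
    high = [y for run, y in zip(runs, responses) if run[factor] == 1]
    low = [y for run, y in zip(runs, responses) if run[factor] == -1]
    return sum(high) / len(high) - sum(low) / len(low)


runs = full_factorial(FACTORS)

# Made-up responses (NOT measured data): a synthetic "crystallinity" that
# depends on fumaric acid and formic acid but not on DMF.
responses = [50 + 10 * r["fumaric_acid"] + 5 * r["formic_acid"] for r in runs]

if __name__ == "__main__":
    for name in FACTORS:
        print(name, main_effect(runs, responses, name))
```

With a real data set, the effects would normally be followed by ANOVA to judge which ones are significant, as the abstract describes.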

Study on the Production Process of Performance Arts Visualization Projects: Focused on a Case Analysis of NT Live Cinema Broadcasts (공연예술 영상화 제작과정 연구:NT Live 시네마 브로드캐스트 사례분석을 중심으로)

  • Park, Jin-Won;Kim, Ga-eun
    • The Journal of the Korea Contents Association / v.21 no.7 / pp.45-58 / 2021
  • This study aims to select representative performing arts visualization projects that respond to changes in how contemporary performing arts consumers enjoy culture and in what they need, in order to create and vitalize performing arts cultural value suited to the Fourth Industrial Revolution and a global age; to verify the potential of performance projects to create new cultural value; and to examine the important issues and key points of their production processes. Focusing on the report 'NT Live-Digital broadcast of theatre Learning from the pilot season' (2011), a thorough analysis was conducted of the "NT Live" project of the Royal National Theatre of England, a leading model of cinema broadcast performance visualization projects, covering its purpose, production processes (copyright agreements, personnel composition, filming and broadcasting), marketing methods, and audiences. Observations were also made of production processes and cultural and artistic values that differ from those of existing performing arts, to examine the administrative and financial key points for the sustainability of performance visualization projects. Through this, the potential of cinema broadcast (live performance broadcast) viewing, as a new form of performing arts product, to create sources of artistic, cultural, and economic value can be verified. In addition, the development of various performance approaches that respond to audiences' ways of enjoying culture and their consumption patterns will vitalize performing arts visualization projects by enhancing popular appeal and expanding the audience types of the performing arts field.

Machine Learning Prediction of Economic Effects of Busan's Strategic Industry through Ridge Regression and Lasso Regression (릿지 회귀와 라쏘 회귀 모형에 의한 부산 전략산업의 지역경제 효과에 대한 머신러닝 예측)

  • Yi, Chae-Deug
    • Journal of Korea Port Economic Association / v.37 no.1 / pp.197-215 / 2021
  • This paper analyzes machine learning predictions of the economic effects of Busan's strategic industries on employment and income using Ridge Regression and Lasso Regression models with regularization terms. According to the Ridge and Lasso estimation models of employment, the intelligence information service industry (such as the service platform, contents, and smart finance industries) and the global tourism industry (such as MICE and specialized tourism) are predicted to influence employment, in that order. However, the Ridge and Lasso regression models show that the future transportation machine industry does not significantly increase employment or income, since it is a primitive investment industry. The Ridge estimation models of income show that the intelligence information service industry and the global tourism industry are also predicted to influence income, in that order. According to the Lasso estimation models of income, four strategic industries (life care, smart maritime, intelligence machine, and clean tech) do not influence income. Furthermore, the future transportation machine industry may influence income negatively, since it is a primitive investment industry. Thus, we have to select appropriate economic objectives and priorities for industrial policy.
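
Why a Lasso model can report that some industries "do not influence" the outcome, while Ridge only shrinks their coefficients, can be seen in the simplest possible case: one predictor and no intercept. The sketch below uses the closed-form minimizers for that case; the objective scaling, the function names, and the toy data are assumptions for illustration and are unrelated to the paper's data set.

```python
def _sums(x, y):
    """Return (sum of x*y, sum of x*x) for two equal-length sequences."""
    sxy = sum(a * b for a, b in zip(x, y))
    sxx = sum(a * a for a in x)
    return sxy, sxx


def ridge_coef(x, y, lam):
    """Minimizer of 0.5*sum((y - b*x)**2) + 0.5*lam*b**2 (one predictor)."""
    sxy, sxx = _sums(x, y)
    return sxy / (sxx + lam)


def lasso_coef(x, y, lam):
    """Minimizer of 0.5*sum((y - b*x)**2) + lam*abs(b) (one predictor).

    This is the soft-thresholding rule: a weak correlation is set to exactly 0.
    """
    sxy, sxx = _sums(x, y)
    if sxy > lam:
        return (sxy - lam) / sxx
    if sxy < -lam:
        return (sxy + lam) / sxx
    return 0.0


# Toy data (illustrative only): one strongly related and one weakly related response.
x = [1.0, 2.0, 3.0, 4.0]
y_strong = [2.0, 4.0, 6.0, 8.0]
y_weak = [0.5, -0.5, 0.5, -0.5]

if __name__ == "__main__":
    for name, y in (("strong", y_strong), ("weak", y_weak)):
        print(name, ridge_coef(x, y, 5.0), lasso_coef(x, y, 5.0))
```

For the weak response, the Ridge coefficient is small but nonzero, whereas the Lasso coefficient is exactly zero, which is the variable-selection behavior that distinguishes the two penalties.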

Predicting Successful Conservative Surgery after Neoadjuvant Chemotherapy in Hormone Receptor-Positive, HER2-Negative Breast Cancer

  • Ko, Chang Seok;Kim, Kyu Min;Lee, Jong Won;Lee, Han Shin;Lee, Sae Byul;Sohn, Guiyun;Kim, Jisun;Kim, Hee Jeong;Chung, Il Yong;Ko, Beom Seok;Son, Byung Ho;Ahn, Seung Do;Kim, Sung-Bae;Kim, Hak Hee;Ahn, Sei Hyun
    • Journal of Breast Disease / v.6 no.2 / pp.52-59 / 2018
  • Purpose: This study aimed to determine whether clinicopathological factors are potentially associated with successful breast-conserving surgery (BCS) after neoadjuvant chemotherapy (NAC) and develop a nomogram for predicting successful BCS candidates, focusing on those who are diagnosed with hormone receptor (HR)-positive, human epidermal growth factor receptor 2 (HER2)-negative tumors during the pre-NAC period. Methods: The training cohort included 239 patients with an HR-positive, HER2-negative tumor ($\geq 3$ cm), and all of these patients had received NAC. Patients were excluded if they met any of the following criteria: diffuse, suspicious, malignant microcalcification (extent >4 cm); multicentric or multifocal breast cancer; inflammatory breast cancer; distant metastases at the time of diagnosis; excisional biopsy prior to NAC; and bilateral breast cancer. Multivariate logistic regression analysis was conducted to evaluate the possible predictors of BCS eligibility after NAC, and the regression model was used to develop the predicting nomogram. This nomogram was built using the training cohort (n=239) and was later validated with an independent validation cohort (n=123). Results: Small tumor size (p<0.001) at initial diagnosis, long distance from the nipple (p=0.002), high body mass index (p=0.001), and weak positivity for progesterone receptor (p=0.037) were found to be four independent predictors of an increased probability of BCS after NAC; further, these variables were used as covariates in developing the nomogram. For the training and validation cohorts, the areas under the receiver operating characteristic curve were 0.833 and 0.786, respectively; these values demonstrate the potential predictive power of this nomogram. Conclusion: This study established a new nomogram to predict successful BCS in patients with HR-positive, HER2-negative breast cancer. Given that chemotherapy is an option with unreliable outcomes for this subtype, this nomogram may be used to select patients for NAC followed by successful BCS.
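
The area under the receiver operating characteristic curve reported above can be read as the probability that a randomly chosen positive case receives a higher predicted score than a randomly chosen negative case. A minimal sketch of that pairwise definition follows; the function name and the toy labels and scores are hypothetical and have no connection to the study's cohorts.

```python
def roc_auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a correct pair.
    """
    pos = [s for label, s in zip(labels, scores) if label == 1]
    neg = [s for label, s in zip(labels, scores) if label == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    # Toy example: 1 = successful outcome, 0 = unsuccessful (made-up values).
    print(roc_auc([1, 1, 0, 0], [0.9, 0.4, 0.5, 0.1]))
```

An AUC of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation, which gives a scale for interpreting values such as 0.833 and 0.786.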

Design and Implementation of Real-time web Crawling distributed monitoring system (실시간 웹 크롤링 분산 모니터링 시스템 설계 및 구현)

  • Kim, Yeong-A;Kim, Gea-Hee;Kim, Hyun-Ju;Kim, Chang-Geun
    • Journal of Convergence for Information Technology / v.9 no.1 / pp.45-53 / 2019
  • In this rapidly changing information era, we face problems from the excessive information served by websites. Little of that information is useful, much is useless, and we spend a lot of time selecting the information we need. Many websites, including search engines, use web crawling to keep their data up to date. Web crawling is usually used to generate copies of all the pages of visited sites, and search engines index the pages for faster searching. For collecting wholesale and order information that changes in real time, keyword-oriented web data collection is not adequate, and no alternative for selective real-time collection of web information has been suggested. In this paper, we propose a method of collecting information from restricted web sites by using a Web crawling distributed monitoring system (R-WCMS), estimating collection time through detailed analysis of the data, and storing the data in a parallel system. Experimental results show that applying the proposed model to web site information retrieval reduces the time by 15-17%.

A Study on Social Media Advertising of Plastic Surgery Using Eye-Tracking (아이트래킹을 활용한 성형외과 소셜 미디어광고의 시선 추적 연구)

  • Son, Jeong-Eun;Jung, Eui-Tay;Paik, Jin-Kyung
    • The Journal of the Korea Contents Association / v.19 no.7 / pp.1-12 / 2019
  • According to a 2017 survey by the Korea Press Foundation on the frequency of access to medical ads, the ads to which adult men and women are most commonly exposed are those for beauty, plastic surgery, and obesity. As of 2011, South Korea had the largest number of cosmetic surgeries in the world, with 131 cosmetic surgeries per 10,000 people. As many as 1,414 plastic surgery clinics are operating in South Korea, and the number is on the rise. Although there are various standards for evaluating people's appearance, the desire to pursue a better look is growing day by day. One might then wonder what factors influence consumers' choices among the numerous advertisements for plastic surgery clinics. Based on this question, this study collected examples of plastic surgery advertisements, analyzed their types, and identified the types of advertisement with high visual appeal to advertising consumers through an eye-tracking experiment. In total, seven eye-tracking tests of plastic surgery social media advertisements were conducted on 10 subjects. The results showed that the commercial model was the factor that most strongly attracted the attention of ad recipients first, and that the factor viewed most intently and for the longest time was the treatment content. Therefore, it is important to select proper commercial models for hospital and clinic content and to specify factual treatment content when producing social media advertisements for plastic surgeons. We hope these findings will help create effective online advertising for plastic surgery clinics.