• Title/Summary/Keyword: R-package

Search Result 543, Processing Time 0.03 seconds

Integrative Comparison of Burrows-Wheeler Transform-Based Mapping Algorithm with de Bruijn Graph for Identification of Lung/Liver Cancer-Specific Gene

  • Ajaykumar, Atul;Yang, Jung Jin
    • Journal of Microbiology and Biotechnology
    • /
    • v.32 no.2
    • /
    • pp.149-159
    • /
    • 2022
  • Cancers of the lung and liver are the top 10 leading causes of cancer death worldwide. Thus, it is essential to identify the genes specifically expressed in these two cancer types to develop new therapeutics. Although many messenger RNA (mRNA) sequencing data related to these cancer cells are available due to the advancement of next-generation sequencing (NGS) technologies, optimized data processing methods need to be developed to identify the novel cancer-specific genes. Here, we conducted an analytical comparison between Bowtie2, a Burrows-Wheeler transform-based alignment tool, and Kallisto, which adopts pseudo alignment based on a transcriptome de Bruijn graph using mRNA sequencing data on normal cells and lung/liver cancer tissues. Before using cancer data, simulated mRNA sequencing reads were generated, and the high Transcripts Per Million (TPM) values were compared. mRNA sequencing reads data on lung/liver cancer cells were also extracted and quantified. While Kallisto could directly give the output in TPM values, Bowtie2 provided the counts. Thus, TPM values were calculated by processing the Sequence Alignment Map (SAM) file in R using package Rsubread and subsequently in python. The analysis of the simulated sequencing data revealed that Kallisto could detect more transcripts and had a higher overlap over Bowtie2. The evaluation of these two data processing methods using the known lung cancer biomarkers concludes that in standard settings without any dedicated quality control, Kallisto is more effective at producing faster and more accurate results than Bowtie2. Such conclusions were also drawn and confirmed with the known biomarkers specific to liver cancer.

Topic Modeling of News Article Related to Franchise Regulation Using LDA (LDA 를 이용한 '프랜차이즈 규제' 관련 뉴스기사 토픽모델링)

  • YANG, Woo-Ryeong;YANG, Hoe Chang
    • The Korean Journal of Franchise Management
    • /
    • v.13 no.4
    • /
    • pp.1-12
    • /
    • 2022
  • Purpose: In 2020, the franchise industry accomplished a significant growth compared to the previous year, as the number of franchise companies increased by 9.0% while the number of franchise brands increased by 12.5%. Despite growth in size, the Korean franchise industry underwent many negative incidents, such as franchise ownership sales to private equity funds, that led to deterioration of businesses. From this point of view, this study aims to make various proposals to help policy makers develop franchise industry policies by analyzing trends of the current and previous presidential administrations' franchise policies and regulations using newspaper articles. Research design, data and methodology: A total of 7,439 articles registered in Naver API from February 25, 2013 to November 29, 2021 were extracted. Among them, 34 unrelated video articles were deleted, and a total of 7,405 articles from both administrations were used for analysis. The R package was used for word frequency analysis, word clouding, word correlation analysis, and LDA (Latent Dirichlet Allocation) topic modeling. Results: The keyword frequency analysis shows that the most frequently mentioned keywords during the previous administration include 'no-brand', 'major company', 'bill', 'business field', and 'SMEs', and those mentioned during the current administration include 'industry' and 'policy'. As a result of LDA topic modeling, 9 topics such as 'global startups' and 'job creation' from the previous administration, and 10 topics such as 'franchise business' and 'distribution industry' from the current administration were derived. The results of LDAvis showed that the previous administration operated a policy based on mutual growth of large and small businesses rather than hostile regulations in the franchise business, whereas the current administration extended the regulation related to franchise business to the employment sector. Conclusions: The analysis of past two administrations' franchise policy, it can be suggested that franchisors and franchisees may complement each other in developing the Fair Transactions in Franchise Business Act and achieving balanced growth. Moreover, political support is needed for sound development of franchisors. Limitations and future research suggestions are presented at the end of this study.

An Exploratory Analysis of Online Discussion of Library and Information Science Professionals in India using Text Mining

  • Garg, Mohit;Kanjilal, Uma
    • Journal of Information Science Theory and Practice
    • /
    • v.10 no.3
    • /
    • pp.40-56
    • /
    • 2022
  • This paper aims to implement a topic modeling technique for extracting the topics of online discussions among library professionals in India. Topic modeling is the established text mining technique popularly used for modeling text data from Twitter, Facebook, Yelp, and other social media platforms. The present study modeled the online discussions of Library and Information Science (LIS) professionals posted on Lis Links. The text data of these posts was extracted using a program written in R using the package "rvest." The data was pre-processed to remove blank posts, posts having text in non-English fonts, punctuation, URLs, emails, etc. Topic modeling with the Latent Dirichlet Allocation algorithm was applied to the pre-processed corpus to identify each topic associated with the posts. The frequency analysis of the occurrence of words in the text corpus was calculated. The results found that the most frequent words included: library, information, university, librarian, book, professional, science, research, paper, question, answer, and management. This shows that the LIS professionals actively discussed exams, research, and library operations on the forum of Lis Links. The study categorized the online discussions on Lis Links into ten topics, i.e. "LIS Recruitment," "LIS Issues," "Other Discussion," "LIS Education," "LIS Research," "LIS Exams," "General Information related to Library," "LIS Admission," "Library and Professional Activities," and "Information Communication Technology (ICT)." It was found that the majority of the posts belonged to "LIS Exam," followed by "Other Discussions" and "General Information related to the Library."

3D FE modeling and parametric analysis of steel fiber reinforced concrete haunched beams

  • Al Jawahery, Mohammed S.;Cevik, Abdulkadir;Gulsan, Mehmet Eren
    • Advances in concrete construction
    • /
    • v.13 no.1
    • /
    • pp.45-69
    • /
    • 2022
  • This paper investigates the shear behavior of reinforced concrete haunched beams (RCHBs) without stirrups. The research objective is to study the effectiveness of the ideal steel fiber (SF) ratio, which is used to resist shear strength, besides the influence of main steel reinforcement, compressive strength, and inclination angles of the haunched beam. The modeling and analysis were carried out by Finite Element Method (FE) based on a software package, called Atena-GiD 3D. The program of this study comprises two-part. One of them consists of nine results of experimental SF RCHBs which are used to identify the accuracy of FE models. The other part comprises 81 FE models, which are divided into three groups. Each group differed from another group by the area of main steel reinforcement (As) which are 226, 339, and 509 mm2. The other parameters which are considered in each group in the same quantities to study the effectiveness of them, were steel fiber volumetric ratios (0.0, 0.5, and 1.0)%, compressive strength (20.0, 40.0, 60.0) MPa, and the inclination angle of haunched beam (0.0°, 10.0°, and 15.0°). Moreover, the parametric analysis was carried out on SF RCHBs to clarify the effectiveness of each parameter on the mechanical behavior of SF RCHBs. The results show that the correlation coefficient (R2) between shear load capacities of FE proposed models and shear load capacities of experimental SF RCHBs is 0.9793, while the effective inclination angle of the haunched beam is 10° which contributes to resisting shear strength, besides the ideal ratio of steel fibers is 1% when the compressive strength of SF RCHBs is more than 20 MPa.

Comparative Analysis of Patients Visiting Department of Korean Internal Medicine in a Korean Medicine Hospital Before and During COVID-19 - From July 2018 to June 2021 at Wonkwang University Jeonju Korean Medicine Hospital - (COVID-19 전후 단일 한방병원 한방내과 내원환자들에 대한 비교 분석 - 2018년 7월부터 2021년 6월까지 원광대학교 전주한방병원을 중심으로 -)

  • Lee, Ji-eun;Shin, Yong-jeen;Shin, Sun-ho
    • The Journal of Internal Korean Medicine
    • /
    • v.42 no.6
    • /
    • pp.1255-1268
    • /
    • 2021
  • Objectives: This study aimed to analyze the healthcare utilization behavior of patients visiting the department of Korean internal medicine in the Korean medicine hospital of Wonkwang University in Jeon-ju from July 2018 to June 2021. Methods: We retrospectively analyzed the medical records of 26,108 patients and sorted the data by period, month, visiting types, new or returning types, sex, and age group. IBM SPSS 26.0 and the R 4.05 'changepoint' package were used with various statistical methods, such as Independent t-test, Mann-Whitney test, Chi-square test, Simple regression analysis. The P-value was set at 0.05. Results and Conclusions: Females outnumbered males regardless of period, and the ratio of females fell after COVID-19. Regardless of visiting types, patients in their 50s, 60s, and 70s outrated any other age group. The average number of females among the returning patients decreased significantly after COVID-19, but did not in males. Outpatients under 10 and in their 10s decreased significantly after COVID-19, as did inpatients in their 40s and 60s. The average duration of hospitalization was extended significantly after COVID-19. The number of outpatients and inpatients decreased as time passed after COVID-19. We expect that the results of this study will be used as reference materials in analyzing the effects of COVID-19 on healthcare utilization.

Clinical effectiveness of different types of bone-anchored maxillary protraction devices for skeletal Class III malocclusion: Systematic review and network meta-analysis

  • Wang, Jiangwei;Yang, Yingying;Wang, Yingxue;Zhang, Lu;Ji, Wei;Hong, Zheng;Zhang, Linkun
    • The korean journal of orthodontics
    • /
    • v.52 no.5
    • /
    • pp.313-323
    • /
    • 2022
  • Objective: This study aimed to estimate the clinical effects of different types of bone-anchored maxillary protraction devices by using a network meta-analysis. Methods: We searched seven databases for randomized and controlled clinical trials that compared bone-anchored maxillary protraction with tooth-anchored maxillary protraction interventions or untreated groups up to May 2021. After literature selection, data extraction, and quality assessment, we calculated the mean differences, 95% confidence intervals, and surface under the cumulative ranking scores of eleven indicators. Statistical analysis was performed using R statistical software with the GeMTC package based on the Bayesian framework. Results: Six interventions and 667 patients were involved in 18 studies. In comparison with the tooth-anchored groups, the bone-anchored groups showed significantly more increases in Sella-Nasion-Subspinale (°), Subspinale-Nasion-Supramentale(°) and significantly fewer increases in mandibular plane angle and the labial proclination angle of upper incisors. In comparison with the control group, Sella-Nasion-Supramentale(°) decreased without any statistical significance in all treated groups. IMPA (angle of lower incisors and mandibular plane) decreased in groups with facemasks and increased in other groups. Conclusions: Bone-anchored maxillary protraction can promote greater maxillary forward movement and correct the Class III intermaxillary relationship better, in addition to showing less clockwise rotation of mandible and labial proclination of upper incisors. However, strengthening anchorage could not inhibit mandibular growth better and the lingual inclination of lower incisors caused by the treatment is related to the use of a facemask.

A Study on Factors Affecting Intention to Use Online Collaboration Tools for the Non-Face-to-Face Educational Environment (비대면 교육 환경에서 온라인 협업 툴 사용의도에 영향을 미치는 요인에 관한 연구)

  • Seo, Jay;An, Sunju;Choi, Jeongil
    • Journal of Korean Society for Quality Management
    • /
    • v.50 no.3
    • /
    • pp.571-591
    • /
    • 2022
  • Purpose: The purpose of this study is to examine the factors affecting the intention to use online collaboration tools for non-face-to-face educational environment in the perspective of the learners. Methods: For empirical analysis, the survey of this study was administered with data that were limited to experienced learners using online collaboration tools such as Google Docs, Allo, Padlet, and Slido in online education environments such as Zoom, Webex, MS Teams, etc. and valid 400 data were analyzed by SPSS(ver 22.0) and R(ver 4.1.0) program package. Results: The results of empirical analysis showed that performance expectancy were found to have an effect on reliability of system quality, empathy of service quality, playfulness and informativity of content quality among the characteristics of online collaboration tools. On the other hand, it was found that the security of system quality, responsiveness of service quality, and extroversion of user personality characteristics did not affect. It was analyzed that playfulness had the greatest positive effect, followed by informativity, empathy, and reliability. Among the characteristics of online collaboration tools, it was found that the reliability and security of system quality and informativity of content quality had an effect on the effort expectancy. It was analyzed that informativity has the greatest influence, followed by security and reliability. Conclusion: This study is meaningful in that it examines the perspectives of users and learners, who can be said to be the end customers of online collaboration tools. Based on the results of this study, it is expected that not only platform operators that provide online collaborative tools, but also providers that use online collaboration tools will have a significant impact on the development of edutech and infrastructure in the educational environment.

Comparison of Herbs in Prescription Composition of Consumptive Disease and Internal Injury in Donguibogam Through Network Analysis (네트워크 분석을 통한 동의보감(東醫寶鑑) 내상(內傷)문과 허로(虛勞)문의 처방 구성 본초 비교)

  • Chien-hsin Kuo;Heung Ko;Seon-mi Shin
    • The Journal of Internal Korean Medicine
    • /
    • v.44 no.1
    • /
    • pp.35-52
    • /
    • 2023
  • Objective: Internal injuries and consumptive disease have different causes, yet they can affect each other. The relationship and combination of prescription drugs in the clinical practice of internal injuries and consumptive disease were analyzed for various diseases of "Donguibogam" through network analysis. Methods: The prescriptions used in consumptive disease and internal injury were established by conducting a full survey on the papers extracted from Donguibogam. The R version 4.0.3 (2020-10-10) and the igraph and arules package were used to perform network analysis and association rule relationship mining analysis in the first and second prescription compositions. Results: The herb frequently used for internal injury was Glycyrrhizae Radix, while the herb combination frequently used was Citri Pericarpium-Glycyrrhizae Radix. For centrality, the main factor was generally Glycyrrhizae Radix. In the case of consumptive disease, the herb most frequently used was Angelicae Gigantis Radix, and the combination most frequently used was Rehmanniae Radix Preparata-Angelicae Gigantis Radix. In terms of centrality, it was Angelicae Gigantis Radix. As a result of the network analysis of herbal prescription frequency, each group was divided into three. Conclusion: The interrelationship between internal injury and consumptive disease prescription drugs may reveal the differences and similarities between internal injury and consumptive disease and may serve as a basis for the development of new drugs or materials that can enhance mutual effectiveness in the treatment of internal injury and consumptive diseases.

Intensity estimation with log-linear Poisson model on linear networks

  • Idris Demirsoy;Fred W. Hufferb
    • Communications for Statistical Applications and Methods
    • /
    • v.30 no.1
    • /
    • pp.95-107
    • /
    • 2023
  • Purpose: The statistical analysis of point processes on linear networks is a recent area of research that studies processes of events happening randomly in space (or space-time) but with locations limited to reside on a linear network. For example, traffic accidents happen at random places that are limited to lying on a network of streets. This paper applies techniques developed for point processes on linear networks and the tools available in the R-package spatstat to estimate the intensity of traffic accidents in Leon County, Florida. Methods: The intensity of accidents on the linear network of streets is estimated using log-linear Poisson models which incorporate cubic basis spline (B-spline) terms which are functions of the x and y coordinates. The splines used equally-spaced knots. Ten different models are fit to the data using a variety of covariates. The models are compared with each other using an analysis of deviance for nested models. Results: We found all covariates contributed significantly to the model. AIC and BIC were used to select 9 as the number of knots. Additionally, covariates have different effects such as increasing the speed limit would decrease traffic accident intensity by 0.9794 but increasing the number of lanes would result in an increase in the intensity of traffic accidents by 1.086. Conclusion: Our analysis shows that if other conditions are held fixed, the number of accidents actually decreases on roads with higher speed limits. The software we currently use allows our models to contain only spatial covariates and does not permit the use of temporal or space-time covariates. We would like to extend our models to include such covariates which would allow us to include weather conditions or the presence of special events (football games or concerts) as covariates.

Assessment of population structure and genetic diversity of German Angora rabbit through pedigree analysis

  • Abdul Rahim;K. S. Rajaravindra;Om Hari Chaturvedi;S. R. Sharma
    • Animal Bioscience
    • /
    • v.36 no.5
    • /
    • pp.692-703
    • /
    • 2023
  • Objective: The main goals of this investigation were to i) assess the population structure and genetic diversity and ii) determine the efficiency of the ongoing breeding program in a closed flock of Angora rabbits through pedigree analysis. Methods: The pedigree records of 6,145 animals, born between 1996 to 2020 at NTRS, ICAR-CSWRI, Garsa were analyzed using ENDOG version 4.8 software package. The genealogical information, genetic conservation index and parameters based on gene origin probabilities were estimated. Results: Analysis revealed that, 99.09% of the kits had both parents recorded in the whole dataset. The completeness levels for the whole pedigree were 99.12%, 97.12%, 90.66%, 82.49%, and 74.11% for the 1st, 2nd, 3rd, 4th, and 5th generations, respectively, reflecting well-maintained pedigree records. The maximum inbreeding, average inbreeding and relatedness were 36.96%, 8.07%, and 15.82%, respectively. The mean maximum, mean equivalent and mean completed generations were 10.28, 7.91, and 5.51 with 0.85%, 1.19%, and 1.85% increase in inbreeding, respectively. The effective population size estimated from maximum, equivalent and complete generations were 58.50, 27.05, and 42.08, respectively. Only 1.51% of total mating was highly inbred. The effective population size computed via the individual increase in inbreeding was 42.83. The effective numbers of founders (fe), ancestors (fa), founder genomes (fg) and non-founder genomes (fng) were 18, 16, 6.22, and 9.50, respectively. The fe/fa ratio was 1.12, indicating occasional bottlenecks had occurred in the population. The six most influential ancestors explained 50% of genes contributed to the gene pool. The average generation interval was 1.51 years and was longer for the sire-offspring pathway. The population lost 8% genetic diversity over time, however, considerable genetic variability still existed in the closed Angora population. Conclusion: This study provides important and practical insights to manage and maintain the genetic variability within the individual flock and the entire population.