• 제목/요약/키워드: Small Domains

검색결과 195건 처리시간 0.026초

Improving methods for normalizing biomedical text entities with concepts from an ontology with (almost) no training data at BLAH5 the CONTES

  • Ferre, Arnaud;Ba, Mouhamadou;Bossy, Robert
    • Genomics & Informatics
    • /
    • 제17권2호
    • /
    • pp.20.1-20.5
    • /
    • 2019
  • Entity normalization, or entity linking in the general domain, is an information extraction task that aims to annotate/bind multiple words/expressions in raw text with semantic references, such as concepts of an ontology. An ontology consists minimally of a formally organized vocabulary or hierarchy of terms, which captures knowledge of a domain. Presently, machine-learning methods, often coupled with distributional representations, achieve good performance. However, these require large training datasets, which are not always available, especially for tasks in specialized domains. CONTES (CONcept-TErm System) is a supervised method that addresses entity normalization with ontology concepts using small training datasets. CONTES has some limitations, such as it does not scale well with very large ontologies, it tends to overgeneralize predictions, and it lacks valid representations for the out-of-vocabulary words. Here, we propose to assess different methods to reduce the dimensionality in the representation of the ontology. We also propose to calibrate parameters in order to make the predictions more accurate, and to address the problem of out-of-vocabulary words, with a specific method.

협동로봇 시장 진출 성공요인 분석 (Analysis of Factors for the Success in Entry into Cooperation Robot Market)

  • 김신표
    • 산업융합연구
    • /
    • 제15권1호
    • /
    • pp.43-52
    • /
    • 2017
  • Robot refers to machines that recognize the external environment and assess the given situations in order to operate autonomously by imitating the manner in which humans behave. Although Korea still lacks global competitiveness, Korea, as the $4^{th}$ ranked robot manufacturing country in the world, is currently expanding the domains of robots from application in manufacturing to application in service provision. Accordingly, this study aims to analyze the factors for the success in entry into the cooperation robot market among various robotic markets in accordance with the literary research method in consideration for the importance of robot industry that could determine the future national competitiveness. The result of the analysis of the factors for the success in entry into the cooperation robot market, shows that factors including analysis of the trends in manufacturing robot market, strategy for benchmarking of the leading cooperation robot companies, activation of small and medium enterprise-centered cooperation robotic industry, excavation of demands for cooperation robots with focus on automobile, semiconductor and IT industries, utilization of the opportunities provided by government's robotic industry policies and standardization of cooperation robot components, etc. determine whether one will succeed in the market or not. Furthermore, it is believed that fortification of competitiveness of the manufacturing sector through the powerful policy support for the robotic industry at government level and policies on cultivation of new growth engine through specialization of the robotic areas closely related to daily life must be implemented concurrently because it is forecasted that competitiveness in robotics technology will become the criterion for national competitiveness in the future.

  • PDF

Identification and molecular characterization of doublesex and mab-3-related transcription factor(dmrt) in brackish water flea, Diaphanosoma celebensis, exposed to bisphenol analogs

  • Cho, Hayoung;Jeon, Min Jeong;Lee, Young-Mi
    • 환경생물
    • /
    • 제39권2호
    • /
    • pp.160-168
    • /
    • 2021
  • Doublesex and mab-3 related transcription factor(dmrt) play crucial roles in sex determination and sex differentiation in vertebrates and invertebrates. Although dmrt genes have been identified in vertebrates, little is known about aquatic invertebrates. In this study, two dmrt genes, namely, Dc_dmrt93B and Dc_dmrt99B, were identified from brackish water flea, Diaphanosoma celebensis. Transcriptional changes were observed in the dmrt genes when the flea was exposed to bisphenol(BP), an endocrine disruptor. Sequence and phylogenetic analyses showed that both dmrt genes contained two conserved domains, namely, DM and DMA, closely clustered with those of Daphnia spp. Additionally, a significant increase in the Dc_dmrt99B mRNA expression level was observed upon exposure to intermediate concentrations of BP (bisphenol A>bisphenol S=bisphenol F, p<0.05), while the expression of Dc_dmrt93B mRNA was slightly modulated. These findings imply that the two dmrt genes may be involved in sex differentiation of D. celebensis. Furthermore, it was found that the ability of BP to modulate dmrt genes could affect development and reproduction. This study provides a basis for understanding the function of the dmrt genes and the molecular mode of action of BP in small crustaceans.

수중 폭발 시뮬레이션을 위한 경계 요소법 기반의 수치 해석 기법 연구 (A Study on BEM-Based Numerical Simulation Technique for Underwater Explosions)

  • 정준모;이재빈
    • 대한조선학회논문집
    • /
    • 제57권5호
    • /
    • pp.271-277
    • /
    • 2020
  • Recoverability and vulnerability of navy ships under underwater explosion are critical verification factors in the acquisition phase of navy ships. This paper aims to establish numerical analysis techniques for the underwater explosion of navy ships. Doubly Asymptotic Approach (DAA) Equation of Motion (EOM) of primary shock wave and secondary bubble pulse proposed by Geers-Hunter was introduced. Assuming a non-compressive fluid, reference solution of the DAA EOM of Geers-Hunter using Runge-Kutta method was derived for the secondary bubble pulse phase with an assumed charge conditions. Convergence analyses to determine fluid element size were performed, suggesting that the minimum fluid element size for underwater explosion analysis was 0.1 m. The spherical and cylindrical fluid domains were found to be appropriate for the underwater explosion analyses from the fluid domain shape study. Because the element size of 0.1 m was too small to be applied to the actual navy ships, a very slender beam with the square solid section was selected for the study of fluid domain existence effect. The two underwater explosion models with/without fluid domain provided very similar results in terms of the displacement and stress processes.

조직.기관의 분화와 유전자 발현의 조절, 최근의 진보 (Recent Advancement in the Differentiation of Tissues and Organs and Regulation of Gene Expression)

  • Harn, Chang-Yawl
    • 식물조직배양학회지
    • /
    • 제24권1호
    • /
    • pp.1-35
    • /
    • 1997
  • Fertilized egg, by successive cell divisions, differentiates into different tissues and organs with various structures and functions. Different cells and tissues contain different proteins, products of selective gene expression. Not all the genes in any genomes are equally active, temporal and spatial gene expression being the general rule. Present paper attempts to review the tanscriptional mechanisms or the initiations of transcription from several angles. In some of the organisms the genes in the process of transcription or the genes in the inactive state can be seen under the light microscope. Some bands of Drosophila polytene chromosomes may exhibit a swollen or puff appearance under certain conditions. A puff, unfolded or decondensed form of chromomere, represents sets of intense transcriptional activity or RNA synthesis. The heterochromatic X chromosome whose genes remain inactive in the female mammals can be visualized as a dark staining structure called Barr body, Configuration of chromatin differs between transcribed and nontranscribed chromatin. Modification to the chromatin facilitates RNA synthesis. The movement of large polymerase molecule along the DNA would probably be facilitated if some modifications of the chromatin configuration is effected. Methylation of cytosines in CG sequences is associated with inactive genes. Methylation can play a role in determination of mammalian cells during embryogenesis. Demethylation is necessary for the gene to be expressed during development A histone modification that is also known to be correlated with transcriptional capacity of chromatin is acetylation of the lysine residues of the core histones. Chromatin containing a high level of histone acetylation is very sensitive to DNase 1. For the transcription to occur TBP must first bind to the TATA box. Another TF, TF IIB, then binds to the promoter-TBP complex, facilitating the access of RNA polymerase to the transcription initiation site. As recently as eight years ago researchers assumed that histones were irrelevant to the regulation of gene expression. Histones combine with the DNA to form nucleosome of the chromatin. Histones are vital participant in gene regulation. Histone and basal factors compete for access to TATA box. When DNA is exposed to basal factors before histones are introduced, the basal factors assemble on TATA boxes preventing the access of histones, allowing transcription to occur, for transcription to begin, activator protein at the upstream activation sequence or enhancer must interact with the tail of histone H4 at TATA box and cause the histone role particle to dissociate from the TATA box leading to partial breakup of the histone core particle and allowing the basal factors to bind to the TATA box. New concept of genomic flux in contrast to the old concept of static genome has been developed based on the powerful new molecular techniques. Genomic changes such as repetitive DNAs and transposable elements, it is assumed but not yet proved, may affect some of the developmental patterns that characterize particular cells, tissues, organs, and organisms. In the last decade or so remarkable achievement have been made in the researches of the structures and functions of TFs and the specific target sequences located in promoters or enhancers where these TFs bind. TFs have independent domains that bind DNA and that activate transcription. DNA binding domain of TFs serves to bring the protein into the right location. There are many types of DNA binding domains. Common types of motifs can be found that are responsible for binding to DNA. The motifs are usually quite short and comprise only a small part of the protein structure. Steroid receptors have domains for hormone binding, DNA binding, and activating transcription. The zinc finger motif comprises a DNA binding domain. Leucine zipper consist of a stretch of amino acids with a leucine residue in every seventh position Two proteins form a dimer because they interact by means of leucine zippers on similar α-helical domain. This positions their DNA binding basic domains for interaction with the two halves of a DNA sequence with dyad symmetry of TGACTCA, ACTGAGT.

  • PDF

Psychometric Properties of the Persian Version of Satisfaction with Care EORTC-in-patsat32 Questionnaire among Iranian Cancer Patients

  • Pishkuhi, Mahin Ahmadi;Salmaniyan, Soraya;Nedjat, Saharnaz;Zendedel, Kazem;Lari, Mohsen Asadi
    • Asian Pacific Journal of Cancer Prevention
    • /
    • 제15권23호
    • /
    • pp.10121-10128
    • /
    • 2015
  • Background: Cancers impose an increasing burden on health of the populations and individuals, but little is known about cancer patient satisfaction with care. The aim of this study was to assess the psychometric properties of the Persian version of European Organisation for Research and Treatment of Cancer (EORTC) In-Patsat32, as a recently developed questionnaire to assess cancer patient satisfaction with care and information provided during hospital admission. Materials and Methods: Complying with EORTC protocols, the Persian version of Inpatsat32 was translated and piloted in a small group of patients, then applied to 380 cancer patients admitted to different oncology wards in Tehran. Validity (convergent, discriminant, and divergent) and reliability of the tool was assessed through using multitrait analysis, factor analysis, intraclass correlations, Chronbach's alpha and test-retest (on a sample of 70 patients). Results: Good acceptance and high sensitivity of the questionnaire with low floor and ceiling effects were recognized, indicating power of the instrument to detect differences between groups with heterogeneous levels of satisfaction. Multitrait scaling analyses supported the convergent validity of the majority of scales (correlation coefficient >0.4) and favorable discriminant validity (item own scale correlation >0.8). There was no correlation between In-patsat32 scales and the EORTC-C30, which measures different concepts, confirming divergent validity of the tool. Internal consistency for all domains was high (${\alpha}$ >0.70) except for the hospital access score and the test-retest reliability was excellent (r=0.86-0.96). There was a weak responsiveness to change except for nurses technical skills. Principle component analysis confirmed five domains with much improved internal consistency (${\alpha}$ >0.9). Conclusions: The Persian version of the EORTC-in-patsat32 module is a reliable and valid instrument to measure cancer patient satisfaction with care received during their hospitalization period and can be utilized in clinical cancer research.

라벨이 없는 데이터를 사용한 종단간 음성인식기의 준교사 방식 도메인 적응 (Semi-supervised domain adaptation using unlabeled data for end-to-end speech recognition)

  • 정현재;구자현;김회린
    • 말소리와 음성과학
    • /
    • 제12권2호
    • /
    • pp.29-37
    • /
    • 2020
  • 최근 신경망 기반 심층학습 알고리즘의 적용으로 고전적인 Gaussian mixture model based hidden Markov model (GMM-HMM) 음성인식기에 비해 성능이 비약적으로 향상되었다. 또한 심층학습 기법의 장점을 더욱 잘 활용하는 방법으로 언어모델링 및 디코딩 과정을 통합처리 하는 종단간 음성인식 시스템에 대한 연구가 매우 활발히 진행되고 있다. 일반적으로 종단간 음성인식 시스템은 어텐션을 사용한 여러 층의 인코더-디코더 구조로 이루어져 있다. 때문에 종단간 음성인식 시스템이 충분히 좋은 성능을 내기 위해서는 많은 양의 음성과 문자열이 함께 있는 데이터가 필요하다. 음성-문자열 짝 데이터를 구하기 위해서는 사람의 노동력과 시간이 많이 필요하여 종단간 음성인식기를 구축하는 데 있어서 높은 장벽이 되고 있다. 그렇기에 비교적 적은 양의 음성-문자열 짝 데이터를 이용하여 종단간 음성인식기의 성능을 향상하는 선행연구들이 있으나, 음성 단일 데이터나 문자열 단일 데이터 한쪽만을 활용하여 진행된 연구가 대부분이다. 본 연구에서는 음성 또는 문자열 단일 데이터를 함께 이용하여 종단간 음성인식기가 다른 도메인의 말뭉치에서도 좋은 성능을 낼 수 있도록 하는 준교사 학습 방식을 제안했으며, 성격이 다른 도메인에 적응하여 제안된 방식이 효과적으로 동작하는지 확인하였다. 그 결과로 제안된 방식이 타깃 도메인에서 좋은 성능을 보임과 동시에 소스 도메인에서도 크게 열화되지 않는 성능을 보임을 알 수 있었다.

Effect of Core Morphology on the Decomposition of CCI₄ over the Surface of Core/Shell Structured Fe₂O₃/MgO Composite Metal Oxides

  • 김해진;강진;박동곤;권호진;Kenneth J. Klabunde
    • Bulletin of the Korean Chemical Society
    • /
    • 제18권8호
    • /
    • pp.831-840
    • /
    • 1997
  • Core/shell structured composite metal oxides of Fe2O3/MgO were prepared by thermal decomposition of Fe(acac)3 adsorbed on the surface of MgO cores. The morphology of the composites conformed to that of the MgO used as the cores. Broad powder X-ray diffraction peaks shifted toward larger d, large BET surface area (∼350 m2/g), and the size of crystalline domains in nano range (4 nm), all corroborate to the nanocrystallinity of the Fe2O3/MgO composite which was prepared by using nanocrystalline MgO as the core. By use of microcrystalline MgO as the core, microcrystalline Fe2O3/MgO composite was prepared, and it had small BET surface area of less than 35 m2/g. AFM measurements on nanocrystalline Fe2O3/MgO showed a collection of spherical aggregates (∼80 nm dia) with a very rough surface. On the contrary, microcrystalline Fe2O3/MgO was a collection of plate-like flat crystallites with a smooth surface. The nitrogen adsorption-desorption behavior indicated that microcrystalline Fe2O3/MgO was nonporous, whereas nanocrystalline Fe2O3/MgO was mesoporous. Bimodal distribution of the pore size became unimodal as the layer of Fe2O3 was applied to nanocrystalline MgO. The macropores in a wide distribution which the nanocrystalline MgO had were absent in the nanocrystalline Fe2O3/MgO. The decomposition of CCl4 was largily enhanced by the overlayer of Fe2O3 on nanocrystalline MgO making the reaction between nanocrystalline Fe2O3/MgO and CCl4 be nearly stoichiometric. The reaction products were environmentally benign MgCl2 and CO2. Such an enhancement was not attainable with the microcrystalline samples. Even for the nanocrystalline MgO, the enhancement was not attained, if not with the Fe2O3 layer. Without the layer of Fe2O3, it was observed that the nanocrystalline domain of the MgO transformed into microcrystalline one as the decomposition of CCl4 proceeded on its surface. It appeared that the layer of Fe2O3 on the particles of nanocrystalline Fe2O3/MgO blocked the transformation of the nanocrystalline domain into microcrystalline one. Therefore, in order to attain stoichiometric reaction between CCl4 and Fe2O3/MgO core/shell structured composite metal oxide, the morphology of the core MgO has to be nanocrystalline, and also the nanocrystalline domains has to be sustained until the core was exhausted into MgCl2.

Some characters of bacterial cellulases in goats' rumen elucidated by metagenomic DNA analysis and the role of fibronectin 3 module for endoglucanase function

  • Nguyen, Khanh Hoang Viet;Dao, Trong Khoa;Nguyen, Hong Duong;Nguyen, Khanh Hai;Nguyen, Thi Quy;Nguyen, Thuy Tien;Nguyen, Thi Mai Phuong;Truong, Nam Hai;Do, Thi Huyen
    • Animal Bioscience
    • /
    • 제34권5호
    • /
    • pp.867-879
    • /
    • 2021
  • Objective: Fibronectin 3 (FN3) and immunoglobulin like modules (Ig) are usually collocated beside modular cellulase catalytic domains. However, very few researches have investigated the role of these modules. In a previous study, we have sequenced and analyzed bacterial metagenomic DNA in Vietnamese goats' rumen and found that cellulase-producing bacteria and cellulase families were dominant. In this study, the properties of modular cellulases and the role of a FN3 in unique endoglucanase belonging to glycosyl hydorlase (GH) family 5 were determined. Methods: Based on Pfam analysis, the cellulases sequences containing FN3, Ig modules were extracted from 297 complete open reading frames (ORFs). The alkaline, thermostability, tertiary structure of deduced enzymes were predicted by AcalPred, TBI software, Phyre2 and Swiss models. Then, whole and truncated forms of a selected gene were expressed in Escherichia coli and purified by His-tag affinity column for assessment of FN3 ability to enhance enzyme activity, solubility and conformation. Results: From 297 complete ORFs coding for cellulases, 148 sequences containing FN3, Ig were identified. Mostly FN3 appeared in 90.9% beta-glucosidases belonging to glycosyl hydrolase family 3 (GH3) and situated downstream of catalytic domains. The Ig was found upstream of 100% endoglucanase GH9. Rarely FN3 was seen to be situated downstream of X domain and upstream of catalytic domain endoglucanase GH5. Whole enzyme (called XFN3GH5 based on modular structure) and truncate forms FN3, XFN3, FN3GH5, GH5 were cloned in pET22b (+) and pET22SUMO to be expressed in single and fusion forms with a small ubiquitin-related modifier partner (S). The FN3, SFN3 increased GH5 solubility in FN3GH5, SFN3GH5. The SFN3 partly served for GH5 conformation in SFN3GH5, increased modules interaction and enzyme-soluble substrate affinity to enhance SXFN3GH5, SFN3GH5 activities in mixtures. Both SFN3 and SXFN3 did not anchor enzyme on filter paper but exfoliate and separate cellulose chains on filter paper for enzyme hydrolysis. Conclusion: Based on these findings, the presence of FN3 module in certain cellulases was confirmed and it assisted for enzyme conformation and activity in both soluble and insoluble substrate.

해상 부이 보호 및 선박 사고 예방을 위한 트레일 카메라-AIS 연계형 능동감시 및 접근경보 시스템 개발 (A Development of Active Monitoring and Approach Alarm System for Marine Buoy Protection and Ship Accident Prevention based on Trail Cameras and AIS)

  • 황훈규;김배성;김헌우;강용수;김대한
    • 한국정보통신학회논문지
    • /
    • 제22권7호
    • /
    • pp.1021-1029
    • /
    • 2018
  • 해상 부이는 항로 및 위험물 표지, 기상 및 해양 환경 모니터링, 군사 전략 요소 등 다양한 목적으로 운용되는 설비이다. 이러한 해상 부이가 선박 충돌 등으로 인해 손상되면 해양이라는 특수성으로 인해 복구 및 교체 작업에 많은 시간과 비용이 소요되며, 표류 시 2차 사고의 위험성이 존재한다. 본 논문에서는 이러한 해상 부이를 보호하기 위하여 트레일 카메라 및 AIS를 활용한 능동감시 및 접근경보 시스템의 개발에 관한 내용을 다룬다. 이러한 시스템의 개발을 위하여 기존 국내외 연구 및 유사 시스템 개발 사례를 분석한 후, 개선 요구사항을 도출하고, 도출된 내용을 바탕으로 시스템을 설계한다. 설계 시 주안점을 둔 내용으로는 AIS와 트레일 카메라 연계형 능동 감시, 선박 접근에 대한 단계별 경보, 육상과 부이의 거리에 따른 선택적 통신매체 적용, 영상 처리를 통한 선박 식별 및 경보 제공, 열화상 카메라의 적용 등 크게 다섯 가지가 있다. 또한, 설계된 내용을 바탕으로 시스템을 개발하고, 실험실 혹은 필드 수준의 테스트를 통해 개발한 시스템의 유용성을 검증한다.