Introduction
The gold standard methods for the diagnosis of many infectious diseases caused by bacteria are based on culturing clinical samples in order to carry out a biochemical identification of their etiological agent. These diagnostic tools are widely utilized owing to their affordability, ease of application, and performance. However, these traditional techniques are time-consuming (48 to 72 h) for the complete identification of many species and exhibit uncertainty associated with their lack of sensitivity or specificity (microorganism-dependent). These drawbacks have hindered the etiological identification of many infections in clinical practice, and have led to a tendency towards empirical diagnosis, control, and treatment, especially in the most prevalent infectious diseases.
In the past two decades, the advent of molecular techniques based on DNA amplification (such as polymerase chain reaction (PCR)), have allowed the gradual replacement of the classic techniques for microbiological identification. Owing to their high sensitivity and specificity, many PCR assays have been widely accepted as the new gold standard for the detection of viruses [2] and fastidious pathogens [11, 13, 16, 22, 30], for antimicrobial resistance profiling [5, 8, 13], and for etiological identification of body fluid infections [2, 26].
The PCR is an in vitro replication that allows for the massive and rapid amplification of specific sequences of small DNA fragments [18]. This is a three-step iterative process characterized by the initial hot denaturation of a double-stranded DNA template, followed by an annealing of the specific primers on the target, and finishing with an extension step of the annealed primers by the DNA polymerase reaction. For the massive replication of DNA templates, this process is usually repeated between 25 and 40 times. In order to obtain an efficient in vitro replication in PCR assays, relevant parameters should be coordinated; that is, factors such as cycle conditions (time and temperature of each phase of the cycle and number of cycles) and concentration of the reactants in the mix (primers, magnesium ions, dNTPs, and DNA templates). Nevertheless, the most critical factor in the performance of an in vitro replication (uniplex and multiplex) is the adequate selection of the primer sequence [1, 6, 27, 32]. The primer design process in a PCR is complex and time-consuming owing to the multiple criteria that must be taken into account: (i) the primer length, ranging from 18 to 26 bases; (ii) the guanine-cytosine percentage, between 40% and 60%; (iii) the melting temperature (Tm), with values ranging from 45℃ to 65℃, with 5℃ as the maximum difference between primers; (iv) the absence of base repetitions of more than four bases; and (v) the presence of G, C, GC, or CG in the primer 3’ end. Inconsistencies in these criteria can lead to the formation of unspecific products in PCRs [3], as a consequence of inter- and intra- molecular structures that diminish the potential performance, specificity, and/or sensitivity of DNA amplification [6].
In bacterial infections of multiple etiologies, the identification of the causal agent by PCR techniques is expensive and laborious owing to the need for numerous individual analyses. In these cases, assays of PCR able to amplify multiple DNA targets in a single reaction, such as Multiplex-PCR, have been shown to be the most efficient option. In this type of PCR, multiple pairs of primers are included in the same test tube. However, owing to the several parameters to be considered in the optimization process for the Multiplex-PCR conditions, the implementation of this technology for clinical diagnoses represents a great challenge, especially because of the primer design problem. In this technique, each of the primers used in a mix should be designed to satisfy all the aforementioned parameters. Additionally, in order to efficiently obtain the amplification of multiple target sequences, all the primer sets used should be adjusted as if they were a single pair of primers. Furthermore, for the electrophoresis analysis, all the product sizes should be planned to allow the appropriate resolution in the gel, with a minimum difference among the PCR products of 50 bp.
This practice is usually a highly time-consuming and expensive approach as a result of human error. To overcome these difficulties, in the last decades, several heuristic optimization methods have been implemented for the solution of the Multiplex-PCR primer design problem [21, 29]. In this context, genetic algorithms [29, 31] and particle swarm optimization [33] techniques have been recently used to solve the problem. However, the major drawback of these approaches is the requirement for an a priori definition, by the user, of the parameters that control the algorithm performance and the fact that these parameters are problem-dependent.
In order to overcome these challenges, Begambre and Laier [7] proposed a new self-configured evolutionary hybrid algorithm. This algorithm does not require the definition of its initial search parameters, allowing an operator (who is not a linear or non-linear programming expert) to obtain the optimal or quasi-optimal conditions for amplification of multiple DNA fragments. In this work, a new strategy for Multiplex-PCR primer design is presented based on Particle Swarm Optimization (PSO) – Simplex (s), PSOS [7]. The PSOS algorithm was adapted and used as a tool for in silico rational design of the primers for the Multiplex-PCR assays. This adaptation is called Mult-PSOS. The efficiency and robustness of this method was evaluated in vitro using the Mult-PSOS predicted primers for uropathogen detection. The goal of this multiplex-PCR was to allow, in a single assay, the efficient molecular identification of the most common bacteria associated with urinary tract infection (UTI) (Escherichia coli, Proteus mirabilis, Proteus vulgaris, Pseudomonas aeruginosa, Klebisella pneumoniae, Staphylococcus saprophyticus, and Enterococcus sp.) [10, 12, 14, 25]. It is worth noting that UTIs are the most common and serious infectious diseases among outpatients and inpatients, and represent a significant healthcare burden [10].
Materials and Methods
The Particle Swarm Optimization and the Simplex Algorithm
As described in Refs. [28], [29], and [31], the manual primer design is unstable and the software available to design primers is not capable of selecting the best candidates. Therefore, in this work, the problem is defined as an optimization problem whose principal objective is to build candidate primers and to choose the best candidate according to a fitness function and its restrictions. For the sake of clarity, in this section, a brief description of these algorithms is presented.
The Simplex Algorithm
Here, we present a short review of the classical version of the algorithm. The method uses a polytope (a polytope (in this case referred to as the Simplex) is a geometric object with plane sides, such as a polyhedron, in three dimensions) with N+1 vertices, where N is the number of variables in the optimization problem [19]. The Simplex explores the search space (all the possible values that the variables can take), permitting each of its vertices only three movements (or actions): expansion, contraction (shrinkage), and reflection [19]. These actions allow the Simplex to explore the topography of the fitness function and finally to determine the optimum solution (minimum point). The Simplex coefficients used in this work were σ = 0.5; ϕ = 0.5; χ = 2; ρ = 1 (standard case) as recommended in Ref. [7] with a maximum number of iterations of 200 (stopping criterion).
The PSO Algorithm
PSO is based on a simplified social interaction model. It is known that sharing information in social environments may offer evolutionary advantages to individuals. PSO is a member of the group of computational intelligence methods (or heuristic methods) to solve optimization problems. In this context, the particle (or individual of the swarm) position is updated using social information shared by the swarm members (see Eqs. (a) and (b)) and each particle tries to change its position to a point where the swarm (or the particle itself) has a higher fitness function value [15].
The search is controlled by the following expressions [15]:
where is the position of particle i at iteration k (a position vector whose size is equal to the number of variables in the problem), is the updated velocity of particle k at iteration i+1, stands for the best position of particle k at iteration i, is the best global position at iteration i (the best particle in the swarm), Ran1 and Ran2 are independent random numbers, and ω is the inertia weight (a factor that controls the impact of the previous velocity on the current particle velocity). From Eq. (b), it is evident that the factors C1 (cognitive parameter), C2 (social parameter), and ω (inertia weight) must be carefully chosen by the user to obtain a good result. The values of the PSO factors employed in this study were
0 < w < 1.5; C1 + C2 ≤ 4, with a population of 10 particles.
Particle Swarm Optimization – Simplex PSOS
To solve the Multiplex-PCR primer design problem, several procedures have been employed. Taking into account that the problem has a huge number of possible solutions (the search space is enormous) and discrete variables, many authors have used heuristic algorithms (i.e., genetic algorithms, particle swarm optimization). The principal advantage of these techniques is that they do not require the computation of derivatives or Hessians (difficult or impossible to obtain in this class of problems). However, heuristic methods need the designation of many users’ defined initial factors to control the search (e.g., the cognitive parameter, social parameter, and inertia weight in Eq. (b)). To overcome this drawback, we adopted in this work the concept of a self-configured algorithm [7]. This form of algorithm combines two heuristic methods. The first algorithm (in this case, the nonlinear Simplex, or Simplex) determines the factors that control the performance of the second algorithm (the PSO) and the PSO performs the search in the fitness function values space. This topology guarantees that, no matter what the initial parameters of the first algorithm are, the second procedure always finds a very good solution without user intervention [7]. Fig. 1 shows the plain PSOS.
Fig. 1.Basic PSOS heuristic. (A) Heuristic factor search space and the polytope. S1, S2, and S3 are the Simplex vertex. (B) Fitness function values (given by Eq. (A) or Eq. (B)). PSO1, PSO2, and PSO3 are the swarms controlled by each Simplex vertex.
Based on the ideas outlined above, the Multiplex-PCR primer design proposed in this work can be outlined in four steps:
1) Set the definitions and variables of the Multiplex-PCR optimization problem; 2) Select the first primer pair: first optimization; 3) Multiplex-PCR optimization process: second optimization; and 4) Stopping criterion.
The selection of the first primer pair is made using an optimization process. This primer pair, as well as its restrictions, is used for the second optimization process. Both of the two optimization processes are carried out using an adapted PSOS algorithm [7], called Mult-PSOS. For the sake of precision, each step is explained in detail in the next section.
Mult-PSOS Optimization for Multiplex-PCR Primer Design
As described above, the algorithm has four main steps and its final goal is to obtain a Multiplex-PCR optimum primer design.
1) Set the definitions and variables of the Multiplex-PCR optimization problem. In this study, the definitions described previously in Ref. [28] were applied for the basic parameters in the primer design problem:
Definition of the target sequence or GD: In order to improve the specificity and sensitivity of the Multiplex-PCR for infectious disease diagnosis, the target sequence or GD should fulfill all of the following criteria: (i) a degree of variation in the sequence sufficiently significant to identify the target organism; (ii) high GC content (between 40% and 60%); (iii) multiple copies of the target genome (if possible); and (iv) low presence of tandem repetitions in the target sequences.
Template (GD): The target DNA sequence that is the amplification objective of a PCR. GD is composed of four letters (A, T, C, and G) that represent each one of the nucleotides, which are listed in order of their appearance in DNA. The infinite specificity of the targets proceeds from variations of the length and order of letters in a GD. In PCR experiments, the GD length is <5,000 bp. GD’ is the complementary strand for GD, where T is the complementary letter for each A, C is complementary for each G, and vice versa. Thus, for a GD (GD = ATTAAGGCCATCG…), the GD’=TAATTCCGGTAGC… [28].
Sense primer of GD or Bf = {bi|i is the index of GD between Fs and Fe}, where Fs and Fe are respectively the first and last bases of the sense primer Bf, in GD. The anti-sense primer of GD or Br = {bi|i is the index of GD between Rs and Re}, where Rs is the first letter and Re is the last letter of Br in GD. The individuals or possible solutions of the minimization problem are denoted as a primer pair, which is presented as a Pt vector [28]:
where α is the number of bases of Bf; β is the number of bases between Bf and Br; and γ is the number of bases in the primer Br. The independent variables are defined by (2), (3), and (4), and these equations represent the relative position of each primer [28]:
Primer length (tam): In a P1 primer of GD, the length |tam| is the summation of the number of bases in the primer and is obtained from Eq. (5) [28]:
Melting temperature (Tm): The Tm estimates the DNA-DNA duplex stability and can be calculated in accordance with the thermodynamic nearest-neighbor (NN) model of the fusion process with unified ΔH and ΔS [28], as given by Eq. (6):
where C is the primer concentration in empirically calculated molarity, R is the molar constant of the gases (1.987 cal/K mol), ΔH (kcal/mol) is the enthalpy change, and ΔS is the entropy change for the helix formation. Here, ΔH and ΔS are calculated using the NN model under standardized conditions (250 mMol/l NaCl, pH 7.0) [3]. The stabilizing effect of the salt (sodium) on the duplex is given by
where N is the number of nucleotide pairs in the primer (primer length -1) expressed in bp, and [Na+] is the sodium equivalent in mM calculated by Eq. (8):
Annealing temperature (Ta): The fusion or annealing temperature of the primer on GD. The optimal Ta is calculated according to Rychlik’s formula (Eq. 9) [28]:
where Tm is the melting temperature.
2) Select the first primer pair: first optimization. The second step in the Multiplex-PCR primer design process by Mult-PSOS is the choice of the first primer pair. This procedure follows the classical minimization process described in Refs. [9] and [28] with some modifications. One of the main differences between these works and the current study is that the optimization technique employed in this study does not require the definition of any heuristic parameter (mutation probability, crossover rate, and selection rules in genetic algorithms, or cognitive parameters and inertia weight in PSO) in order to obtain a solution.
The original fitness function [28] and its restrictions described in Ref. [26] are defined as follows:
where (Pt) is the primer pair; tam(Pt) is the primer length; tamd(Pt) is the length difference between the two primers of a pair; Tmd(Pt) is the Tm difference into a primer pair; GCP(Pt) is the GC%; GCc(Bf) + GCc(Br) is the G, C, CG, or GC presence in the 3’ end (Bf) or (Br) primer; Uni(Pt) is the primer specificity; Sc(Pt) indicates the presence of self-complementary sequences; PC(Pt) indicates the presence of complementary structures within the pair; and Rt(Pt) is the restriction site check function.
In order to improve the primer’s sensitivity, specificity, and efficiency in Multiplex-PCR for infectious disease diagnosis and through analyzing previous studies [1, 9, 22, 31], it was possible for us to define new terms and different weights for the parameters described in Eq. (I) [28].
The modified fitness function is presented in Eq. (A):
where Tm(Pt) is the melting temperature; Ta(Pt) is the annealing temperature of a primer in Pt; Tad(Pt) is the Ta(Pt) difference of the members in a primer pair; Term(Pt) is the 3’ end stability related to the ΔG value (a divergent definition of the Term(Pt) presented in Eq. (I)); PCh(Pt) is the number of internal loops with a ΔG ≥ -6kcal/mol; and Rep(Pt) is the false priming prediction due to base repetition in tandem. Finally, la(Pt) is the amplicon length. These factors take into account the kinetic of the primer-template duplex. Different weights from Ref. [26] were assumed for tam(Pt), tamd(Pt), GCp(Pt), and PC(Pt). The weights (3, 1, 3, 10, and 50) in Eq. (A) were applied considering the relevance of the primer restrictions. These values are set based on realistic conditions and on the studies mentioned above [1, 9, 28].
After inserting the target sequence (GD), 10 pairs of primers were defined as the initial population (see Eqs. (1)-(4)). Within this population, the Mult-PSOS chooses, at each iteration, the global best (the best primer pair in the population; see Eqs. (a) and (b)) until the process completes after 200 iterations (the stopping criterion). As a result of the minimization process, a first optimum primer pair is selected.
Restrictions for the first optimization (first primer design in a PCR)
At the end of this stage, the last primer pair and its optimum parameters (tam, Ta, and Tm) are determined (again, the optimization process was carried out employing the Mult-PSOS algorithm). The restrictions applied in the first optimization are defined below:
Tmd between Bf and Br is defined as Tmd(Pt) = 0, if the absolute value indicated in Eq. (12) is ≤5, otherwise Tmd(Pt) = 1 [28].
where ΔH and ΔS are the enthalpy and entropy changes of the primer, respectively. Large negative values indicate a stable structure or undesirable loops (such as loops proximal to the 3’ end).
Additionally, the following restrictions are taken into account in this work:
3) Multiplex-PCR optimization process: second optimization. In the second optimization step, the fitness given by Eq. (B) is minimized (for the second, third, and n-th primers) by applying the restrictions defined in the first optimization step and using, as always, 10 pairs of primers and 200 iterations. At this point, the first primer pair (obtained in the first optimization step) and its optimum parameters (the tam, Ta, and Tm values) are now used as restrictions for all the other primers (see Eqs. (5)-(18)).
In this manner, the process continues until the last primer pair is obtained. This strategy guarantees the rapid mathematical convergence of the method. Equation (B) represents the Multiplex-PCR primer design strategy proposed in this work. The Mult-PSOS was coded in MATLAB (Fig. 2) and run on an Intel Core 2 Duo – 1.33 GHz processor with 2.00 GB of RAM. The results obtained with the proposed algorithm were compared with the primers described by Ref. [28] and the oligonucleotides obtained using the Primer-BLAST tool (http://www.ncbi.nlm.nih.gov/tools/primer-blast/).
Fig. 2.Basic flow diagram of the Mult-PSOS algorithm.
Restrictions for the primer design in a Multiplex-PCR (second optimization step)
For a Multiplex-PCR where multiple primer pairs must be selected, the first pair of oligonucleotides is selected by minimizing Eq. (A). Based on the parameters of the selected primers, the tam, Ta, and Tm values are fixed as initial values for all the primers. In a second optimization, the fitness given by Eq. (B) is recalculated with the restrictions that defined the identification of the second primer pair and so on. These restrictions are defined as follows:
4) Stopping criterion. In a Mult-PSOS process, each individual of the particle swarm is defined through the vector given by Eq. (1) and its fitness is evaluated by Eqs. (A) in the first optimization step or (B) in the second step. The optimization process described by Eqs. (A) and (B), with its restrictions, continues until the defined stopping criterion is met. In this case, we use 200 iterations for the PSO and 200 iterations for the Simplex algorithm.
Bacteria and Gene Sequences
DNA sequences from genes of uropathogenic bacteria consigned in the GenBank (http://www.ncbi.nlm.nih.gov/genbank) database were used as templates for in silico primer design experiments by PSOS and for in vitro PCR assays (Table 1). The selection of the bacteria and the genes was made according to Ref. [20]: E. coli FimH gene (GenBank access AJ225176); K. pneumoniae FimK gene (GenBank access EU315065.1); P. aeruginosa ETA gene (GenBank access K01397.1); and P. mirabilis UreC gene (GenBank access AM942759). For the Multiplex-PCR design in this study, two uropathogenic bacteria were added: S. saprophyticus (16S gene, GenBank access NR_074999.1) and E. faecalis (16S gene, GenBank access NR_074637.1) [10, 17].
Table 1.ATCC bacterial lineages and genes used as a model for the Multiplex-PCR primer design by Mult-PSOS.
Bacterial Growth and Preservation
For the in vitro experiments, the gram-negative bacteria (E. coli, K. pneumoniae, P. aeruginosa, and P. mirabilis) were grown in Luria-Bertani broth and agar. Muller-Hinton broth and agar were used for the growth of the Gram-positive bacteria (S. saprophyticus and E. faecalis). Microorganisms in mid-log phase (OD620 = 0.4) were preserved in 10% glycerol at -80℃ pending further use.
DNA Extraction
For DNA extraction, all the microorganisms (except E. coli) in mid-log phase growth were pelleted and washed two times with sterile 1× PBS. The washed and pelleted cells were resuspended in 2 volumes of PrepMan Ultra (PMU) reagent (Applied Biosystems). Then, cells in suspension were disrupted by the freeze-thaw technique (3 min in a liquid nitrogen bath followed by 10 min at 95℃). The DNA extract was obtained by ultracentrifugation at 14,000 rpm for 2 min and preserved at -20℃ with RNase A (10 μg/ml) until the PCR experiments. DNA extraction for E. coli was obtained directly by the colony boiling method. The genomic DNA quality was confirmed on a 1% agarose gel stained with SYBR Green and quantified using a NanoDrop ND-1000 UV-Vis spectrophotometer (Thermo Scientific).
Wet Experiments of Multiplex-PCR
The optimal PCR primers designed were synthetized and used for the Multiplex-PCR assays. Table 1 summarizes the sequence of the selected primer pairs (obtained by minimizing the fitness functions given by Eqs. (A) and (B) subject to the restrictions in Eqs. (1) to (19)) for the amplification of the specific gene for each uropathogen. For the Multiplex-PCRs, we used an amplification mixture containing 4 μl of template (50-200 ng DNA), 0.4-0.6 mM of each primer, 1.5 U of GoTaq Flexi DNA Polymerase (Promega), and 1.5 mM MgCl2 for a final volume of 50 μl. Cycling reactions were performed under the following conditions: one cycle of 4 min at 94℃, followed by 35 cycles of 60 sec at 94℃, 50 sec at 53℃, and 70 sec at 72℃. Finally, one post-amplification step of 10 min at 72℃ was carried out. The amplicon quality and size were verified on 2% agarose gels and revealed by SYBR Gold stain (Molecular Probe, Invitrogen Life Technologies) with 1 Kb and 100 pb DNA Ladder (Thermo Scientific) as a molecular weight marker. The amplicons were ligated to the cloning vector pCR 2.1 TOPO (Invitrogen Life Technologies). The sequence of the insert was confirmed by DNA sequencing and a search by homology was then performed using the BLASTn algorithm (http://www.ncbi.nlm.nih.gov/BLAST) [4]. For the verification of the primer specificity, extracted DNA from Serratia marcescens, Salmonella spp., and Staphylococcus aureus were used as negative controls in the Multiplex-PCR.
Results
Primer Design for Multiplex-PCR
In this study, a set of Multiplex-PCR primers for uropathogenic bacteria detection were designed using a Mult-PSOS tool. In Tables 2 and 3, the sequences and some of the theoretical primer design parameters calculated by Gene Runner 3.01 are presented for the group of oligonucleotides. All the aforementioned design restrictions were evaluated for the 12 primers selected by PSOS. The conditions derived were evaluated using Eqs. (A), (B), and (1)-(18). The final fitness checks for the PCR and Multiplex-PCR were made using Eqs. (A) and (B), respectively. The fitness values thus obtained are presented in Table 4.
Table 2.*Thermodynamic Tm, calculated with 250 mMol of salt concentration (Na+). F: forward; R: reverse. Ec: E. coli. Kp: K. pneumoniae. Pa: P. aeruginosa. Pm: P. mirabilis. Ss: S. saprophyticus. Ef: E. faecalis.
Table 3.F: forward; R: reverse. Ec: E. coli. Kp: K. pneumoniae. Pa: P. aeruginosa. Pm: P. mirabilis. Ss: S. saprophyticus. Ef: E. faecalis.
Table 4.F: forward; R: reverse. Ec: E. coli. Kp: K. pneumoniae. Pa: P. aeruginosa. Pm: P. mirabilis. Ss: S. saprophyticus. Ef: E. faecalis. * http://www.ncbi.nlm.nih.gov/tools/primerblast/.
In Vitro Multiplex-PCR Assays
The primer sequences predicted by the hybrid algorithm were synthetized and used for in vitro replication assays. The Ta(Pt) and la(Pt) parameters calculated in silico were used for calibration of the cycle conditions (number of cycles, temperature, and time for each of the PCR phases). The Multiplex-PCR was designed for the amplification of seven genes from six pathogens highly prevalent in the causation of UTIs. The electrophoretic analysis on the agarose gel (2%) showed a good resolution between bands with amplicons ranging from 235 to 740 bp, and a minimum difference of 50 bp among them. The electrophoretic profile of the Multiplex-PCR showed an adequate performance of the amplification. Densitometry analysis of the amplicon bands revealed a DNA concentration ranging from 67 to 128ng/ μl for the E. coli FimH gene and E. faecalis 16S gene, respectively (Fig. 3).
Fig. 3.Agarose gel electrophoresis (2%) for uropathogenic Multiplex-PCR products amplified with the primer sequences designed by Mult-PSOS. 1) 1 kb molecular weight ladder (Thermo Scientific). Ec: E. coli (FimH), 235 bp. Kp: K. pneumoniae (FimK), 316 bp. Pa: P. aeruginosa (ETA), 505 bp. PmZ: P. mirabilis (ZapA), 571 bp. PmU: P. mirabilis (UreC), 649 bp. Ss: S. saprophyticus (16S), 741 bp. Ef: E. faecalis (16S), 440 bp.
Benchmarking
For the benchmarking analysis, the parameters and fitness values (from Eqs. (A) and (B)) obtained from the UTI Mult-PSOS primers were compared with two groups of oligonucleotides, one designed using the online tool primer-BLAST (http://www.ncbi.nlm.nih.gov/tools/primerblast/) and the other described previously by Padmavathy et al. [20]. Five genes from four uropathogenic bacteria (E. coli (FimH), K. pneumoniae (FimK), P. aeruginosa (ETA), and P. mirabilis (ZapA and UreC)) were used as GD. For the primer-BLAST estimation, the PCR product size was adjusted with 100 bp and 800 bp as the minimum and maximum lengths, respectively. The first primer pair generated for each DNA template (gene) by primer-BLAST and all the primer sequences described by Ref. [20] were subjected to oligonucleotide analysis by Gene Runner 3.01. The good performance of the Mult-PSOS can be verified by the comparison of the fitness values calculated for the three groups of primers. The results of the check functions from Eqs. (A) and (B) are shown in Tables 3 and 4.
Discussion
In Multiplex-PCR assays, a vast number of parameters play a critical role in the efficiency of the DNA target amplification. These parameters include (i) optimal nucleotides and enzyme concentration in the reaction; (ii) quality and amount of extracted DNA template; (iii) number of PCR cycles; (iv) quantity of primers (inverse to amplicon size); and (v) salt concentration [26]. Nevertheless, optimal primer design is the most relevant factor in Multiplex-PCR performance.
Currently, software is available for the solution of the Multiplex-PCR primer design problem, but, in general, these options are non-free closed algorithms that do not take into account all the necessary restrictions for the adequate prediction of these sequences. Previously, a Multiplex-PCR primer design strategy based on a genetic algorithm for tuberculosis diagnosis has been described [29]. However, in that work, the AG control parameters known as mutation probability (mp) and crossover probability (cp) must be defined by the user. The designation of these initial parameters requires a priori knowledge of AG, which constitutes a serious inconvenience for the general application of this computational tool by non-AG experts. It is worth highlighting that the AG results are dependent on these values. In this work, we implemented an adjusted hybrid algorithm of Particle Swarm Optimization-Simplex [7], the Mult-PSOS, in order to obtain an autoconfigured computational strategy, which allows the rapid and reliable prediction of a set of oligonucleotides for simultaneous amplification of multiple genes in a single PCR (Multiplex-PCR). This strategy consists of the selection of a first primer pair (and subsequent pairs), following all the restrictions required for an adequate primer design. In the optimization process, the parameters and restrictions were defined and weighted in accordance with their relevance in the primer performance (specificity and sensitivity) in a Multiplex-PCR. In this way, the most penalized restriction violations were the appearance of the primer sequence more than once in a target DNA (Uni(Pt)), and the size difference among amplicons (ladM(Pt)). The second highest penalizing value was given to the factors that affect the sensitivity of the oligonucleotides, such as hairpin loop structures (self-complementary sequences) and Ta and Tm differences greater than 5℃ among primers.
To evaluate the accuracy of the method, an in vitro test with the Mult-PSOS predicted primers was made taking as targets five gene sequences, previously described in a Multiplex-PCR for simultaneous identification of the bacteria most prevalently implicated in UTI [20]. Two additional genes were added for the Multiplex-PCR design from S. saprophyticus and E. faecalis, due to their prevalence in UTI etiology [10, 12, 17]. Using the restriction parameters, 14 oligonucleotide sequences were selected by the algorithm as the optimal sense and anti-sense primers. In the in vitro Multiplex-PCR, amplification of all the target sequences was achieved in only two tests, validating the good performance of the Mult-PSOS. It is worth mentioning that in the first assay, five of the six amplicons (K. pneumoniae, P. aeruginosa, P. mirabilis, S. saprophyticus, and E. faecalis) were successfully replicated, and the absence of one of the target genes (E. coli) was attributed to reduced molecular weight and to poor quality of the DNA template. A second test of Multiplex-PCR was performed with the same mixture and cycle conditions as the previous assay, but with E. coli genetic material obtained by the boiling-DNA extraction method. With this modification, 100% of the amplicons were amplified.
In general, the replication efficiency was proportional to the molecular weight of the amplicon, with the lowest rate being associated with the FimH gene from E. coli (235 bp). These kinds of variations are very common in Multiplex-PCR assays. Moreover, in this work, they were principally associated with the quality of the DNA template and the primer availability in the reaction [24]. The specificity of the primer sequences was verified in silico by nucleotide BLAST (http://blast.ncbi.nlm.nih.gov/Blast.cgi?PROGRAM=blastn&BLAST_SPEC=WGS&BLAST_PROGRAMS=megaBlast&PAGE_TYPE=BlastSearch) against human, bacteria, fungal, and protozoan databases. The results showed nonsignificant cross homology regions in the primers. The specificity of the Mult-PSOS predicted primers was also confirmed in vitro, using DNA samples from S. marcescens, Salmonella spp., and S. aureus without any band amplification.
The primer sequences designed by the Mult-PSOS satisfied all the mandatory restrictions (see Eqs. (1)-(4)) for the adequate performance of Singletex and Multiplex-PCRs (Eqs. (5)-(18)). The adequate adjustment of the parameters was confirmed by the Gene Runner 3.01 free software (http://www.generunner.net/) and the online tool Multalin (http://multalin.toulouse.inra.fr/multalin/multalin.html). In accordance with the results, we can conclude that Mult-PSOS is an autoconfigured algorithm that works in a sequential manner, facilitating the efficient prediction and selection of the primer sequences for the simultaneous replication of up to seven genes. The optimization of the reaction conditions for the adequate amplification of all amplicons was accomplished in a single step (with the correct amount of DNA template and primers), decreasing significantly the need for the multiple testing by trial and error. This tool can be used for the systematic design and implementation of Multiplex-PCRs for clinical and forensic applications and can be employed by non-expert users in computational techniques or primer design problems.
References
- Abd-Elsalam KA. 2003. Bioinformatic tools and guideline for PCR primer design. Afr. J. Biotechnol. 2: 91-95. https://doi.org/10.5897/AJB2003.000-1019
- Abdeldaim GM, Strålin K, Korsgaard J, Blomberg J, Welinder-Olsson C, Herrmann B. 2010. Multiplex quantitative PCR for detection of lower respiratory tract infection and meningitis caused by Streptococcus pneumoniae, Haemophilus influenzae and Neisseria meningitidis. BMC Microbiol. 10: 310. https://doi.org/10.1186/1471-2180-10-310
- Allawi HT, SantaLucia J. 1997. Thermodynamics and NMR of internal GT mismatches in DNA. Biochemistry 36: 10581-10594. https://doi.org/10.1021/bi962590c
- Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. 1990. Basic local alignment search tool. J. Mol. Biol. 215: 403-410. https://doi.org/10.1016/S0022-2836(05)80360-2
- Amghalia E, Nagi AA, Shamsudin MN, Radu S, Rosli R, Neela V, Rahim RA. 2009. Multiplex PCR assays for the detection of clinically relevant antibiotic resistance genes in Staphylococcus aureus isolated from Malaysian hospitals. Res. J. Biol. Sci. 4: 444-448.
- Apte A, Daniel S. 2009. PCR primer design. Cold Spring Harb. Protoc. 2009: pdb-ip65. https://doi.org/10.1101/pdb.ip65
- Begambre O, Laier JE. 2009. A hybrid particle swarm optimization - simplex algorithm (PSOS) for structural damage identification. Adv. Eng. Softw. 40: 883-891. https://doi.org/10.1016/j.advengsoft.2009.01.004
- Boehme CC, Nabeta P, Hillemann D, Nicol MP, Shenai S, Krapp F, et al. 2010. Rapid molecular detection of tuberculosis and rifampin resistance. N. Engl. J. Med. 363: 1005-1015. https://doi.org/10.1056/NEJMoa0907847
- Cheng YH, Chuang LY, Yang CH. 2012. Fuzzy adaptive particle swarm optimization for confronting two-pair primer design. In: Proceedings of the International MultiConference of Engineers and Computer Scientists. IMECS 2012 1.
- Foxman B. 2002. Epidemiology of urinary tract infections: incidence, morbidity, and economic costs. Am. J. Med. 113: 5-13. https://doi.org/10.1016/S0002-9343(02)01054-9
- Gopinath K, Singh S. 2009. Multiplex PCR assay for simultaneous detection and differentiation of Mycobacterium tuberculosis, Mycobacterium avium complexes and other Mycobacterial species directly from clinical specimens. J. Appl. Microbiol. 107: 425-435. https://doi.org/10.1111/j.1365-2672.2009.04218.x
- Grabe M, Bishop MC, Bjerklund-Johansen TE, Botto H, Cek M, Lobel B, et al. 2010. Guidelines on Urological Infections. European Association of Urology. 110.
- Gupta S, Bandyopadhyay D, Paine SK, Gupta S, Banerjee S, Bhattacharya S, Bhattacharya B. 2010. Rapid identification of Mycobacterium species with the aid of multiplex polymerase chain reaction (PCR) from clinical isolates. Open Microbiol. J. 4: 93. https://doi.org/10.2174/1874285801004010093
- Johnson JR, Stell AL. 2000. Extended virulence genotypes of Escherichia coli strains from patients with urosepsis in relation to phylogeny and host compromise. J. Infect. Dis. 181: 261-272. https://doi.org/10.1086/315217
- Kennedy J, Eberhart R. 1995. Particle swarm optimization. Proc. IEEE Int. Conf. Neural Networks 4: 1942-1948. https://doi.org/10.1109/ICNN.1995.488968
- Liarte DB, Murta SM, Steindel M, Romanha AJ. 2009. Trypanosoma cruzi: multiplex PCR to detect and classify strains according to groups I and II. Exp. Parasitol. 123: 283-291. https://doi.org/10.1016/j.exppara.2008.12.005
- Martineau F, Picard FJ, Ménard C, Roy PH, Ouellette M, Bergeron MG. 2000. Development of a rapid PCR a ssay specific for Staphylococcus saprophyticus and application to direct detection from urine samples. J. Clin. Microbiol. 38: 3280-3284.
- Mullis KB, Faloona FA. 1987. Specific synthesis of DNA in vitro via a polymerase-catalyzed chain reaction. Methods Enzymol. 155: 335. https://doi.org/10.1016/0076-6879(87)55023-6
- Nelder JA, Mead R. 1965. A simplex method for function minimization. Comp. J. 7: 308-313. https://doi.org/10.1093/comjnl/7.4.308
- Padmavathy B, Vinoth KR, Amee P, Deepika SS, Vaidehi T, Jaffar ABM. 2012. Rapid and sensitive detection of major uropathogens in a single-pot multiplex PCR assay. Curr. Microbiol. 65: 44-53. https://doi.org/10.1007/s00284-012-0126-3
- Rachlin J, Ding C, Cantor C, Kasif S. 2005. MuPlex: multiobjective multiplex PCR assay design. Nucl. Acids Res. 33: 544-547. https://doi.org/10.1093/nar/gki377
- Reddington K, O’Grady J, Dorai-Raj S, Maher M, Van Soolingen D, Barry T. 2011. Novel multiplex real-time PCR diagnostic assay for identification and differentiation of Mycobacterium tuberculosis, Mycobacterium canettii, and Mycobacterium tuberculosis complex strains. J. Clin. Microbiol. 49: 651-657. https://doi.org/10.1128/JCM.01426-10
- Shen Q, Shi W-M, Kong W. 2008. Hybrid particle swarm optimization and tabu search approach for selecting gene for tumor classification using gene expression data. Comput. Biol. Chem. 32: 53-60. https://doi.org/10.1016/j.compbiolchem.2007.10.001
- Sibley CD, Peirano G, Church DL. 2012. Molecular methods for pathogen and microbial community detection and characterization: current and potential application in diagnostic microbiology. Infect. Genet. Evol. 12: 505-521. https://doi.org/10.1016/j.meegid.2012.01.011
- Stankowska D, Kwinkowski M, Wieslaw K. 2008. Quantification of Proteus mirabilis virulence factors and modulation by acylated homoserine lactones. J. Microbiol. Immunol. Infect. Dis. 41: 243-253.
- Tsalik EL, Jones D, Nicholson B, Waring L, Liesenfeld O, Park LP, Woods CW. 2010. Multiplex PCR to diagnose bloodstream infections in patients admitted from the emergency department with sepsis. J. Clin. Microbiol. 48: 26-33. https://doi.org/10.1128/JCM.01447-09
- Weissensteiner T, Nolan T, Bustin SA, Griffin HG, Griffin A. 2010. PCR Technology: Current Innovations, 2nd Ed. CRC Press, Boca Raton, FL.
- Wu JS, Lee C, Wu CC, Shiue YL. 2004. Primer design using genetic algorithm. Bioinformatics 20: 1710-1717. https://doi.org/10.1093/bioinformatics/bth147
- Wu LC, Horng JT, Huang HY, Lin FM, Huang HD, Tsai MF. 2007. Primer design for multiplex PCR using a genetic algorithm. Soft Comput. 11: 855-863. https://doi.org/10.1007/s00500-006-0137-8
- Xiao L. 2010. Molecular epidemiology of cryptosporidiosis: an update. Exp. Parasitol. 124: 80-89. https://doi.org/10.1016/j.exppara.2009.03.018
- Yang CH, Cheng YH, Chuang LY, Chang HW. 2009. Genetic algorithm for the design of confronting two-pair primers, pp. 242-247 In: Bioinformatics and BioEngineering, 2009. BIBE'09. Ninth IEEE International Conference.
- Yang CH, Lin MC, Chuang LY. 2010. Primer design for the PCR in methylation studies using PSO. Proceedings of the International MultiConference of Engineers and Computer Scientists. 1.
- Yang CH, Chang HW, Ho CH, Chou YC, Chuang LY. 2011. Conserved PCR primer set designing for closely-related species to complete mitochondrial genome sequencing using a sliding window-based PSO algorithm. PloS One 6: 17729. https://doi.org/10.1371/journal.pone.0017729
Cited by
- A new multiplex PCR for species-specific diagnosis of human candidiasis vol.37, pp.2, 2015, https://doi.org/10.7705/biomedica.v37i2.3202
- Development and application of multiplex PCR method for simultaneous detection of seven viruses in ducks vol.15, pp.None, 2019, https://doi.org/10.1186/s12917-019-1820-1