Surface co-display of cellulases with synergy in Pichia pastoris (Part V)

Step 8. Genomic PCR analysis

Although the reason why the transparent zone method turns out invalid in our project is not fully understood, the project proceeds with other method. Genomic PCR (Polymerase chain reaction) analysis is used for further screening.

Genomic PCR is PCR using genome as template. This method is used to verify two things: 1) the cellulase gene is truly integrated into the genome of the transformants; 2) the cellulase gene integrated into the genome of our transformants has correct sequence. Genome DNA isolation and genomic PCR are carried out according to standard protocol from Molecular Cloning: A Laboratory Manual (Sambrook J, et al. 2001).

Isolated genome DNA is firstly identified by agarose gel electrophoresis (Figure 1), and secondly used as template in subsequent genomic PCR reaction.

Figure 1. Identification of genome DNA

Figure 1. Identification of genome DNA

The primers we used in genomic PCR reaction system are specific to the cellulase gene (Table 1). If the cellulase gene is truly integrated into the genome of the transformants, the PCR product should be the same size of the corresponding cellulase gene. The size of the PCR product is identified by agarose gel electrophoresis (Figure 2). In negative control, sterile water is used to replace template in the PCR reaction system to avoid the disturbance of primer dimers.

Table 1. Primers used in genomic PCR reaction system

Table 1. Primers used in genomic PCR reaction system

Figure 2. Identification of genome PCR products

Figure 2. Identification of genome PCR product

Transformant with positive genomic PCR result is called recombinant (an organism carrying foreign DNA in its genome via genetic recombination). PCR products of recombinants are sent for sequencing. According to the sequencing result, recombinants carry the correct cellulase gene are used for fermentation.

Step 9. Continuous Fermentation

After resistance selection and genomic PCR analysis, recombinants selected are inoculated into 20mL flasks with 5mL YPD medium, and cultivated at 28℃ 200r/min over night for activation. 1mL bacteria culture is then inoculated into 500mL flasks with 50mL BMMY induction medium for fermentation. The cells are induced for 120h at 28℃ 200r/min. 500uL methanol (1%) is added to the medium every 24h to maintain induction. Aseptic techniques are required in the entire process of fermentation.

Surface co-display of cellulases with synergy in Pichia pastoris (Part IV)

Step 6. Screen for the His+ Mut+ transformants

P.pastoris GS115 transformants are plated on MD (Minimal Dextrose) plates after electroporation. Incubate the plates at 30℃ for 2 days or longer. MD plate contains no histidine, thus can select out strains containing His+. Normally transformants with the phenotype of His+ Mut+ can generate single colonies within three days (Figure 1).

Figure 1. Transformants with the phenotype of His+ Mut+ generate single colonies on MD after incubation for two days

Figure 1. Transformants with the phenotype of His+ Mut+ generate single colonies on MD after incubation for two days

Antibiotic resistance genes in the backbone of expression vectors confer resistance to antibiotic in Pichia pastoris. Strains carry the backbone of pPIC9K and pPICZα are resistant to G418 and Zeocin respectively.The copy number of transformed DNA integrated into the genome of Pichia pastoris ranges randomly from one to multiple. Generally, strains with more copies of integrated DNA have higher enzyme activity as well as higher resistance to antibiotic.

To screen for strains with high enzyme activity, we use sterile toothpicks to patch 200 His+ Mut+ transformants of each kind on new plates (Figure 2). New plates contain medium concentration of antibiotics and are free from selective pressure of lacking histidine. Media and antibiotics are chosen according to the type of plasmid backbone the P.pastoris strains carry. Strains carry pPIC9K backbone are patched on YPD plate containing 2mg/mL G418; strains carry pPICZα are patched on YPDS plate containing 200ug/mL Zeocin. Incubate the plates at 30℃ for 2 days or longer. Transformants with relatively high copy number of transformed DNA yield large circular colonies (e.g. colony 31 in figure 2), while others remain small dots (e.g. colony 70 in figure 2).

Figure 2. patched YPD plate containing 2mg/mL G418 after incubation for 2 days

Figure 2. patched YPD plate containing 2mg/mL G418 after incubation for 2 days

Step 7. Screen for transformants with high enzyme activity

It’s laborious and time consuming to do fermentation and enzyme activity assay, so further screenings are required to screen for transformants with high enzyme activity.

Transformants obtained after resistance selection are patched on new BMMY (medium for fermentation) plates containing substrate to corresponding enzyme; we use xylan (birchwood), common substrate of all the four cellulases we are studying. Xylose is the decomposition product of xylan. Xylan can be stained by Congo red, a dye, while xylose can’t. Congo red is added into the medium with the terminal concentration of 5mg/100mL. If the strain expresses functional cellulase, the colony will yield a circular transparent zone around (Figure 3). The strain expresses cellulase with higher activity yields transparent zone with larger diameter.

Figure 4. schematic of a Congo red screening plate.

Figure 3. Schematic of a Congo red screening plate.

Patched Congo red screening plates are incubated at 30℃. The expression of cellulase is under the regulation of inducible promoter, PAOX1. 200uL methanol is added to the plate everyday to induce the expression of cellulase.

Figure 4. Congo red screening plates after five days of induction and incubation

Figure 4. Congo red screening plates after five days of induction and incubation

Unfortunately, none of our transformants yield any transparent zone after five days of induction and incubation (Figure 4); however, it doesn’t necessarily mean our transformants can’t express functional cellulase. There are many possible reasons for the failure of yielding transparent zones: 1) cellulases have many possible substrates, while xylan is not the most suitable one; 2) methanol is not the only carbon source in our BMMY medium. Other carbon sources are used prior to methanol by P.pastoris GS115, which prevents the transformants from being induced. 3) The activity of cellulases expressed by our transformants is below the threshold of the transparent zone method.

Composition of media used in our project

MD: 1.34 % YNB, 4×10−5 % biotin, 2 % dextrose, and 1.5 % agar

YPD: 1 % yeast extract, 2 % peptone, and 2 % glucose

YPDS: 1 % yeast extract, 2 % peptone, 2 % glucose, and 1M sorbitol

BMGY/BMMY: 1 % yeast extract, 2 % peptone, 100 mM potassium phosphate,                                        pH 6.0, 1.34 % yeast nitrogen base [YNB], 4×10−5 % biotin, and                                    1 % glycerol or 0.5 % methanol

Surface co-display of cellulases with synergy in Pichia pastoris (Part III)

Step 3. Determine the sequence of the plasmid by sequencing

There’s a chance of gene mutation during the process of vector construction. The precise order of nucleotides within our vector is determined by DNA sequencing. Plasmid samples are sent to TSINGKE Biological Technology (other companies with sequencing services are also okay). We get reports of the sequences of our samples; and use BLAST to do sequence alignment.

The BLAST result of RuXyn1

The BLAST result of RuXyn1

Our BLAST result of RuXyn1 is shown in the image above, indicating that the sequence of RuXyn1 in our sample is correct. The sequencing results of the other three enzyme genes (Xyn, BglX, Cel5A) are processed in the same way. If the sequencing result shows that the gene mutated, the vector should be reconstructed until you get the correct sequence.

Step 4. Linearize the plasmid to stimulate recombination

Linearization is to use restriction enzyme to digest and linearize the circular plasmid. Linearized DNA can generate stable transformants of Pichia pastoris via homologous recombination between the transforming DNA and regions of homology within the genome. Such integrants show extreme stability in the absence of selective pressure even when present as multiple copies.

The type of restriction enzyme used in linearization is determined both by the type of the host cell (Pichia Pastoris strain GS115 or KM71) and the phenotype of transformants (His+ Mut+ or His+ Muts). His+ represents the “Histidine synthesizing” phenotype, Mut+ represents the “Methanol utilizing” phenotype and Muts refers to the “Methanol utilization slow” phenotype.

The Pichia Pastoris strain GS115 is used as host cell in our project. GS115 has a mutation in its histidinol dehydrogenase gene (his4) that prevents it from synthesizing histidine (His). Both pPIC9K and pPICZα contain the HIS4 gene that complements his4 in the host cell (His+), thus transformants are selected for their ability to grow on histidine-deficient medium. We use Sal I and Sac I to linearize our expression vectors, pPIC9K (Figure 1) and pPICZα (Figure 2) respectively, according to Pichia Expression Kit (page32, Invitrogen).

Linearization of pPIC9K by Sal I

Figure 1. Linearization of pPIC9K by Sal I

Linearization of pPICZα by Sac I

Figure 2. Linearization of pPICZα by Sac I

Linearized plasmids are identified by agarose gel electrophoresis. Circular DNA runs faster than linearized DNA with the same primary structure. Here I take the linearization of pPIC9K-RuXyn as an example (Figure 3). We use the plasmid without enzyme digestion (pPIC9K-RuXyn) as negative control (R0), which runs ahead of the linearized DNA (RuXyn) in the agarose gel.

Figure 3. Identification of linearization of pPIC9K-RuXyn

Figure 3. Identification of linearization of pPIC9K-RuXyn

Step 5. Electroporation for the first round

Electroporation is to use electrical field to introduce the expression vector into the yeast cell. One kind of expression vector is introduced into the cell in one round of electroporation. Two rounds of electroporation are required, for we want to co-transform two enzyme genes into Pichia pastoris and co-display the two enzymes.

We use the method in Pichia Expression Kit (page 77, Invitrogen) to prepare electro competent cells and electroporation. pPIC9K-RuXyn, pPIC9K-BglX, pPICZα-Xyn and pPICZα-Cel5A are introduced into P. pastoris GS115 to construct GS115-9K-RuXyn, GS115-9K-BglX, GS115-Zα-Xyn and GS115-Zα-Cel5A. pPIC9K and pPICZα, both carry Sed1p only, are also introduced into P. pastoris GS115 to construct GS115-9K and GS115-Zα, which function as the negative control for Immunofluorescence Assay and Flow Cytometry analysis.

Surface co-display of cellulases with synergy in Pichia pastoris (Part II)

Among all the cellulases targeting different chemical bonds in lignocellulose, our group found two sets with synergy in the literature. Synergy in this context is a property that the two enzymes can enhance the activity of each other when functioning together. The first set is RuXyn1 (beta-D-xylosidase) and Xyn (beta-xylanase); and the second set is BglX (beta-glucosidase) and Cel5A (endoglucanase). The two sets of cellulases are responsible for degrading hemicellulose and cellulose respectively.

As for lignin, an aromatic polymer, pretreatment is required to remove this lignin from cellulose and hemicellulose. Without the protection of lignin, cellulose and hemicellulose will lose their resistance to enzymes and can be more easily degraded. Previous studies gave many options of enzymes used for pretreatment, such as Lip (Lignin peroxidase), MnP (Mn peroxidase) and GLOX (Glyoxal oxidase). So, our project focused on the degradation of cellulose and hemicellulose.

The process of constructing the surface co-display system is as follows.

Step 1. Choose a proper host cell and the corresponding anchor protein

As cellulases are all originated from fungi, we need a eukaryotic expression system. Pichia pastoris and Saccharomyces cerevisiae are both modern frequently used eukaryotic expression systems. We chose Pichia pastoris in our project, because it has several advantages over Saccharomyces cerevisiae: 1) it’s modification ability is stronger, capable of peptide folding, glycosylation, methylation and acetylation; 2) it contains promoter PAOX1, the regulation of which is the most efficient and strictest among the promoters discovered currently; 3) exogenous genes transformed into Pichia pastoris integrate into its genome, expressing steadily; while genes transformed into Saccharomyces cerevisiae  exist as plasmid that is very likely to be rejected by the cell after several generations.

pPIC9K and pPICZα are both good expression vectors in Pichia pastoris, but not all the cells transformed with the vector can successfully express the target protein on its surface. To reach a high activity of our whole-cell catalyst, we want the display ratio (the amount of cells expressing the target protein on the surface/ total cells) as high as possible, which depends on both the expression vector and the anchor protein. In our lab, the accessible candidates are Cwp2p, Sed1p and α-agglutinin. We expressed the candidates fused with EGFP (Enhanced Green Fluorescent Protein) based on pPIC9K and pPICZα respectively; and use FCM (Flow Cytometry) to test their display ratio. From our experiments we found that Sed1p is the best, with the display ratio of 99.50% in pPIC9k and 98.36% in pPICZα.

Step 2. Construct the expression vectors

pPICZα-Cel5A/ Xyn-sed1

Figure 1. pPICZα-Cel5A/ Xyn-sed1

Figure 2. pPIC9K-RuxynBglX-sed1

Figure 2. pPIC9K-RuxynBglX-sed1

The plasmid profiles are shown in figure 1 and 2. α factor signal is a secretion signal; PAOX1 is an induced promoter (functions only in the presence of specific inductor) sensitive to methanol. EcoR I, Mlu I and Not I are restriction sites used for the vector construction. We add a tag (FLAG tag or HA tag) to the target gene for the detection of the target protein.

Tags have their specific primary antibody. The primary antibody can be recognized by the secondary antibody through antigen-antibody reaction. The secondary antigen is labeled by fluorescent. Thus we can detect the target protein by detecting the fluorescence on the cell surface under the fluorescence microscope. To distinguish the two target proteins, we use two different tags (FLAG tag and HA tag). Different tags correspond to different primary and secondary antibodies, emitting fluorescence of different color.

Figure 3. Surface Co-display Systems

Figure 3. Surface Co-display Systems

Figure 3 is the schematic of the two surface co-display systems. pPIC9K has resistance against ampicillin and kanamycin, while pPICZα is resistant to zeocin. Notice that two enzyme genes co-expressed in one cell type are constructed in two different plasmid backbones; we can easily obtain the strain with both two expression vectors by resistance screening assay.

According to the plasmid backbone and the inserted gene, we name the four expression vectors pPIC9K-RuXyn, pPIC9K-BglX, pPICZα-Xyn and pPICZα-Cel5A.

Surface co-display of cellulases with synergy in Pichia pastoris (Part I)

From April 2013 to May 2014, I worked as a research assistant in the Key Laboratory of Molecular Biophysics of Ministry of Education, HUST; I led my independent research team to build two surface co-display systems of cellulases in Pichia pastoris.

Lignocellulose refers to plant dry matter (biomass). It is the combination of three different polymers (cellulosehemicellulose and lignin); both cellulose and hemicellulose are carbohydrate polymers and they bind tightly to lignin (an aromatic polymer).

Lignocellulose is the most abundant resource of biomass on the earth. Unfortunately, people hardly use it. Why? Because lignocellulose is formed by many different monomers via different chemical bonds; while one enzyme can only break one kind or one family of chemical bonds. Since monosaccharides are the primary materials for bio-fuels, such as bio-ethanol, a large number of enzymes are required for the complete degradation.

Scientists do have discovered many kinds of cellulases in nature; and they are in charge of breaking different chemical bonds respectively. You may ask, “Why not we just add all the enzymes needed?” This is truly a solution, but the purification of so many enzymes is very expensive, laborious and time-consuming. So traditionally, people just discard or burn lignocellulose (e.g. dead wood, straw and bran) as waste.

Surface display of enzymes is a novel solution to the problem. Surface display technique is to express the target protein as a fusion protein with the anchor (protein originally located on the surface of the cell), thus the target protein can be located on the surface of the cell rather than secreted into the medium. The beauty of the technique is that the whole cell will act as the catalyst; there’s no need to purify the target protein (enzyme). Considering the low recovery of purification, whole-cell catalyst is efficient as well as convenient. Moreover, the restoration of cells is a lot easier than that of proteins.

Surface Display in Pichia pastoris

Surface Display in Pichia pastoris

Our team added a GS linker (a short soft peptide consisting of Glycine and Serine) between the target protein and the anchor to ensure that there’s no interference in the folding and the function of the two proteins.

Remember that many kinds of proteins located on the cell surface are candidates of the anchor; the specific location of the anchor is not necessarily in the cell wall as shown in the image above, it also can be located in the cytomembrane or the intermembrane space.

Pull Down Assay: a technique to confirm interactions or to identify new interactions between proteins

When identifying the function of a protein, confirming the upstream and downstream events of a pathway, it is necessary to understand the interactions between the target protein and other proteins. Pull down assay is a good way to detect both known and unknown interactions.

In the pull down assay, the target protein is expressed with a fusion tag as the bait. The source of the prey depends on the purpose of the study; the prey can be either known or unknown, either one or more than one. There are several types of tags, such as GST (Glutathione S-transferase) tag, 6× polyhistidine tag and Halo tag; your choice depends on the source and the property of the target protein.

Here I will use GST-based pull down as an example.What we need is a bait (X protein fused with a GST tag), a prey (Y protein) and agarose beads containing reduced GSH (Glutathione).

Pierce Glutathione Agarose Beads

Pierce Glutathione Agarose Beads

The bead our lab uses is Thermo Scientific Pierce Glutathione Agarose; you can find all the parameters and instructions related to the bead  here.

Agarose Beads in tube

Agarose Beads in tube

To avoid contamination, we usually aliquot the beads into small tubes. The beads are stored in PBS (Phosphate Buffer Saline) to maintain the activity of GSH on the beads. The density of agarose is much higher than that of PBS, so the mixture will automatically separate into two parts when keeping still for about five minutes. Don’t forget to mix them again before using.

Schematic of GST-based pull down assay

Schematic of GST-based pull down assay

How does the assay work? GST has very high specific affinity to GSH, so the bait can be captured by the beads. When the prey added, if Y protein has specific interaction with X protein it will be pulled down as a Bead-GST-X-Y complex; if not, there will be no Y protein pulled down. The mixture should be washed every time after incubation to wash away non-specifically bound proteins. We will use SDS-PAGE and Western blotting to detect whether Y protein is pulled down.

Data are meaningless without a proper control. When the concentration of the prey is too high, non-specific binding will still exist even after several times of washing and elution; the Y protein can even be pulled down by GST protein! We use GST as the negative control of GST-X. If Western blotting shows that Y protein is only detected when using GST-X, it can serve as a proof of interactions between X and Y protein.

2013 iGEM (International Genetically Engineered Machine Competition)

iGEM, a premiere undergraduate Synthetic Biology competition, is one of the most important events in my college life; it is hosted by MIT (Massachusetts Institute of Technology) every year since 2005. Click here to know more about the competition.

In the competition, we organize a team (undergraduate students only) to design a project based on synthetic biology and realize the idea by ourselves. Most experiments are done through the summer vacation, but other things necessary to the competition, such as human practice, website designing, poster designing, will altogether take us about a year.

As a competition, its duration is really long and very time-consuming; we experienced many difficulties and failures and many teammates quit. I’m glad I didn’t give up and the team finally conquered all the obstacles. I represented the team to give the final presentation in November, 2013, MIT.

In 2013, our team made a biological oscillator, which can release propionate (recently reported to be able to relieve hypertension) periodically to cope with the morning surge of patients with hypertension.

Here is the design of the biological oscillator.

Biological Oscillator

Biological Oscillator

Hybrid promoter is the combination of the activation operator site from araBAD promoter and the repression operator site from lacZYA promoter. It can be activated by the AraC protein in the presence of arabinose and repressed by the LacI protein in the absence of IPTG. The mechanism of the oscillator is that we can activate the promoter by adding Arabinose and IPTG to the medium and thus results in the transcription of each component of the circuit. The increased production of AraC in the presence of Arabinose results in a positive feedback loop that increases promoter activity. However, the concurrent increase in production of LacI results in a linked negative feedback loop that decreases promoter activity. The two feedback loops with opposite effects can then drive an oscillatory behavior. Click here to know more about our project!

We designed and made a poster (Figure 1) for our  porject. Click the picture, then you can view details in our poster.

2013_iGEM Poster

Figure 1. 2013_iGEM Poster

This experience is unforgettable! I really enjoyed working in a team, and most importantly, I learned how to solve problems when getting stuck in a research and how to express myself properly and effectively in a team.

Team HUST-China 2013

Team HUST-China 2013

I met many friends during the Jamboree, MIT. This is Alex from the University of Marburg.

I met many friends during the Jamboree, MIT. This is Alex from the University of Marburg.

High Resolution Melting (HRM) analysis

These days, I’ve been using HRM to scan mutations in target gene of patients with Ventricular Tachycardia (VT).

Double-stranded DNA (dsDNA)  will dissociate at a certain high temperature; the temperature depends on the sequence and the structure of the dsDNA. The melting curve is an important property of dsDNA. dsDNAs with only one different base pair will have different melting curve, thus we can use it to detect mutant genes.

Human chromosomes are diplont. Since mutations are rare, even if the target gene of the patient is mutated, it is most likely to be heterozygous mutation (it is possible to be homozygous mutation, but very rare) and usually single base mutation. When use heterozygous mutated gene as PCR template, the product will have two different melting temperatures. In the Rotor-Gene Q Software (software that we use to process data of HRM in our lab) normal samples have a single peak at the melting temperature, while mutant samples have two. We can further sequence samples with two peaks to detect the mutated base pair.

This method is enabled by adding dsDNA-binding dyes (eg. SYTO® 9 Green Fluorescent Nucleic Acid Stain) when conducting PCR. The Rotor-Gene Q can draw the melting curve by detecting the fluorescence of the sample.

Rotor-Gene Q, the machine we use to do HRM

Rotor-Gene Q, the machine we use to do HRM

Example:

Raw Data

Raw Data

The result of one test shows the melting curves of 36 samples. The horizontal axis shows the temperature range, the vertical axis represents fluorescence intensity. The fluorescence intensity goes down sharply at a certain temperature, namely melting temperature.

Melt Data

Melt Data

Use the software to process the raw data. The horizontal axis shows the temperature range, the vertical axis represents dF/dT. The melt data shows that all samples have a single peak while one sample has two peaks, which indicates the possible mutation.