Small-scale, semi-automated purification of eukaryotic proteins for structure determination.
A simple approach that allows cost-effective automated purification of recombinant proteins in levels sufficient for functional characterization or structural studies is described. Studies with four human stem cell proteins, an engineered version of green fluorescent protein, and other proteins are included. The method combines an expression vector (pVP62K) that provides in vivo cleavage of an initial fusion protein, a factorial designed auto-induction medium that improves the performance of small-scale production, and rapid, automated metal affinity purification of His8-tagged proteins. For initial small-scale production screening, single colony transformants were grown overnight in 0.4 ml of auto-induction medium, produced proteins were purified using the Promega Maxwell 16, and purification results were analyzed by Caliper LC90 capillary electrophoresis. The yield of purified [U-15N]-His8-Tcl-1 was 7.5 microg/ml of culture medium, of purified [U-15N]-His8-GFP was 68 microg/ml, and of purified selenomethione-labeled AIA-GFP (His8 removed by treatment with TEV protease) was 172 microg/ml. The yield information obtained from a successful automated purification from 0.4 ml was used to inform the decision to scale-up for a second meso-scale (10-50 ml) cell growth and automated purification. 1H-15N NMR HSQC spectra of His8-Tcl-1 and of His8-GFP prepared from 50 ml cultures showed excellent chemical shift dispersion, consistent with well folded states in solution suitable for structure determination. Moreover, AIA-GFP obtained by proteolytic removal of the His8 tag was subjected to crystallization screening, and yielded crystals under several conditions. Single crystals were subsequently produced and optimized by the hanging drop method. The structure was solved by molecular replacement at a resolution of 1.7 A. This approach provides an efficient way to carry out several key target screening steps that are essential for successful operation of proteomics pipelines with eukaryotic proteins: examination of total expression, determination of proteolysis of fusion tags, quantification of the yield of purified protein, and suitability for structure determination.
PubMed ID: 17985212
PMC ID: PMC2668602
Article link: J Struct Funct Genomics.
Grant support: (5T32HG002760 NHGRI NIH HHS , GM50853 NIGMS NIH HHS , U54 GM074901 NIGMS NIH HHS
Genes referenced: cat.2 ccnf igf2bp3 mbp npm2 nr4a1
Article Images: [+] show captions
|Fig. 1. Expression vector pVP62K. (a) Linear map showing key features of the vector and location of the Bar-CAT toxic cassette and 3′ homology region (3′-hmr) for Flexi Vector cloning. (b) Nucleotide and encoded protein sequence in the linker region near to the SgfI cloning site. The TVMV protease site is ETVRFQS, where proteolysis occurs between the Q and S residues. The fusion protein may be cleaved in the expression host due to the presence of a low level of TVMV protease produced by constitutive expression from pVP62K. The TEV protease site is ENLYFQA, where proteolysis occurs between the Q and A residues. After purification of the His8-tagged protein, the His8 tag can be removed by treatment with TEV protease to release an N-terminal AIA-target|
|Fig. 2. Conditional methionine auxotrophy in E. coli B834. (a) Genome organization near to the metE gene in E. coli K12 . (b) Genome organization near to the metE gene in E. coli B834. In this organism, DNA sequencing revealed a large insert in the metE gene, which caused the protein to be truncated to 56 amino acids (aa), non-functional peptide|
|Fig. 3. Caliper LC90 analysis of His8-tagged proteins purified by Maxwell 16. Lanes LA and LB are molecular weight markers. Lanes A1–B2 are structural genomics target proteins (protein bands marked with ovals) with molecular weight ∼50–75 kDa. They were expressed in factorial evolved auto-induction medium containing selenomethionine  as an N-terminal fusion with MBP from pVP56K, a vector that does not give in vivo proteolysis of the fusion protein. Lane B3 contains His8-MBP (protein band marked with oval), while lane B4 (1.1 mg/ml) contains His8-GFP expressed from pVP62 after in vivo cleavage from MBP. Lanes with a purified expressed fusion protein with yield than 100 μg/ml are marked with a star (also see Table 2 )|
|Fig. 4. Small-scale purification screening of human embryonic stem cell proteins. Human stem cell proteins were expressed in E. coli B834 by auto-induction, liberated by in vivo proteolysis, and purified by the Maxwell 16 purification system. Table 2 provides further information on these proteins. Lane 1, molecular weight markers. Lanes 2 and 3, total cell lysate and eluted sample from purification of CCNF. No purified protein was detected. Lanes 4 and 5, C10orf96 was obtained in detectable amounts, but not sufficient for scale-up, along with two higher molecular weight contaminants. Lanes 6 and 7, His8-Tcl-1 was expressed, proteolyzed, and successfully purified. Lanes 8 and 9, NPM2 was expressed and proteolyzed, but only a small amount of protein was purified. In addition, the purified protein appeared to be partially degraded. Lanes 10 and 11, His8-GFP|
|Fig. 5. Replicate Maxwell 16 purification of human embryonic stem T-cell lymphoma-1 protein. Lane 1, molecular weight markers. Lanes 2–12, replicate purifications of His8-Tcl-1. Lane 13, His8-MBP-At2g34690.1, an Arabidopsis thaliana protein expression control|
|Fig. 6. 1H–15N HSQC NMR spectra of Maxwell-purified proteins Tcl-1 and GFP. (a) 750 MHz spectrum of His8-Tcl-1 obtained at 35°C (1.75 mg in 250 μl of 10 mM KHPO4, pH 7, containing 50 mM KCl). The total NMR time required to obtain this spectrum was 9.5 h. (b) 600 MHz spectrum of His8-GFP obtained at 35°C (5.6 mg in 250 μl of 10 mM KHPO4, pH 7, containing 50 mM KCl). The total NMR time required to obtain this spectrum was 1 h|
|Fig. 7. X-ray structure of AIA–GFP. The chromophore is shown as green cylinders representing bonded atoms|
|Fig. 8. Schematic of a purification screening protocol. Steps from obtaining a sequence-verified target in auto-cleavage vector pVP62K to identification of purified proteins. The transformed expression host is grown in auto-induction medium. Cells from production trials are loaded into the Maxwell 16 instrument for automated purification, and purified proteins are detected by Caliper LC90 capillary electrophoresis. Successful purification of a protein from auto-cleavage expression with yield exceeding 50 μg/ml of culture medium indicates feasibility of scale-up efforts|