
The PROTEIN PURIFICATION AND CHARACTERIZATION Team headed by George Phillips, Jr., PhD, is responsible for purifying all recombinant protein produced by the project. This team also confirms that the expressed protein corresponds to the expected product.
An efficient and semi-automated pipeline system has been established for high-throughput purification of E. coli-expressed proteins. The protocols have been optimized to work with unlabeled proteins, proteins labeled with Se-Met for X-ray crystallography, and proteins labeled with 15N or 13C;15N for NMR spectroscopy.
Key Steps of Protein Purification
Two-Step Purification of Recombinant Protein. Recombinant protein is purified using a two-step chromatography procedure, with an optional polishing step where ionic or size exclusion columns are used to improve the purity of the target protein. Once proteins have been processed through the concentration stages, all samples are sent to the ESI-MS and MALDI-MS analysis for quality assurance before X-ray crystal screening or NMR HSQC data acquisition.
Key Features of the System. The key features of this system include an HPLC that uses a binary gradient pump to purify six proteins sequentially by applying gradient elution from six independent Ni-IDA columns. One-step desalting and subtractive Ni-IDA chromatography removes His-tagged proteins from the target protein. Three such systems are presently in use.
The construct used always yields an N-terminal serine following TEV protease cleavage. Sesame (LIMS) is used for data capture and analysis. Details and statistics for each purification step, including % TEV cleavage, yields, and purity, are recorded.
Optimization of Purification Protocols. In order to optimize the pipeline, we developed protocols for automated protein production. We found that the presence of chaotrophic agents such as ethylene glycol and imidazole in the initial sonication buffer are critical to obtain high purity of fusion proteins. Optimum concentration of ethylene glycol and imidazole are 20% (w/v) and 35 mM, respectively. The optimized protocol allows purification of native, SeMet-, 15N-, and 13C;15N-labeled proteins up to 150 mg of protein from 2L culture volume. The purity of target proteins is typically greater than 90%, suitable for structural determination by X-ray crystallography and NMR.
Experimental bioinformatic data are used to improve efficiency of current E. coli protein production pipeline. The most common reason for failures of target purification is poor TEV proteolysis, particularly when the percentage of cleavage is less than 70%. The results emphasize the requirement for screening methods that can assess the TEV proteolysis at small-scale cell culture stage. We are in the process of testing expression vectors that contain different proteases cleavage sites and different linker sequences and are developing high-throughput methods for screening for cleavage and solubility at the small-scale expression stage.
We also utilize a highly ordered data storage system, Lamp module in Sesame (LIMS), to monitor the quality control of purification processes and to store the biochemical properties of purified protein such as mobility and purity on SDS-polyacylamide gel, UV-visible spectra, MALDI-MS, and ESI-MS.
Expression Vectors and Large-Scale Cell Growth. We have designed a (His)n-MBP fusion tag system (n = 6 or 8) to overcome the low solubility of recombinant eukaryotic proteins and to provide a generic Ni-IMAC purification strategy. The pVP13 and pVP16 expression vectors used for these studies were derived from pQE80 (Qiagen, Valencia, CA) to express an N-terminal fusion protein consisting of (His)n-MBP and a linker region containing the TEV protease site contiguous with the second residue of the target protein. Either E. coli Rosetta or B834 strains were used to produce unlabeled-, 15N-, and 15N/13C-, or SeMet-labeled proteins, respectively. Cells were inoculated in a 2-liter polyethylene terephthalate bottle which contained 500 ml of Terrific Broth or auto-induction medium and incubated in a shaker at 250 rpm, 25oC for 22-24 h. Cells were harvested by centrifugation at 5000 x g for 20 min.
Protein Purification Protocols. The overall philosophy of the protein purification pipeline is to automate the protocols as much as possible while preserving protein quality sufficient for structural studies. Protein purification processes are as follows:
Step 1: Cell lysis and preparation of the soluble fraction
Step 2: 1st IMAC capture of His-tagged fusion proteins
Step 3: Desalting of fusion proteins into TEV proteolysis buffer
Step 4: TEV proteolysis of fusion tags
Step 5: 2nd IMAC removal of tag and isolation of target proteins
Step 6: Desalting of targets
Step 7: Concentration of targets
Step 8: Drop-freezing of targets
Steps 2, 3, 5, and 6 are performed on the ÄKTA Purifier controlled by Unicorn 4.12 software. Detail description of purification processes was reported.
Confirm Identify and Integrity of Proteins. All purified proteins are subjected to MALDI-TOF and ESI mass spectrometry to confirm identity and integrity, determine oligomeric state, and investigate possible ligands. Incorporation levels of Se-Met and 15N and 13C isotopes are also determined by ESI-MS.