Resources>Blog>Historical Background and Biotechnology Milestones of Recombinant Proteins

Historical Background and Biotechnology Milestones of Recombinant Proteins

Biointron 2025-09-29 Read time: 8 mins
crf370078-fig-0002-m.jpg
Simple diagram of recombinant protein (RP) expression. DOI: 10.1111/1541-4337.70078

Molecular Biology Foundations Enabling Recombinant Protein Expression

Recombinant protein technology emerged from a series of foundational discoveries in molecular biology during the mid-20th century. These began with Avery, MacLeod, and McCarty's identification of DNA as the carrier of genetic information in the 1940s, followed by the “one gene–one enzyme” hypothesis by Beadle and Tatum, and culminated in Watson and Crick's 1953 model of DNA structure, aided by X-ray diffraction data from Rosalind Franklin.

The understanding of the central dogma, as in the flow of genetic information from DNA to RNA to protein, was clarified through the efforts of Khorana, Nirenberg, and Holley, who deciphered the genetic code. Their work established that triplet codons on mRNA specify individual amino acids, forming the molecular basis of protein synthesis. These discoveries provided the conceptual basis for later genetic manipulation.

By the late 1960s, the lacZ gene from E. coli, encoding β-galactosidase, became the first gene to be isolated and cloned, using bacteriophage vectors. This allowed researchers to examine gene regulation and laid the groundwork for gene insertion into other organisms.

Development of Tools for Gene Cloning

Two critical enzymatic tools were required to construct recombinant DNA: restriction endonucleases and DNA ligases. The restriction enzymes, which cleave DNA at specific sequences, enabled scientists to cut and manipulate DNA fragments with precision. DNA ligases catalyze the formation of phosphodiester bonds, enabling ligation of DNA fragments to form recombinant molecules.

By 1972, Jackson, Symons, and Berg used restriction enzymes (EcoRI) and DNA ligase to generate the first recombinant DNA molecules by joining bacterial and viral DNA. This work was followed by Cohen and Boyer’s experiments showing that plasmids containing foreign DNA could be introduced into E. coli, propagated, and expressed, thus establishing the first recombinant organisms.

The novelty of this approach lay not in the tools themselves, but in how they were applied. As Freeman Dyson and Peter Galison emphasized, the revolutionary power of recombinant DNA stemmed from repurposing existing molecular tools in new combinations to address previously intractable biological questions.

Related: Stable vs. Transient Expression Systems in Antibody Development

Initial Therapeutic Applications and Clinical Translation

The first therapeutic recombinant protein, human insulin, was produced in E. coli by Genentech in 1978 and approved by the FDA in 1982. This replaced animal-derived insulin and provided a more scalable, consistent, and immunologically safer alternative.

This success catalyzed a wave of biologic therapeutics including:

  • Recombinant human growth hormone (rhGH)

  • Recombinant erythropoietin (EPO)

  • Interferons (e.g., IFN-α2b, IFN-β1a)

  • Clotting factors (Factor VIII, IX)

  • Enzymes (e.g., DNase for cystic fibrosis)

  • Tissue plasminogen activator (tPA)

As of today, over 130 recombinant proteins have been approved by the FDA, with more than 170 in clinical use worldwide. These biologics are used to treat conditions such as diabetes, anemia, multiple sclerosis, hemophilia, rheumatoid arthritis, Crohn’s disease, and various cancers.

Generations of Therapeutic Recombinant Proteins

Recombinant proteins have undergone three generations of refinement:

  1. First generation: Unmodified proteins with native structure (e.g., insulin, rhGH).

  2. Second generation: Enhanced proteins with improved pharmacokinetics, reduced immunogenicity, or modified stability (e.g., pegylated interferons, darbepoetin).

  3. Third generation: Recombinant proteins engineered for novel delivery routes, targeted biodistribution, and optimized therapeutic index.

These enhancements reflect both genetic engineering strategies and advances in expression systems and purification technologies.

Advances in Expression Systems for Recombinant Proteins

Early recombinant protein production relied heavily on E. coli due to its rapid growth and genetic tractability. However, E. coli lacks the machinery for proper eukaryotic post-translational modifications (e.g., glycosylation, disulfide bond formation), limiting its use for complex proteins.

To overcome these limitations, the field has adopted a tiered set of eukaryotic expression systems, each with distinct capabilities:

Expression SystemCharacteristicsApplications
E. coliRapid growth; lacks PTMsEnzymes, small proteins
Pichia pastorisSome glycosylation; high expressionAntigens, cytokines
Baculovirus/insectComplex PTMs; moderate yieldsVaccines, structural biology
Mammalian (CHO, HEK293)Full PTMs; native foldingAntibodies, Fc-fusions

Mammalian systems are the gold standard for proteins requiring human-like glycosylation and accurate folding. CHO cells remain the primary host for therapeutic-grade protein production, while HEK293 is commonly used for rapid screening or research-scale expression.

Recombinant Protein Production →

From Gene to Protein: Technical Challenges and Solutions

Recombinant protein production begins by cloning a gene of interest (typically as cDNA) into a mammalian or microbial expression vector containing regulatory elements (e.g., promoter, enhancer, signal peptide).

Key technical considerations include:

  • Codon optimization: Adjusting gene sequences to match host tRNA abundance.

  • Vector design: Incorporating strong promoters (e.g., CMV), secretion signals, and affinity tags.

  • Protein folding: Co-expression of molecular chaperones or regulated induction to prevent aggregation.

  • Purification: Use of affinity chromatography (e.g., His-tag, Fc-tag) followed by polishing steps.

  • Protein modifications: Ensuring correct disulfide bond formation and glycosylation patterns.

For high-throughput applications, transient transfection methods in HEK293 or CHO cells are routinely used to express milligram-scale protein quantities in 10 days.

RushMab™ Small-Scale Antibody Expression Packages →

Regulatory and Industrial Frameworks

The rapid expansion of recombinant DNA and protein technologies prompted regulatory oversight. In 1976, the U.S. NIH issued the first Guidelines for Research Involving Recombinant DNA, providing biosafety frameworks for academic and industrial labs. These guidelines have evolved globally and continue to govern the use of recombinant organisms.

The biotechnology industry emerged in parallel. The formation of Genentech (1976), Biogen (1978), and Amgen (1980) marked the commercialization of molecular biology. Licensing of foundational patents (e.g., the Cohen-Boyer patents) enabled universities and research institutions to profit from biotech innovations, accelerating investment in recombinant therapeutics.

Modern Trends in Recombinant Protein Development

Several trends now define the landscape of recombinant protein production:

  • Transient expression is increasingly used for rapid screening and early-stage drug discovery.

  • CHO-based stable expression remains the benchmark for GMP manufacturing.

  • Automation and high-throughput platforms enable clone selection, media optimization, and purification process development.

  • Synthetic biology and cell-free systems are emerging as alternatives for difficult-to-express proteins.

  • Computational modeling and AI tools aid in protein engineering, stability prediction, and expression optimization.

Biointron’s Recombinant Protein Expression Platform

Biointron offers recombinant protein expression in both mammalian (CHO-K1 and HEK293) and bacterial (E. coli) systems. These platforms support rapid, scalable production of enzymes, soluble proteins, secreted proteins, and antibody fragments for research and diagnostic applications.

  • Mammalian expression is recommended for proteins requiring post-translational modifications such as glycosylation or disulfide bond formation.

  • E. coli expression is suitable for high-yield production of non-glycosylated proteins or constructs designed for cytoplasmic expression and affinity purification.

Standard workflows include codon optimization, transient transfection, affinity purification, and QC validation via SDS-PAGE, SEC-HPLC, and endotoxin testing (<1 EU/mg). Projects typically deliver results within 3-4 weeks and can scale from 100 mL to 10 L. 

Recombinant Protein Production →

Learn more about Biointron’s recombinant protein services here: https://www.biointron.com/recombinant-protein-expression/recombinant-protein-expression-in-mammalian-cells.html 

 

References:

  1. Berg, P., & Mertz, J. E. (2010). Personal Reflections on the Origins and Emergence of Recombinant DNA Technology. Genetics, 184(1), 9-17. https://doi.org/10.1534/genetics.109.112144

Subscribe to our Blog
Recommended Articles
Function-First Antibody Discovery with Microfluidics: Screening for Activity, Not Just Binding

Antibody discovery has traditionally relied on selecting clones based on target ……

Nov 12, 2025
SITC 2025 – Maryland: Highlights and Event Recap

The Society for Immunotherapy of Cancer (SITC) 2025 conference was held in Natio……

Nov 10, 2025
VHH Library Generation: Immunized, Naïve, and Synthetic Nanobody Libraries

A VHH library is a collection of variable domains of heavy-chain antibodies (VHH……

Nov 07, 2025
Biointron Launches Quarterly Travel Grant to Support Early-Career Antibody Researchers

Biointron is proud to announce the launch of the Biointron Quarterly Travel Gran……

Nov 06, 2025

Our website uses cookies to improve your experience. Read our Privacy Policy to find out more.