Native Chemical Ligation: How Long Peptides Get Built From Smaller Pieces
Chemists can write any peptide sequence on paper, but the machinery that builds them runs out of steam after a few dozen residues. Native chemical ligation is the reaction that gets around that limit, joining separately made fragments into one long, native chain. Here is how it works, why it needs a cysteine, and how researchers stitch three or more pieces together.
by Research Assistant·
You can write any peptide sequence you like on paper. Building it is the hard part. The chemistry that assembles peptides one amino acid at a time works beautifully for short chains, but it loses accuracy as the chain grows. Somewhere around fifty residues, it effectively runs out of room. Proteins are much longer than that. For research use only: the chemistry described here is a laboratory technique for research and manufacturing, not a method for producing material for human or animal use. So how does anyone build a long peptide or a small protein from scratch? The answer, for most of modern chemical protein synthesis, is native chemical ligation — a reaction that lets chemists make short, clean fragments separately and then join them into one continuous, native chain.
This article walks through why a single long synthesis breaks down, the two-piece trick that native chemical ligation relies on, the two-step mechanism that forms the bond, the catalyst that makes it practical, which junctions actually work, and how researchers extend the same idea to assemble proteins from three or more pieces.
Why You Can't Just Build a Long Peptide in One Go
Here is the short answer: every amino acid you add in a stepwise build is added with slightly less than perfect efficiency, and small imperfections multiply over a long chain. Add a residue at 99% efficiency and you barely notice. Repeat it a hundred times and the math turns against you — the yield of the full-length, correct product falls steeply, and the failure sequences that pile up become nearly impossible to separate from the one you wanted.
This is why solid-phase peptide synthesis, the standard way peptides are assembled on a resin support, is practical only up to roughly fifty residues before purity and yield degrade past the point of usefulness. Beyond that length, pushing the same chemistry harder does not help.
The fix is to stop thinking about one long build and start thinking convergently. Make several short fragments separately, purify each one while it is still easy to purify, and then join the finished pieces. Think of it as assembling something from well-made parts instead of carving it from a single block — and joining unprotected fragments cleanly in water is exactly the problem native chemical ligation was designed to solve, as the published overviews of the method describe.
The Two Pieces Native Chemical Ligation Needs
Native chemical ligation is picky. That pickiness is the whole point. It joins one specific pair of chemical ends and ignores almost everything else, which is what lets it work on fragments whose side chains are left fully exposed.
The first fragment has to end in a C-terminal thioester. A thioester is just an ester in which a sulfur atom stands in where an oxygen would normally sit; that small swap makes the carbon it is attached to far more reactive toward sulfur-bearing partners. The second fragment has to begin with an N-terminal cysteine. Cysteine is the one common amino acid carrying a thiol — a sulfur-containing side chain — and that thiol is the reactive handle that starts the reaction.
Everything else on both fragments stays unprotected. The carboxylic acids of aspartate and glutamate, the amino group of lysine, the hydroxyls of serine, threonine and tyrosine — the reaction simply does not touch them. That selectivity is what makes the method so forgiving: you do not have to mask the rest of the molecule the way much peptide chemistry requires. For a sense of how much variety those side chains introduce across even closely related peptides, it helps to look at the chemical structure of related peptides and how small differences add up.
The Reaction in Two Steps
What does the join actually look like up close? It happens in two stages, and the difference between them — one reversible, one not — is what makes the whole thing work.
Step one: the fragments find each other
First, the thiol on the N-terminal cysteine attacks the thioester at the end of the other fragment. This is a transthioesterification: the two pieces become linked through sulfur, forming a temporary intermediate. Crucially, this step is reversible — pieces can come together and fall apart again — so nothing is committed yet. That reversibility turns out to be a feature, not a bug, because it lets the system sort itself out before anything permanent happens.
Step two: the join becomes permanent
The temporary sulfur-linked intermediate then rearranges. An internal shift — chemists call it an S-to-N acyl shift — moves the connection from the sulfur over to the neighboring nitrogen, and the result is an ordinary peptide bond, indistinguishable from any other bond in the chain. This second step is effectively irreversible under the reaction conditions, which is what drives the whole process forward: pieces shuffle reversibly in step one, but once step two fires, that junction is locked in as native amide.
This selectivity has a neat consequence. Only the cysteine at the very start of a fragment ends up forming the new bond; cysteines sitting in the middle of either piece can take part in the reversible first step but cannot complete the permanent rearrangement, so they are left untouched. The concept was reported by Dawson, Muir and Kent in 1994, building on the broader chemical-ligation idea Kent and Schnölzer introduced in 1992, with the underlying acyl shift itself first observed by Wieland back in 1953, as documented in the mechanistic literature.
The Catalyst That Makes It Practical
Left entirely on its own, that first step can be slow. The practical workhorse versions of the reaction add a small-molecule thiol catalyst that speeds up the reversible exchange without changing the outcome.
The catalysts that work best are aryl thiols — sulfur compounds attached to an aromatic ring — with an acidity (pKa) above roughly six, because they exchange rapidly with the thioester. The standard choice today is MPAA, or 4-mercaptophenylacetic acid, which is effective and conveniently water-soluble, as a detailed study of the reaction's mechanism and catalysis established.
The conditions are mild and unglamorous, which is part of the appeal: an aqueous buffer at neutral pH, room temperature, and usually a high concentration of guanidine hydrochloride to keep the fragments dissolved while they react. No exotic solvents, no harsh heat.
Which Junctions Actually Work
Because the reaction is built around cysteine, an obvious question follows: does it matter what residue sits just before that cysteine at the join? It does — mostly for speed.
Combinatorial studies that paired thioester fragments against a cysteine-starting partner showed that all twenty naturally occurring amino acids can occupy the position immediately before the junction. There is a caveat: valine, isoleucine and proline ligate the most slowly, because their bulky or constrained structures crowd the reacting center. That work, which paired peptides and read out the products by mass spectrometry, mapped the full residue-by-residue picture and is described in a study expanding the scope of the method.
The practical takeaway for anyone planning a synthesis is simple. You cut the target sequence into fragments at points where a cysteine naturally sits, and where you have a choice, you steer away from the slow junctions. Plan the cut points well and half the work is done.
Beyond Cysteine
There is a catch with all of this: real proteins do not have a cysteine everywhere you might want to make a cut. Cysteine is actually one of the rarer amino acids, so if the method only ever worked at cysteine, its reach would be narrow. Chemists got around this in a few ways.
The most widely used trick is desulfurization. After the ligation is finished and the new bond is formed, the sulfur on the junction cysteine is chemically removed, which converts it into alanine. Since alanine is common, this effectively lets researchers treat many alanine sites as if they were cysteine for planning purposes. Building on the same idea, a family of thiol-derivatized amino acids — versions of phenylalanine, valine, threonine, lysine and others that carry a temporary sulfur handle — plus selenium-based selenocysteine extend the menu of usable junctions much further, as a review of extensions to the method lays out.
Stitching Three or More Pieces Together
Two fragments make one join. But most proteins need three, four, or more pieces Two fragments make one join. Most proteins are long enough to need three, four or more pieces, and that raises a sequencing problem:mdash; and that raises a sequencing problem: if you throw all the fragments into one pot, how do you stop them from joining in the wrong order?
The difficulty is that an internal fragment — one that sits in the middle of the final chain — needs both a reactive N-terminal cysteine and a C-terminal thioester. With both ends live at once, internal pieces could react with each other prematurely or even loop back on themselves. Researchers solve this with masking groups and careful ordering: temporarily capping the N-terminal cysteine of a fragment (for example with an Fmoc group) keeps it dormant until it is needed, so the pieces couple in the intended direction. Combined with kinetically controlled, one-pot strategies, this allows several fragments to be assembled in sequence without isolating every intermediate, an approach demonstrated in work on fully convergent, one-pot ligations using a masked cysteine.
The size ceiling lifts even further when one of the pieces is made biologically. In expressed protein ligation, a recombinant protein segment carrying the right reactive end is joined to a chemically synthesized fragment, which means there is essentially no upper limit on how large the final product can be. Once everything is assembled, the full-length product's identity has to be confirmed — this is where mass spectrometry confirms the sequence matches what was designed.
Frequently Asked Questions
What is native chemical ligation in simple terms?
It is a chemical reaction that joins two separately made peptide fragments into one longer chain, forming an ordinary peptide bond at the join. One fragment ends in a chemical handle called a thioester; the other begins with the amino acid cysteine. When they meet in water at neutral pH, they link up on their own.
Why is a cysteine residue required at the ligation site?
Cysteine carries a sulfur-containing thiol side chain that performs the first attack on the partner fragment's thioester. That sulfur is what lets the two pieces find each other selectively before the join rearranges into a normal peptide bond. Modern variants can remove or substitute the cysteine afterward, but the reaction still needs that sulfur handle to start.
How long a protein can native chemical ligation build?
By joining several fragments in sequence, researchers have chemically assembled proteins of roughly 150 to 300 amino acids and beyond, far past the rough fifty-residue practical ceiling of direct stepwise synthesis. Combining the method with recombinant protein segments, called expressed protein ligation, removes the upper size limit entirely.
Is native chemical ligation used to make peptides for human use?
No. It is a laboratory chemistry technique used in research and manufacturing studies. Any material discussed here is for research use only and is not intended for human or animal consumption.
The Bottom Line
Long peptides and proteins are not built in one heroic pass. They are built convergently — as short, clean fragments that are then joined — and native chemical ligation is the reaction that does the joining, knitting unprotected pieces together into one native chain at a cysteine junction under mild, watery conditions. The reversible first step and the irreversible acyl shift give it both selectivity and a thermodynamic push, and a simple aryl-thiol catalyst makes it fast enough to be routine. Extensions like desulfurization and one-pot, multi-fragment strategies keep widening the range of what chemists can reach. If you want to see where the fragments come from in the first place, the upstream step is solid-phase peptide synthesis; for how the finished product is checked, look at peptide mass spectrometry verification.
For research use only. Not for human or animal
consumption of any kind. The information in this article is for
educational purposes only and is not intended to diagnose, treat,
cure, or prevent any disease. The statements made have not been
evaluated by the U.S. Food and Drug Administration. These products
are NOT FDA APPROVED. Please consult with a licensed healthcare
professional before making any decisions regarding your health
or research.
Optides LLC is a chemical supplier. Optides LLC is not a
compounding pharmacy or chemical compounding facility as defined
under 503A of the Federal Food, Drug, and Cosmetic Act. Optides LLC
is not an outsourcing facility as defined under 503B of the Federal
Food, Drug, and Cosmetic Act.
Tags
Native Chemical LigationThioesterPeptide SynthesisResearch PeptidesProtein Chemistry
Stay In The Loop
Stay Updated on New Research Compounds
Be the first to know about new products, research guides, and exclusive offers.