A peptide sequence is more than a row of letters on a label. It is a precise description of how amino acids are linked together, and it tells researchers what the molecule is, how it might fold, and what it could do.
Reading the Letter Codes
Amino acids are written in two systems. The three-letter code is more readable: Gly for glycine, Ala for alanine, Leu for leucine. The single-letter code is more compact: G, A, L. Both refer to the same 20 standard amino acids.
A sequence is read from left to right, starting with the N-terminus (the free amino group) and ending with the C-terminus (the free carboxyl group). So "Gly-Glu-Pro-Pro-Pro" begins with glycine on the N-terminal end. Direction matters because peptides are not symmetrical — reversing the sequence gives a different molecule.
Modifications often appear at the ends. "Acetyl-" at the start or "-NH2" at the end indicates capping groups added to change stability or behavior. These small additions can change properties significantly.
What the Sequence Suggests About Function
Each amino acid contributes a side chain with its own chemistry. Charged residues like lysine (positive) or aspartate (negative) often sit on the surface and interact with other charged molecules. Hydrophobic residues like leucine and valine often face inward in folded structures.
Patterns in a sequence hint at function. Repeating prolines, for example, can create rigid bends. Cysteine pairs can form disulfide bridges that lock parts of the molecule into a stable shape. Researchers familiar with these patterns can often predict broad behavior just from reading the sequence.
Comparing a new sequence to known peptides also helps. A sequence that closely resembles a known signaling peptide may target similar pathways. Database tools like BLAST make this comparison straightforward.
Why Sequence Verification Matters
A peptide is only as useful as its sequence is correct. Synthesis errors can introduce missing amino acids, swapped residues, or unexpected modifications. Without verification, a study could be testing a slightly different molecule than intended.
Mass spectrometry is the standard verification tool. The measured mass should match the calculated mass for the intended sequence within a small tolerance. Tandem mass spectrometry can go further by fragmenting the peptide and confirming the order of residues.
HPLC complements mass spectrometry by showing how clean the sample is. A single sharp peak at the expected retention time supports the identity of the main product. Together, the two methods give researchers confidence that the vial matches the label.
From Sequence to Three-Dimensional Structure
A linear sequence is just a starting point. In solution, peptides can fold into helices, sheets, turns, or loose coils depending on their residues and environment. Short peptides often have flexible structures that shift, while longer or constrained sequences can hold more defined shapes.
Modeling tools and structural databases help researchers predict folding behavior. For larger peptides, experimental techniques like nuclear magnetic resonance or X-ray crystallography can resolve the actual three-dimensional structure.
Sequence analysis is a fast-evolving area, and predictive tools are improving every year. These compounds are intended for research use only and are not for human consumption.