Article — Protein Molecular Weight Calculator
Protein Molecular Weight Calculator: Sequence to kDa
A protein molecular weight calculator sums the residue mass of each amino acid in a sequence and adds one water molecule (18.015 Da) for the free N- and C-termini. The result is reported in daltons (Da) and kilodaltons (kDa). For a quick estimate without sequence, multiply residue count by 110 Da — accurate within 10 percent for most natural proteins.
This number drives nearly every protein experiment that comes after. SDS-PAGE markers are labeled in kDa. Size exclusion columns separate by Stokes radius which correlates with MW. Mass spectrometry confirms predicted MW against observed mass to verify identity. A bad MW estimate cascades into every downstream calculation.
What protein molecular weight is
Protein molecular weight is the total mass of all atoms in one protein molecule, reported in daltons. Daltons are equivalent to g/mol, so a 50 kDa protein weighs 50,000 grams per mole. Most natural proteins fall between 10 and 250 kDa. Small peptides like insulin sit at 5.8 kDa. Therapeutic IgG antibodies are 150 kDa. Titin, the giant elastic muscle protein, is 3,800 kDa or 3.8 megadaltons.
MW is computed from sequence using residue masses — the mass of each amino acid after losing one water during peptide bond formation. The protein molecular weight calculator looks up residue mass for each of the 20 standard amino acids, sums across the sequence, and adds one water for the free termini.
Residue masses and the water correction
An isolated amino acid like glycine weighs 75.07 Da. Inside a peptide chain, glycine appears as a residue — what is left after the peptide bond ate one water during synthesis. Residue mass for glycine is 75.07 minus 18.02 (water) equals 57.05 Da. Every amino acid loses one water when joining the chain.
For a chain of n residues, the protein molecular weight equals the sum of residue masses plus one extra water (the free termini). The water correction is built into residue masses, so the formula reads: MW = Σ(residue masses) + 18.015.
The dalton is named after John Dalton, the early-1800s chemist who founded atomic theory. One dalton equals 1/12 the mass of a carbon-12 atom — exactly 1.66054 × 10⁻²⁴ grams. The unit is used across chemistry, biochemistry, and mass spectrometry without conversion.
The 110 Da quick estimate
For sequences you do not have on hand, multiply residue count by 110 Da. A 300-residue protein weighs about 33 kDa. The 110 number is the weighted average mass per residue across natural proteins, balancing the small residues (glycine 57, alanine 71) against the large ones (tryptophan 186, tyrosine 163, phenylalanine 147).
The estimate is accurate within 5 to 10 percent for most natural sequences. Unusual compositions break the rule — glycine-rich collagen runs lower than 110 Da/residue, while tryptophan-rich repeats run higher. For pre-experiment planning, the 110 Da rule is fine. For final calculations, use the sequence-based protein molecular weight calculator.
Average vs monoisotopic mass
Two MW values exist for every protein: average mass and monoisotopic mass. Average mass uses the natural-abundance weighted mean of each element (carbon = 12.011, hydrogen = 1.008). Monoisotopic mass uses the lightest isotope (carbon-12 = 12.000 exactly, hydrogen-1 = 1.00783).
The difference for a 50 kDa protein is about 25 Da — average mass runs higher because heavier isotopes contribute to the natural mix. Average is the right choice for SDS-PAGE, chromatography, and biochemistry. Monoisotopic is required for mass spectrometry, where the spectrometer separates individual isotopologues and reports the lightest peak as the monoisotopic mass.
If your mass-spec reading is about 25 Da lower than the calculator predicts, do not panic. The spectrometer reported the monoisotopic peak while the calculator used average mass. Both numbers are right — pick the one that matches the technique.
Protein molecular weight in SDS-PAGE
SDS-PAGE separates proteins by mobility through a polyacrylamide gel in the presence of sodium dodecyl sulfate. SDS coats proteins with negative charge proportional to length, so mobility tracks size. Compared to a ladder of known MWs, an unknown band gives an estimated protein molecular weight.
The estimate is accurate within 5 to 10 percent on a well-run gel. Errors creep in from non-standard amino acid composition (proline-rich proteins migrate slowly), incomplete reduction, or atypical glycosylation. Always confirm critical MW assignments by mass spectrometry — gel-based estimates are useful for screening and identity confirmation, not final reporting.
Post-translational modifications
Predicted MW from sequence is the unmodified protein. Post-translational modifications (PTMs) shift the observed mass by predictable amounts. The protein molecular weight calculator does not include PTMs — add them manually if relevant.
- phosphorylation = +79.97 Da per phospho group
- methylation = +14.02 Da per methyl
- acetylation = +42.01 Da per acetyl
- ubiquitination = +8.6 kDa per ubiquitin
- SUMO modification = +11 kDa per SUMO
- N-glycosylation = +1 to +3 kDa per glycan
- disulfide bond = −2.02 Da per S-S bond
- initiator Met cleavage = −131.2 Da when removed
Using FASTA input correctly
The protein molecular weight calculator strips FASTA headers (lines starting with >), removes whitespace and non-letter characters, and converts to uppercase. Paste sequences directly from UniProt, NCBI, or any FASTA-format file. Multiple sequences in one input are concatenated — paste one at a time if you want individual MW for each.
Single-letter codes are standard (MKVLLVA...). Three-letter codes (Met-Lys-Val-Leu...) are not supported — most modern bioinformatics tools have moved to single-letter. Non-standard residues like selenocysteine (U) and pyrrolysine (O) are ignored as unknown and reported in the warning row.
Protein molecular weight pitfalls
Three common errors trip up new researchers. First, including the signal peptide or pro-domain in the sequence when the mature protein has it cleaved off. Check UniProt for the mature sequence before calculating. Second, forgetting that initiator methionine is removed in vivo for most proteins where the second residue is small. Third, comparing predicted MW from sequence to observed MW from a glycosylated cell-culture product — glycans add 5 to 20 percent to total mass.
Many secreted proteins have 15- to 30-residue signal peptides that are removed during secretion. Pasting the gene-translated sequence gives a MW 2 to 4 kDa higher than the mature secreted protein. UniProt lists both the propeptide and the mature sequence — pick the right one for your sample.
MW = Σ residue + 18 full sequenceMW ≈ 110 × n quick estimatekDa = Da / 1000 scaleµM = mg/mL × 1000 / kDa to molar