A. Protein Sequence

From NESG Wiki
Jump to navigation Jump to search

A1. Fasta format.

The first line shoul start with '>' symbol.

Next one or more lines contain sequence in 1-letter code.



Example_A1:
 
MDSKEVLVHVKNLEKNKSNDAAVLEILHVL
DKEFVPTE KLLRETKVGVE VNKFKKSTN
VEISKLVKKMISSWKDAIN   
 




A2. "Standard" format.

Each line contains name of one residue in 3-lettr code and, optionally, the residue ID.

(only residue ID of the first residue is used to start numbering of the sequence positions).



Example_A2.1 (position ID is not specified)
 
GLN
GLY
HIS
MET
PRO
GLY
ILE
ILE
TYR
GLU
GLY
LYS
GLY
THR
ASN
MET
GLU
....
 
Example_A2.2 (with specified all position ID):
 
GLN   -3
GLY   -2
HIS    -1
MET   0
PRO   1
GLY    2
ILE      3
ILE      4
TYR     5
.....
 
Example_A2.3 (with specified first position ID):
 
GLN   -3
GLY 
HIS
MET  
PRO 
GLY 
ILE 
ILE 
TYR 
.....