Sorry! I forgot to remind you to put a > before the descriptor. That tells the program that the entry is FAST-A format with a descriptor followed by the amino acid sequence.
I tried the search in Prosite (
http://prosite.expasy.org/scanprosite/) with the > but there were no hits. That means your sequence SSGLQGDPGQTPTAEAVQVPPGPLGL was not recognized by Prosite as a protein motif or domain--just a part of the sequence of the human collagen protein:
collagen [Homo sapiens] 1,678 aa protein.
Chromosome: X.Map: Xq22.Sex: male/female.Clone_lib: whole eye and kidney.Note: clones MS[6,17,29,71,98] and TM[27,29,30,46,51,52,53].Accession: BAA04809.1GI: 466538
Next I tried looking for your sequence in the NCBI Conserved Domain Database (CDD):
http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi
This program did find some motifs but I don't know how important they are:
Conserved domains on [gi|466538|dbj|BAA04809.1|] collagen [Homo sapiens]
List of domain hits
Name Accession Description Interval E-value
C4 pfam01413 C-terminal tandem repeated domain in type 4 procollagen; Duplicated domain in C-terminus of ... 1454-1561 2.17e-62
C4 pfam01413 C-terminal tandem repeated domain in type 4 procollagen; Duplicated domain in C-terminus of ...1562-1676 3.00e-57
Collagen pfam01391 Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...893-952 4.78e-04
Collagen pfam01391 Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...1078-1135 5.84e-03
I remembered from biochem class that collagen has a triple helix structure--three molecules running parallel and cross-linked to each other:
http://www.ncbi.nlm.nih.gov/books/NBK21582/
That's what makes collage so strong. The linkages are between lysines on adjacent molecules. The usual motif is Gly Pro X or Gly X HyP where 'X' is any amino acid other than Gly or Pro. Glycine is a small aa that allows closer contact of the strands. That's why there is so much glycine in collagen. The prolines allow the strands to wind around each other in a helical form, sort of like DNA only with three strands instead of two.
Here's a pretty good description of collagen structure and synthesis:
https://en.wikipedia.org/wiki/Collagen
If you need help with the names of the amino acids, here's a key:
http://www.thelabrat.com/protocols/aminoacidtable.shtml
I hope this information helps your project. Let us know if you have more questions.
Sybee