Repeat Annotation Request Form
The following form facilitates extraction of short lengths of repeat sequence annotation from commonly available genomes.
If you would like to download the raw annotations for the entire genome, *.out and *.align files can be found
here
.
Sequence Selection
Genome/Assembly:
Arabidopsis - Jun 2004 - araTha5
Cat - Mar 2006 - felCat3
Chicken - May 2006 - galGal3
Chimp - Jan 2006 - panTro2
Chimp - Nov 2003 - panTro1
Cow - Oct 2007 - bosTau4
Dog - May 2005 - canFam2
Drosophila - Apr 2006 - dm3
Elephant - May 2005 - loxAfr2
Horse - Sep 2007 - equCab2
Human - Feb 2009 - hg19
Human - Mar 2006 - hg18
Human - May 2004 - hg17
Mosquito - Feb 2003 - anoGam1
Mouse - July 2007 - mm9
Opossum - Jan 2006 - monDom4
Opossum - Oct 2006 - monDom5
Oranguatan - Jul 2007 - ponAbe2
Platypus - Jan 2006 - ornAna1
Rat - Jun 2003 - rn3
Rat - Nov 2004 - rn4
Rhesus - Jan 2006 - rheMac2
Rice - Jan 2007 - orySat5
Takifugu - Aug 2002 - fr1
Takifugu - Oct 2004 - fr2
X. Tropicalis - Aug 2005 - xenTro2
Zebrafinch - Jul 2008 - taeGut1
Zebrafish - Jul 2007 - danRer5
Zebrafish - Jun 2008 - danRer6
Zebrafish - May 2005 - danDer3
Select the genome and assembly from one of the options in the drop down box.
Range:
Ranges consist of three identifiers. A valid dna chromosome for the genome specified followed by a start and end position (inclusive). For example human chromosome 1 from position 10-1000 would be chr1:10-1000. Multiple ranges can be entered separated by a ";".
Result Type:
annotations
raw alignments
masked genomic sequence
fasta
Select the result type for the range. "annotations" returns RepeatMasker style table of repeat annotations. "raw alignments" returns the alignment file used to create the RepeatMasker annotations. "masked genomic sequence" returns fasta formatted data from the assembly with interspersed repeats masked. "fasta" returns each interspersed repeat instance sequence in fasta format.
Masking Format:
x
n
lower case
Specify the character to use for masking or use lower case to designate repetitive sequences.
Filtering
Score:
>=
Filter out all repeats which score below this threshold.
Divergence: <
%
Filter out all repeats with a higher divergence.
Repeat Classes:
snRNA
Other
ARTEFACT
rRNA
Interspersed Only
tRNA
LINE
scRNA
SINE
DNA
RNA
Low_complexity
Simple_repeat
All
Satellite
LTR
RC
Repeat classes you would like included in your results.
Repeat Name:
Search for a particular repeat name ie. "AluSx". Do not include the type information in your name ie. "AluSx#SINE/Alu". The classes filter should be set to "All" if you are using a name filter.
Institute for Systems Biology
This server is made possible by funding from the National Human Genome Research Institute (NHGRI grant # RO1 HG002939-01) 2003.