- UNSW
- ...
- Our schools
- Mathematics & Statistics
- Engage with us
- Seminars
- 2014
- Improving motif finder via a two-tiered significance analysis
- Home
- Our school
- Study with us
- Our research
-
Student life & resources
- Undergraduate
- Honours year
- Postgraduate coursework
-
Postgraduate research
- Info for new students
- Current research students
- Postgraduate conference
- Postgraduate events
- Postgraduate student awards
- Michael Tallis PhD Research Travel Award
- Information about research theses
- Past research students
- Resources
- Entry requirements
- PhD projects
- Obtaining funding
- Application & fee information
-
Student services
- Help for postgraduate students
- Thesis guidelines
- School assessment policies
- Computing information
- Mathematics Drop-in Centre
- Consultation
- Statistics Consultation Service
- Academic advice
- Enrolment variation
- Changing tutorials
- Illness or misadventure
- Application form for existing casual tutors
- ARC grants Head of School sign off
- Computing facilities
- Choosing your major
- Student societies
- Student noticeboard
- Casual tutors
- Engage with us
- News & events
- Contact
- Home
- Our school
- Study with us
- Our research
-
Student life & resources
Postgraduate research
- Info for new students
- Current research students
- Postgraduate conference
- Postgraduate events
- Postgraduate student awards
- Michael Tallis PhD Research Travel Award
- Information about research theses
- Past research students
- Resources
- Entry requirements
- PhD projects
- Obtaining funding
- Application & fee information
Student services
- Help for postgraduate students
- Thesis guidelines
- School assessment policies
- Computing information
- Mathematics Drop-in Centre
- Consultation
- Statistics Consultation Service
- Academic advice
- Enrolment variation
- Changing tutorials
- Illness or misadventure
- Application form for existing casual tutors
- ARC grants Head of School sign off
- Computing facilities
- Choosing your major
- Engage with us
- News & events
- Contact
Abstract:
Regulatory proteins bind to certain sites of the DNA to regulate the transcription of a protein, thus characterising these binding sites is of a great interest to the bioinformatics community. Motif finding problem is the problem of finding these binding sites among a set of co-regulated sequences.
With over 9000 unique users recorded in the first half of 2013, MEME is one of the most popular motif finding tools available. Reliable estimates of the statistical significance of motifs can greatly increase the usefulness of any motif finder. Currently MEME evaluates its EM generated candidate motifs using an extension of BLAST's E-value to the motif finding context. While the drawbacks of MEME's current significance evaluation was pointed out previously, there was no practical substitute suited for its needs, especially since MEME also relies on the E-value internally to rank competing candidate motifs.
We offer a two-tiered significance analysis that can replace the E-value in selecting the best candidate motif and in evaluating its overall statistical significance. We show that our new approach substantially improve and would also provide the user with a reliable significance analysis. In addition, for large input sets our new approach is in fact faster than the currently implemented E-value analysis.