Vitae
EDUCATION
- Ph.D Computer Science, West Virginia University, USA, 2008
Graduate Study, Computer Science, McGill University, Canada, 2003
B.S. Computer Science, Northeastern University, China, 2002
PROFESSIONAL EXPERIENCES
- Software Engineer, Microsoft Live Search, Redmond, WA, July 2008 -
Work on Microsoft Live.com search division, chunk builder and word breaker products.
- Algorithm Intern, Applied Biosystems, Foster City, California, Summer 2007
Test and implement the novel Delta Delta Ct analysis algorithm. Produce a white paper and present the results of the relative quantity analysis.
- Research Assistant, West Virginia University, Morgantown, WV, Jan 2003 – May 2008
Design, implement and test news classification computer program. Extract related metrics of interests and classify news text with 10×10 ways cross validation approach. End-to-end engineer and test a suffix array based algorithm to index data and apply the proposed algorithm on data compression and data indexing. Develop and test computer program to measure data complexity. Propose and implement hyperspectral images compression algorithm.
PUBLICATIONS
- Donald Adjeroh and Fei Nan, “Direct Suffix-Sorting via Shannon-Fano-Elias Codes”, Theoretical Computer Science, 2008.
- Fei Nan, “Direct Suffix Sorting and Its Applications”, Doctorate Dissertation, West Virginia University, 2008.
- Donald Adjeroh and Fei Nan, “Suffix Sorting via Shannon-Fano-Elias Codes”, 2008 IEEE Data Compression Conference (DCC 2008), Snowbird, UT.
- Fei Nan, “Bioinformatic Study of gamma-Secretase and its Substrates”, Master Thesis, West Virginia University, 2008.
- Fei Nan and Donald Adjeroh, A Sort-Based Algorithm for Multiple Sequence Alignment. 2007 Computational Systems Bioinformatics Conference (CSB 2007), San Diego, California.
- Fei Nan and Donald Adjeroh. An Algorithm for Suffix Sorting and Its Applications. 2006 Computational Systems Bioinformatics Conference (CSB 2006), Stanford, California.
- Donald Adjeroh, Fei Nan. On Compressibility of Protein Sequences. 2006 IEEE Data Compression Conference (DCC 2006), Snowbird, Utah.
- Fei Nan, Donald Adjeroh. On Complexity Measures for Biological Sequences. 2004 IEEE Computational Systems Bioinformatics Conference (CSB 2004), Stanford, California.
INVITED TALKS
- Relative Quantity Error Analysis at Applied Biosystems on August 2007
- On Compressibility of Protein Sequences at DCC 2006
JOURNAL REVIEWER
- Journal of Theoretical Biology
MEMBERSHIPS AND CERTIFICATES
- SAS Certified Advance Programmer Credential
- Membership of Sigma Xi
- Membership of Upsilon Pi Epsilon (UPE)
- Membership of IEEE Computer Science
- Membership of Association for Computing Machinery (ACM)
- Membership of Society for Industrial and Applied Mathematics
- Membership of International Society for Computational Biology (ISCB)
- Distinguished Graduate Thesis
- President’s Fellowship of Northeastern University
CITATIONS
- Leila Pirhajia, Mehdi Kargar, Armita Sheari, Hadi Poormohammadi, Mehdi Sadeghi, Hamid Pezeshk, Changiz Eslahchi, “The performances of the chi-square test and complexity measures for signal recognition in biological sequences”, Journal of Theoretical Biology, November, 2007
- Minh Duc Cao Trevor I. Dix Lloyd Allison, Chris Mears, “A Simple Statistical Algorithm for Biological Sequence Compression”, Data Compression Conference 2007, March 2007
- Dario Benedetto, Emanuele Caglioti, Claudia Chica, “Compressing Proteomes: The Relevance of Medium Range Correlations”, EURASIP Journal on Bioinformatics and Systems Biology, September, 2007
- Chen Lin, Course COSI 175a: Data Compression and Multimedia Processing, Brandeis University, Fall 2006 semester
- Paul H. Siegel, “Advances in Information Recording”, AMS Bookstore, 2008, ISBN 0821837524
- Jie Lin, Yue Jiang, Don Adjeroh, “The Virtual Suffix Tree: An Efficient Data Structure for Suffix Trees and Suffix Arrays”, Proceedings of Prague Stringology Conference 2008, September 2008
- Donald Adjeroh, Timothy Bell, Amar Mukherjee, “The Burrows-Wheeler Transform”, Springer, 2008, ISBN 0387789081
- Leila Pirhaji, Mehdi Sadeghi, Mehdi Kargar, Armita Sheari, Hadi Poormohammadi, Hamid Pezeshk, Changiz Eslahchi, “Pattern Recognition using the Chi-square Test and Complexity Measures in Biological Sequences”, 5th National Biotechnology Congress, November 2007
- Guillaume Wisniewski, “Learning in structured spaces: Applications to the labeling of sequences and automatic processing documents”, Doctorate Dissertation, University of Paris VI - Pierre and Marie Curie, November 2007
- Amar Mukherjee, “ITR Collaborative Research: Compressed Search and Retrieval for Very Large Text and Image Repositories”, Annual Report, University of Central Florida, July, 2006


