Columbia University, Bachelor of Science 2008–present (expected 2012)
Computer Science (Artificial Intelligence)
Columbia Center for Computational Learning Systems2010–present
Developed a parse fuzzification algorithm for syntactic preordering which resulted in a significant improvement in the quality of Arabic-English machine translation.
Columbia Natural Language Processing Group2009–present
Developed a framework for human-assisted part-of-speech tagging of a LiveJournal corpus using Amazon's Mechanical Turk, and implemented a phrase-level opinion classifier for unstructured weblog text. Supervised the creation of a corpus of agreement/disagreement annotations, and developed a system for automatic agreement/disagreement classification over pairs of posts in threaded discussions.
U.C. Berkeley CONCEPT Lab2007
Synthesized lead ferrite thin films via pulsed-laser deposition, in an attempt to create a novel room-temperature multiferroic ceramic. Characterized growth conditions for PbFe12O19 and bounded the possible growth conditions for PbFeO3.
Gamescrafters (U.C. Berkeley computational game theory group)2006–2007
Found strong solutions for small Hex boards, both in the standard configuration and in several novel variations.
"Fuzzy Syntactic Reordering for PSMT"
Jacob Andreas, Nizar Habash and Owen Rambow. In proceedings of the EMNLP Workshop on Machine Translation. July 29, 2011, Edinburgh, Scotland.
"Corpus Creation for New Genres: A Crowdsourced Approach to PP Attachment"
Mukund Jha, Jacob Andreas, Kapil Thadani, Sara Rosenthal and Kathleen McKeown. In proceedings of the NAACL Workshop on Creating Speech and Language Data with Mechanical Turk. June 6 2010, Los Angeles, California.
"Semi-Automated Annotation for Prepositional Phrase Attachment"
Sara Rosenthal, William J. Lipovsky, Kathleen McKeown, Kapil Thadani and Jacob Andreas. In proceedings of the 7th International Conference on Language Resources and Computation (LREC). May 2010, Valletta, Malta.
CRA Outstanding Undergraduate Researcher (finalist)2011
Google Intern Scholarship2011
C.P.D. Research Grant2011
EMNLP Travel Grant2011
TBΠ Society (early induction)2011
C.P.D. Summer Practicum Grant2009
C. Prescott Davis Scholar2008–2012
Robert C. Byrd Honors Scholarship2008–2012
National Merit Scholarship2008–2009
Governor's All-State Academic Award2008
Eagle Scout2008
Grand Prize and Intel Prize for Computer Science (SFBASF)2005
"The Optimum Aspect Ratio for Compressing Image Files of Text" (San Francisco Bay Area Science Fair).
Columbia Underground Listing of Professor Ability, Lead Developer2009–present
Developed, maintained and edited a popular professor- and course-review website at Columbia, featuring over 20,000 reviews.
Google, Software Engineering Intern2011
Investigated techniques for structured and unstructured data extraction from web pages for snippet generation. Made changes to the search indexer and result UI affecting millions of users daily.
Microsoft Live Labs, Software Engineering Intern2010
Developed several Windows Phone 7 applications using Live Labs technologies. Explored navigation metaphors and data visualization schemes for small screens, and prototyped a distributed framework for cross-site communication between web services.
Lost Tribe Applications, Founder2009
Co-founded an iPhone application development studio, and developed the database backend for Synagogues, a location-aware synagogue directory for the iPhone.
Lawrence Berkeley National Laboratories, Developer and Research Assistant2007–2009
Completely rewrote the Particle Data Group's online publication management system, used to distribute scientific publications to thousands of institutions annually.
Developed a natural language classifier for the Advanced Computing for Science group to identify socio-emotional content in chat logs collected from an LBNL astronomy project.
Created several components (including a guided tour recording and playback system) for AMELIA, a 3-D educational tool and visualization environment for the ATLAS detector of the Large Hadron Collider at CERN.
Fehr & Peers Transportation Associates, IT Intern2008
Developed web-based network monitoring tools for a major traffic engineering firm. Helped to develop several congestion and traffic simulations.
GNOME Do2008
Wrote the frst version of the Twitter plugin (now the core Microblogging plugin) and the del.icio.us plugin.
FIRST Robotics Team 395, Programming Coach2010–present
Taught Bronx public high school students to design and build a working robot for a national robotics competition.