Résumé / Curriculum Vitæ

Education

Columbia University, Bachelor of Science 2008–present (expected 2012)

Computer Science (Artificial Intelligence)

Research Experience

Columbia Center for Computational Learning Systems 2010–present

Developed a parse fuzzification algorithm for syntactic preordering that significantly improved the quality of Arabic-English machine translation.

Columbia Natural Language Processing Group 2009–present

Developed a framework for human-assisted part-of-speech tagging of a LiveJournal corpus using Amazon's Mechanical Turk, and implemented a phrase-level opinion classifier for unstructured weblog text. Supervised the creation of a corpus of agreement/disagreement annotations, and developed a system for automatic agreement/disagreement classification over pairs of posts in threaded discussions.

U.C. Berkeley CONCEPT Lab 2007

Synthesized lead ferrite thin films via pulsed-laser deposition, in an attempt to create a novel room-temperature multiferroic ceramic. Characterized growth conditions for PbFe12O19 and bounded the possible growth conditions for PbFeO3.

Gamescrafters (U.C. Berkeley computational game theory group) 2006–2007

Found strong solutions for small Hex boards, both in the standard configuration and in several novel variations.

Publications

"Fuzzy Syntactic Reordering for PSMT"

Jacob Andreas, Nizar Habash and Owen Rambow. In proceedings of the EMNLP Workshop on Machine Translation. July 29, 2011, Edinburgh, Scotland.

"Corpus Creation for New Genres: A Crowdsourced Approach to PP Attachment"

Mukund Jha, Jacob Andreas, Kapil Thadani, Sara Rosenthal and Kathleen McKeown. In proceedings of the NAACL Workshop on Creating Speech and Language Data with Mechanical Turk. June 6, 2010, Los Angeles, California.

"Semi-Automated Annotation for Prepositional Phrase Attachment"

Sara Rosenthal, William J. Lipovsky, Kathleen McKeown, Kapil Thadani and Jacob Andreas. In proceedings of the 7th International Conference on Language Resources and Evaluation (LREC). May 2010, Valletta, Malta.

Awards and Grants

CRA Outstanding Undergraduate Researcher (finalist) 2011

Google Intern Scholarship 2011

C.P.D. Research Grant 2011

EMNLP Travel Grant 2011

TBΠ Society (early induction) 2011

C.P.D. Summer Practicum Grant 2009

C. Prescott Davis Scholar 2008–2012

Robert C. Byrd Honors Scholarship 2008–2012

National Merit Scholarship 2008–2009

Governor's All-State Academic Award 2008

Eagle Scout 2008

Grand Prize and Intel Prize for Computer Science (SFBASF) 2005

"The Optimum Aspect Ratio for Compressing Image Files of Text" (San Francisco Bay Area Science Fair).

Professional Experience

Columbia Underground Listing of Professor Ability, Lead Developer 2009–present

Developed, maintained and edited a popular professor- and course-review website at Columbia, featuring over 20,000 reviews.

Google, Software Engineering Intern 2011

Investigated techniques for structured and unstructured data extraction from web pages for snippet generation. Made changes to the search indexer and result UI affecting millions of users daily.

Microsoft Live Labs, Software Engineering Intern 2010

Developed several Windows Phone 7 applications using Live Labs technologies. Explored navigation metaphors and data visualization schemes for small screens, and prototyped a distributed framework for cross-site communication between web services.

Lost Tribe Applications, Founder 2009

Co-founded an iPhone application development studio, and developed the database backend for Synagogues, a location-aware synagogue directory for the iPhone.

Lawrence Berkeley National Laboratory, Developer and Research Assistant 2007–2009

Completely rewrote the Particle Data Group's online publication management system, used to distribute scientific publications to thousands of institutions annually.

Developed a natural language classifier for the Advanced Computing for Science group to identify socio-emotional content in chat logs collected from an LBNL astronomy project.

Created several components (including a guided tour recording and playback system) for AMELIA, a 3-D educational tool and visualization environment for the ATLAS detector of the Large Hadron Collider at CERN.

Fehr & Peers Transportation Associates, IT Intern 2008

Developed web-based network monitoring tools for a major traffic engineering firm. Helped to develop several congestion and traffic simulations.

Open Source Contributions

GNOME Do 2008

Wrote the first version of the Twitter plugin (now the core Microblogging plugin) and the del.icio.us plugin.

Community Service

FIRST Robotics Team 395, Programming Coach 2010–present

Taught Bronx public high school students to design and build a working robot for a national robotics competition.