Konstantin Voevodski

Google

76 Ninth Avenue, 4th Floor

New York, NY 10011

E-mail: kvodski @ google . com

Research

My thesis presents algorithms for clustering large data sets in a limited information setting, and tools for locally exploring large networks.

Ph.D. Thesis: Clustering and Network Analysis with Biological Applications.

Advisors: Shang-Hua Teng, Yu Xia.

Education

Ph.D. in Computer Science, Boston University, 2011.

B.A. in Computer Science, Boston University, 2005.

Publications

Active Clustering of Biological Sequences.
With Maria-Florina Balcan, Heiko Roglin, Shang-Hua Teng, and Yu Xia.
Journal of Machine Learning Research, 2012, 13(Jan):203-225.

Min-Sum Clustering of Protein Sequences with Limited Distance Information.
With Maria-Florina Balcan, Heiko Roglin, Shang-Hua Teng, and Yu Xia.
In Proc. of the 1st International Workshop on Similarity-Based Pattern Analysis and Recognition, SIMBAD 2011 (Venice, Italy), pages 192-206, 2011.

Class Label Enhancement via Related Instances.
With Zornitsa Kozareva and Shang-Hua Teng.
In Proc. of the 2011 Conference on Empirical Methods in Natural Language Processing, EMNLP 2011 (Edinburgh, Scotland, UK).

Efficient Clustering with Limited Distance Information.
With Maria-Florina Balcan, Heiko Roglin, Shang-Hua Teng, and Yu Xia.
In Proc. of the 26th Conference on Uncertainty in Artificial Intelligence, UAI 2010 (Catalina Island, USA), pages 632-641, 2010.

Quantitative Gene Expression Profiles in Real Time From Expressed Sequence Tag Databases.
With Vincent Funari, Dimitry Leyfer, Laura Yerkes, Donald Cramer, and Dean Tolan.
Gene Expression, 2010, 14(6): 321-336.

Spectral Affinity in Protein Networks.
With Shang-Hua Teng and Yu Xia.
BMC Systems Biology, 2009, 3:112.

Finding Local Communities in Protein Networks.
With Shang-Hua Teng and Yu Xia.
BMC Bioinformatics, 2009, 10:297.

Manuscripts

Local Algorithms for Interactive Clustering.
With Pranjal Awasthi and Maria-Florina Balcan.

Non-Conservative Diffusion and its Application to Social Network Analysis.
With Rumi Ghosh, Kristina Lerman, Tawan Surachawala, and Shang-Hua Teng.

Tools to Locally Explore Networks

Local Protein Community Finder is an application that finds the local community of a queried node in a network.

Protein Network Neighbor Search is an application that finds the closest neighbors of a queried node in a network.

Open Source Code

Some open-source clustering code is available here.

Teaching

Instructor:

CS112 - Data Structures, Spring '08.

CS112 - Data Structures, Summer '07.

Teaching Assistant:

CS131 - Combinatorial Structures, Spring '11 Lab Homepage.

CS112 - Data Structures, Spring '10 Lab Homepage.

Miscellaneous

Protein sequence datasets.

Evaluating clustering properties of protein sequence datasets.