Dina Bashkirova
I am a PhD candidate in the Image and Video Computing Group at Boston University advised by Professor Kate Saenko.
My research is focused on robust and semantically-consistent conditional image generation and generative domain alignment. I am also interested in label-efficient object localization and semantic segmentation, as well as applications of AI and Computer Vision for Good.
Prior to that, I received my BSc and MSc in Computer Science at Kazan Federal University.
dbash [at] bu [dot] edu  / 
CV  / 
short CV  / 
Google Scholar
|
|
Research
My primary interests include generative models, image translation and label-efficient learning.
My goal is to make generative models more expressive and controllable for artists of various skill levels.
To achieve this goal, I explore various ways to achieve fine-grained semantic control with label-efficient disentanglement and domain alignment.
In addition to my work on generative models, I am also deeply passionate about the applications of AI for the environment.
I am currently leading a project on the automation of industrial waste sorting with the hope to significantly reduce environmental impact and improve efficiency in the waste management process.
|
|
MaskSketch:
Unpaired Structure-guided Masked Image Generation
Dina Bashkirova, Jose Lezama,
Kihyuk Sohn, Kate Saenko,
Irfan Essa
CVPR (highlight) 2023
project page
In this paper, we introduce MaskSketch, an image generation method that allows spatial conditioning of the generation result using a guiding sketch as an extra conditioning signal during sampling.
Given an input sketch and its class label, MaskSketch samples realistic images that follow the given structure. MaskSketech works on sketches of various degrees of abstraction by leveraging a pre-trained masked image generators, while not requiring model finetuning or pairwise supervision.
|
|
VisDA 2022:
Visual Domain Adaptation Challenge for Industrial Waste Sorting
Dina Bashkirova, Samarth Mishra,
Diala Lteif,
Piotr Teterwak,
Donghyun Kim,
James Akl,
Fadi Alladkani,
Berk Calli,
Sarah Adel Bargal,
Kate Saenko
NeurIPS 2022
challenge page
We introduce a visual domain adaptation challenge for industrial waste sorting.
Large variability in visual appearance and composition of waste stream make generalization especially challenging for the automated waste sorting solutions.
In this competition, we propose to improve generalization of the waste detection models using SynthWaste, novel synthetic dataset for waste sorting.
|
|
Disentangled Unsupervised Image Translation via Restricted Information Flow
Ben Usman*, Dina Bashkirova*, Kate Saenko
WACV 2022
arxiv / bib
We propose a new many-to-many image translation method that infers which attributes are
domain-specific from data by constraining information flow through the network using translation honesty losses and a penalty on the capacity of the domain-specific embedding, and does not rely on hard-coded inductive architectural biases.
|
|
ZeroWaste Dataset: Towards Deformable Object Segmentation in Cluttered Scenes
Dina Bashkirova, Mohamed Abdelfattah, Ziliang Zhu, James Akl, Fadi Alladkani,
Ping Hu, Vitaly Ablavsky,
Berk Calli, Sarah Adel Bargal
Kate Saenko
CVPR 2022
arxiv / project page / bib / code
We present the first in-the-wild object segmentation dataset for industrial waste sorting for fully-, semi- and weakly-supervised setups.
Our ZeroWaste datasets presents a challenging computer vision task of semantic segmentation of extremely cluttered scenes with
highly deformable and translucent objects.
|
|
Evaluation of Correctness in Unsupervised Many-to-Many Image Translation
Dina Bashkirova, Ben Usman, Kate Saenko
WACV 2022
arxiv / github / bib
We propose an evaluation protocol for the disentanglement quality of unsupervised many-to-many image translation (UMI2I) methods.
We show that modern UMI2I methods fail to correctly disentangle the domain-specific from shared factors and mostly rely on their
corresponding inductive biases to determine which factors should be changed after translation.
|
|
VisDA 2021:
Universal Domain Adaptation to Improve Performance on Out-of-Distribution Data
Dina Bashkirova*(equal contribution),
Dan Hendrycks*,
Donghyun Kim*,
Samarth Mishra*,
Kate Saenko*,
Kuniaki Saito*,
Piotr Teterwak*,
Ben Usman*
NeurIPS 2021
challenge page
The fifth iteration of the Visual Domain Adaptation (VisDA) Challenge in which we target the universal domain adaptation problem.
|
|
Compositional Models: Multi-Task Learning and Knowledge Transfer with
Modular Networks
Andrey Zhmoginov, Mark Sandler, Dina Bashkirova.
arXiv 2020
arxiv
/ github
/ bib
We propose a novel modular model architecture that allows parameter sharing and reuse for various computer vision tasks,
such as multi-task learning, knowledge transfer and domain adaptation.
|
|
Adversarial Self-Defense for Cycle-Consistent GANs
Dina Bashkirova, Ben Usman, Kate Saenko
NeurIPS 2019
arxiv
/ github
/ project page
/ poster
/ proceedings
/ bib
We show that cycle-consistent models perform a self-adversarial attack by embedding low-amplitude structured noise into intermediate
generated images to reconstruct input images perfectly. We propose two techniques that prevent this kind of "cheating" and
show that defending against such self-adversarial attacks improves the translation quality.
|
|
Unsupervised video-to-video translation
Dina Bashkirova, Ben Usman, Kate Saenko
arXiv 2018
arxiv
/ github
/ volumetric data
/ bib
We propose a spatiotemporal extention of CycleGAN and show when it performs better then per-frame translation on
two novel unsupervised video-to-video translation benchmarks including a novel CT-to-MRI volumetric medical domain.
|
|
Fast L1 Gauss transforms for edge-aware image filtering
Dina Bashkirova, Shin Yoshizawa,
Roustam Latypov,
Hideo Yokota
Proceedings of ISP RAS 2017
arxiv
/ bib
We propose a novel approximation method for fast Gaussian convolution of two-dimensional
uniform point sets that involves L1 distance metric for Gaussian
function and domain splitting approach to achieve fast computation.
|
Service
Grader for CS480 Introduction to Computer Graphics in 2018, CS542 Machine Learning in 2020 and 2021.
Reviewer for NeurIPS 2018-2021, ICCV 2021, ICLR 2020, 2021, 2023, CVPR 2020, 2023, WACV 2020, 2021, NeurIPS DistShift Workshop 2021.
Co-organized the VisDA challenge at NeurIPS21 and NeurIPS22.
|
Hobbies and Interests
I enjoy reading, singing and playing music with my friends, drawing and handcrafting, weightlifting, playing boardgames and spending time with my dogs.
|
|