Senior Research Scientist
Adobe Research
One Broadway, Cambridge MA 02142
vshin [at] adobe [dot] com
I am a senior research scientist at
Adobe Research.
I build systems that combine algorithms with novel user interactions made possible by those algorithms.
My primary focus is audiovisual media—especially video. I'm passionate about helping people create compelling video stories, communicate more effectively through video, and engage with video content in smarter, more meaningful ways.
I received my PhD in Computer Science from MIT, where I was advised by
Fredo Durand in the
Computer Graphics Lab.
Before joining MIT, I completed my B.S. in Computer Science at Princeton University,
working with Thomas Funkhouser.
![]() |
VideoDiff: Human-AI Video Co-Creation with Alternatives
CHI 2025
|
![]() |
Compositional Structures as Substrates for Human-AI Co-creation Environment: A Design Approach and A Case Study
CHI 2025
|
![]() |
SoundToons: Exemplar-Based Authoring of Interactive Audio-Driven Animation Sprites.
ACM IUI 2023
|
![]() |
Automated Conversion of Music Videos into Lyric Videos.
UIST 2023
|
![]() |
Beyond Subtitles Captioning and Visualizing Non-Speech Sounds in User Generated Videos.
ASSETS 2022
|
![]() |
CatchLive: Real-time Summarization of Live Streams with Stream Content and Interaction Data
CHI 2022
|
![]() |
Multi-level Correspondence via Graph Kernels for Editing Vector Graphics Designs
|
![]() |
Beyond Show of Hands: Engaging Viewers via Expressive and Scalable Visual Communication in Live Streaming
[paper]
CHI 2021
|
![]() |
Snapstream: Snapshot-based Interactions in Live Streaming for Visual Art
Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems (CHI 2020)
|
![]() |
Temporal Segmentation of Creative Live Streams
[pdf]
Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems (CHI 2020)
|
![]() |
Generating Audio-Visual Slideshows from Text Articles Using Word Concreteness
Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems (CHI 2020)
|
![]() |
Pose2Pose: Pose Selection and Transfer for 2D Character Animation
|
![]() |
B-Script: Transcript-based B-roll Video Editing with Recommendations
Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems (CHI 2019)
|
![]() |
DynamicSlide: Exploring the Design Space of Reference-based Interaction Techniques for Slide-based Lecture Videos
Proceedings of the 2018 Workshop on Multimedia for Accessible Human Computer Interface
|
![]() |
On Learning Associations of Faces and Voices
[pdf]
In Proceedings of Asian Conference on Computer Vision (ACCV 2018)
|
![]() |
Project Blink: Creating the Future of AI-Powered Video Editing
[blog post][video]
Project Blink is an AI-powered, web-based video editing app that transforms video editing. By leveraging the AI's media understanding capabilities, Project Blink allows users to edit by content rather than frame-by-frame. Users can search for words, images, people, and moments in a video, then cut and paste just as they would in a text document, streamlining the video editing workflow. |
Adobe Research offers an exceptional internship program for graduate students. Here are a few research interns I have been fortunate to work with.
If you are a PhD student and interested in a research internship, please send me an email with your CV and a summary of your research interests.