Much of the world's data is in the form of visual media. In order to utilize meaningful information from multimedia and deliver innovative products, such as
Google Photos, Google builds machine-learning systems that are designed to enable computer perception of visual input, in addition to pursuing image and video analysis techniques focused on image/scene reconstruction and understanding.
This week, Boston hosts the
2015 Conference on Computer Vision and Pattern Recognition (CVPR 2015), the premier annual computer vision event comprising the main CVPR conference and several co-located workshops and short courses. As a leader in
computer vision research, Google will have a strong presence at CVPR 2015, with many Googlers presenting publications in addition to hosting workshops and tutorials on topics covering image/video annotation and enhancement, 3D analysis and processing, development of semantic similarity measures for visual objects, synthesis of meaningful composites for visualization/browsing of large image/video collections and more.
Learn more about some of our research in the list below (Googlers highlighted in
blue). If you are attending CVPR this year, we hope you’ll stop by our booth and chat with our researchers about the projects and opportunities at Google that go into solving interesting problems for hundreds of millions of people. Members of the
Jump team will also have a prototype of the camera on display and will be showing videos produced using the Jump system on
Google Cardboard.
Tutorials:Applied Deep Learning for Computer Vision with TorchKoray Kavukcuoglu, Ronan Collobert, Soumith ChintalaDIY Deep Learning: a Hands-On Tutorial with CaffeEvan Shelhamer, Jeff Donahue, Yangqing Jia, Jonathan Long, Ross GirshickImageNet Large Scale Visual Recognition Challenge TutorialOlga Russakovsky, Jonathan Krause, Karen Simonyan, Yangqing Jia, Jia Deng, Alex Berg, Fei-Fei LiFast Image Processing With HalideJonathan Ragan-Kelley, Andrew Adams, Fredo DurandOpen Source Structure-from-MotionMatt Leotta, Sameer Agarwal, Frank Dellaert, Pierre Moulon, Vincent RabaudOral Sessions:Modeling Local and Global Deformations in Deep Learning: Epitomic Convolution, Multiple Instance Learning, and Sliding Window DetectionGeorge Papandreou, Iasonas Kokkinos, Pierre-André SavalleGoing Deeper with Convolutions Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, Andrew RabinovichDynamicFusion: Reconstruction and Tracking of Non-Rigid Scenes in Real-TimeRichard A. Newcombe, Dieter Fox, Steven M. SeitzShow and Tell: A Neural Image Caption Generator Oriol Vinyals, Alexander Toshev, Samy Bengio, Dumitru ErhanLong-Term Recurrent Convolutional Networks for Visual Recognition and Description Jeffrey Donahue, Lisa Anne Hendricks, Sergio Guadarrama, Marcus Rohrbach, Subhashini Venugopalan, Kate Saenko, Trevor DarrellVisual Vibrometry: Estimating Material Properties from Small Motion in Video Abe Davis, Katherine L. Bouman, Justin G. Chen, Michael Rubinstein, Frédo Durand, William T. FreemanFast Bilateral-Space Stereo for Synthetic Defocus Jonathan T. Barron, Andrew Adams, YiChang Shih, Carlos HernándezPoster Sessions:Learning Semantic Relationships for Better Action Retrieval in ImagesVignesh Ramanathan, Congcong Li, Jia Deng, Wei Han, Zhen Li, Kunlong Gu, Yang Song, Samy Bengio, Charles Rosenberg, Li Fei-FeiFaceNet: A Unified Embedding for Face Recognition and ClusteringFlorian Schroff, Dmitry Kalenichenko, James PhilbinA Mixed Bag of Emotions: Model, Predict, and Transfer Emotion DistributionsKuan-Chuan Peng, Tsuhan Chen, Amir Sadovnik, Andrew C. GallagherBest-Buddies Similarity for Robust Template MatchingTali Dekel, Shaul Oron, Michael Rubinstein, Shai Avidan, William T. FreemanArticulated Motion Discovery Using Pairs of TrajectoriesLuca Del Pero, Susanna Ricco, Rahul Sukthankar, Vittorio FerrariReflection Removal Using Ghosting CuesYiChang Shih, Dilip Krishnan, Frédo Durand, William T. FreemanP3.5P: Pose Estimation with Unknown Focal LengthChangchang WuMatchNet: Unifying Feature and Metric Learning for Patch-Based MatchingXufeng Han, Thomas Leung, Yangqing Jia, Rahul Sukthankar, Alexander C. BergInferring 3D Layout of Building Facades from a Single ImageJiyan Pan, Martial Hebert, Takeo KanadeThe Aperture Problem for Refractive MotionTianfan Xue, Hossein Mobahei, Frédo Durand, William T. FreemanVideo Magnification in Presence of Large MotionsMohamed Elgharib, Mohamed Hefeeda, Frédo Durand, William T. FreemanRobust Video Segment Proposals with Painless Occlusion HandlingZhengyang Wu, Fuxin Li, Rahul Sukthankar, James M. RehgOntological Supervision for Fine Grained Classification of Street View StorefrontsYair Movshovitz-Attias, Qian Yu, Martin C. Stumpe, Vinay Shet, Sacha Arnoud, Liron YatzivVIP: Finding Important People in ImagesClint Solomon Mathialagan, Andrew C. Gallagher, Dhruv BatraFusing Subcategory Probabilities for Texture ClassificationYang Song, Weidong Cai, Qing Li, Fan ZhangBeyond Short Snippets: Deep Networks for Video ClassificationJoe Yue-Hei Ng, Matthew Hausknecht, Sudheendra Vijayanarasimhan, Oriol Vinyals, Rajat Monga, George TodericiWorkshops:THUMOS Challenge 2015Program organizers include: Alexander Gorban, Rahul SukthankarDeepVision: Deep Learning in Computer Vision 2015Invited Speaker: Rahul SukthankarLarge Scale Visual Commerce (LSVisCom)Panelist: Luc VincentLarge-Scale Video Search and Mining (LSVSM)Invited Speaker and Panelist: Rahul SukthankarProgram Committee includes: Apostol NatsevVision meets Cognition: Functionality, Physics, Intentionality and CausalityProgram Organizers include: Peter BattagliaBig Data Meets Computer Vision: 3rd International Workshop on Large Scale Visual Recognition and Retrieval (BigVision 2015)Program Organizers include: Samy BengioIncludes speaker Christian Szegedy - “Scalable approaches for large scale vision”Observing and Understanding Hands in Action (Hands 2015)Program Committee includes: Murphy SteinFine-Grained Visual Categorization (FGVC3)Program Organizers include: Anelia AngelovaLarge-scale Scene Understanding Challenge (LSUN)Winners of the Scene Classification Challenge: Julian Ibarz, Christian Szegedy and Vincent VanhouckeWinners of the Caption Generation Challenge: Oriol Vinyals, Alexander Toshev, Samy Bengio, and Dumitru ErhanLooking from above: when Earth observation meets vision (EARTHVISION)Technical Committee includes: Andreas WendelComputer Vision in Vehicle Technology: Assisted Driving, Exploration Rovers, Aerial and Underwater VehiclesInvited Speaker: Andreas WendelProgram Committee includes: Andreas WendelWomen in Computer Vision (WiCV)Invited Speaker: Mei HanChaLearn Looking at People (
sponsor)
Fine-Grained Visual Categorization (FGVC3) (
sponsor)