Most Influential ICCV Papers (2024-09)
The International Conference on Computer Vision (ICCV) is one of the top computer vision conferences in the world. Paper Digest Team analyzes all papers published on ICCV in the past years, and presents the 15 most influential papers for each year. This ranking list is automatically constructed based upon citations from both research papers and granted patents, and will be frequently updated to reflect the most recent changes. To find the latest version of this list or the most influential papers from other conferences/journals, please visit Best Paper Digest page. Note: the most influential papers may or may not include the papers that won the best paper awards. (Version: 2024-09)
To search or review papers within ICCV related to a specific topic, please use the search by venue (ICCV) and review by venue (ICCV) services. To browse the most productive ICCV authors by year ranked by #papers accepted, here are the most productive ICCV authors grouped by year.
This list is created by the Paper Digest Team. Experience the cutting-edge capabilities of Paper Digest, an innovative AI-powered research platform that empowers you to write, review, get answers and more.
Paper Digest Team
New York City, New York, 10017
team@paperdigest.org
TABLE 1: Most Influential ICCV Papers (2024-09)
Year | Rank | Paper | Author(s) |
---|---|---|---|
2023 | 1 | Segment Anything IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce the Segment Anything (SA) project: a new task, model, and dataset for image segmentation. |
ALEXANDER KIRILLOV et. al. |
2023 | 2 | Adding Conditional Control to Text-to-Image Diffusion Models IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present ControlNet, a neural network architecture to add spatial conditioning controls to large, pretrained text-to-image diffusion models. |
Lvmin Zhang; Anyi Rao; Maneesh Agrawala; |
2023 | 3 | Scalable Diffusion Models with Transformers IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We train latent diffusion models of images, replacing the commonly-used U-Net backbone with a transformer that operates on latent patches. |
William Peebles; Saining Xie; |
2023 | 4 | Zero-1-to-3: Zero-shot One Image to 3D Object IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce Zero-1-to-3, a framework for changing the camera viewpoint of an object given just a single RGB image. |
RUOSHI LIU et. al. |
2023 | 5 | Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose a new T2V generation setting–One-Shot Video Tuning, where only one text-video pair is presented. |
JAY ZHANGJIE WU et. al. |
2023 | 6 | Fantasia3D: Disentangling Geometry and Appearance for High-quality Text-to-3D Content Creation IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose a new method of Fantasia3D for high-quality text-to-3D content creation. |
Rui Chen; Yongwei Chen; Ningxin Jiao; Kui Jia; |
2023 | 7 | Text2Video-Zero: Text-to-Image Diffusion Models Are Zero-Shot Video Generators IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Recent text-to-video generation approaches rely on computationally heavy training and require large-scale video datasets. In this paper, we introduce a new task, zero-shot text-to-video generation, and propose a low-cost approach (without any training or optimization) by leveraging the power of existing text-to-image synthesis methods (e.g., Stable Diffusion), making them suitable for the video domain. |
LEVON KHACHATRYAN et. al. |
2023 | 8 | Structure and Content-Guided Video Synthesis with Diffusion Models IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we present a structure and content-guided video diffusion model that edits videos based on descriptions of the desired output. |
Patrick Esser; Johnathan Chiu; Parmida Atighehchian; Jonathan Granskog; Anastasis Germanidis; |
2023 | 9 | Sigmoid Loss for Language Image Pre-Training IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a simple pairwise sigmoid loss for image-text pre-training. |
Xiaohua Zhai; Basil Mustafa; Alexander Kolesnikov; Lucas Beyer; |
2023 | 10 | DiffusionDet: Diffusion Model for Object Detection IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose DiffusionDet, a new framework that formulates object detection as a denoising diffusion process from noisy boxes to object boxes. |
Shoufa Chen; Peize Sun; Yibing Song; Ping Luo; |
2023 | 11 | Zip-NeRF: Anti-Aliased Grid-Based Neural Radiance Fields IF:5 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We show how ideas from rendering and signal processing can be used to construct a technique that combines mip-NeRF 360 and grid-based models such as Instant NGP to yield error rates that are 8%-77% lower than either prior technique, and that trains 24x faster than mip-NeRF 360. |
Jonathan T. Barron; Ben Mildenhall; Dor Verbin; Pratul P. Srinivasan; Peter Hedman; |
2023 | 12 | PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images IF:5 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose PETRv2, a unified framework for 3D perception from multi-view images. |
YINGFEI LIU et. al. |
2023 | 13 | Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions IF:5 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a method for editing NeRF scenes with text-instructions. |
Ayaan Haque; Matthew Tancik; Alexei A. Efros; Aleksander Holynski; Angjoo Kanazawa; |
2023 | 14 | LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models IF:5 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose a novel method, LLM-Planner, that harnesses the power of large language models to do few-shot planning for embodied agents. |
CHAN HEE SONG et. al. |
2023 | 15 | Make-It-3D: High-fidelity 3D Creation from A Single Image with Diffusion Prior IF:5 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we investigate the problem of creating high-fidelity 3D content from only a single image. |
JUNSHU TANG et. al. |
2021 | 1 | Swin Transformer: Hierarchical Vision Transformer Using Shifted Windows IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents a new vision Transformer, called Swin Transformer, that capably serves as a general-purpose backbone for computer vision. |
ZE LIU et. al. |
2021 | 2 | Emerging Properties in Self-Supervised Vision Transformers IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we question if self-supervised learning provides new properties to Vision Transformer (ViT) that stand out compared to convolutional networks (convnets). |
MATHILDE CARON et. al. |
2021 | 3 | Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction Without Convolutions IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Unlike the recently-proposed Vision Transformer (ViT) that was designed for image classification specifically, we introduce the Pyramid Vision Transformer (PVT), which overcomes the difficulties of porting Transformer to various dense prediction tasks. |
WENHAI WANG et. al. |
2021 | 4 | ViViT: A Video Vision Transformer IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present pure-transformer based models for video classification, drawing upon the recent success of such models in image classification. |
ANURAG ARNAB et. al. |
2021 | 5 | Tokens-to-Token ViT: Training Vision Transformers From Scratch on ImageNet IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To overcome such limitations, we propose a new Tokens-To-Token Vision Transformer (T2T-ViT), which incorporates 1) a layer-wise Tokens-to-Token (T2T) transformation to progressively structurize the image to tokens by recursively aggregating neighboring Tokens into one Token (Tokens-to-Token), such that local structure represented by surrounding tokens can be modeled and tokens length can be reduced; 2) an efficient backbone with a deep-narrow structure for vision transformer motivated by CNN architecture design after empirical study. |
LI YUAN et. al. |
2021 | 6 | CvT: Introducing Convolutions to Vision Transformers IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present in this paper a new architecture, named Convolutional vision Transformer (CvT), that improves Vision Transformer (ViT) in performance and efficiency by introducing convolutions into ViT to yield the best of both designs. |
HAIPING WU et. al. |
2021 | 7 | An Empirical Study of Training Self-Supervised Vision Transformers IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we go back to basics and investigate the effects of several fundamental components for training self-supervised ViT. |
Xinlei Chen; Saining Xie; Kaiming He; |
2021 | 8 | Mip-NeRF: A Multiscale Representation for Anti-Aliasing Neural Radiance Fields IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Our solution, which we call mip-NeRF (a la mipmap), extends NeRF to represent the scene at a continuously-valued scale. |
JONATHAN T. BARRON et. al. |
2021 | 9 | The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We find that using larger models and artificial data augmentations can improve robustness on real-world distribution shifts, contrary to claims in prior work. |
DAN HENDRYCKS et. al. |
2021 | 10 | Vision Transformers for Dense Prediction IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce dense prediction transformers, an architecture that leverages vision transformers in place of convolutional networks as a backbone for dense prediction tasks. |
Rene Ranftl; Alexey Bochkovskiy; Vladlen Koltun; |
2021 | 11 | Segmenter: Transformer for Semantic Segmentation IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we introduce Segmenter, a transformer model for semantic segmentation. |
Robin Strudel; Ricardo Garcia; Ivan Laptev; Cordelia Schmid; |
2021 | 12 | CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Inspired by this, in this paper, we study how to learn multi-scale feature representations in transformer models for image classification. |
Chun-Fu (Richard) Chen; Quanfu Fan; Rameswar Panda; |
2021 | 13 | StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, weexplore leveraging the power of recently introduced Con-trastive Language-Image Pre-training (CLIP) models in or-der to develop a text-based interface for StyleGAN imagemanipulation that does not require such manual effort. |
Or Patashnik; Zongze Wu; Eli Shechtman; Daniel Cohen-Or; Dani Lischinski; |
2021 | 14 | Multiscale Vision Transformers IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present Multiscale Vision Transformers (MViT) for video and image recognition, by connecting the seminal idea of multiscale feature hierarchies with transformer models. |
HAOQI FAN et. al. |
2021 | 15 | Nerfies: Deformable Neural Radiance Fields IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present the first method capable of photorealistically reconstructing deformable scenes using photos/videos captured casually from mobile phones. |
KEUNHONG PARK et. al. |
2019 | 1 | Searching for MobileNetV3 IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present the next generation of MobileNets based on a combination of complementary search techniques as well as a novel architecture design. |
ANDREW HOWARD et. al. |
2019 | 2 | FCOS: Fully Convolutional One-Stage Object Detection IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a fully convolutional one-stage object detector (FCOS) to solve object detection in a per-pixel prediction fashion, analogue to semantic segmentation. |
Zhi Tian; Chunhua Shen; Hao Chen; Tong He; |
2019 | 3 | CutMix: Regularization Strategy to Train Strong Classifiers With Localizable Features IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We therefore propose the CutMix augmentation strategy: patches are cut and pasted among training images where the ground truth labels are also mixed proportionally to the area of the patches. |
SANGDOO YUN et. al. |
2019 | 4 | SlowFast Networks for Video Recognition IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present SlowFast networks for video recognition. |
Christoph Feichtenhofer; Haoqi Fan; Jitendra Malik; Kaiming He; |
2019 | 5 | CenterNet: Keypoint Triplets for Object Detection IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents an efficient solution that explores the visual patterns within individual cropped regions with minimal costs. |
KAIWEN DUAN et. al. |
2019 | 6 | CCNet: Criss-Cross Attention for Semantic Segmentation IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose a Criss-Cross Network (CCNet) for obtaining such contextual information in a more effective and efficient way. |
ZILONG HUANG et. al. |
2019 | 7 | KPConv: Flexible and Deformable Convolution for Point Clouds IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present Kernel Point Convolution (KPConv), a new design of point convolution, i.e. that operates on point clouds without any intermediate representation. |
HUGUES THOMAS et. al. |
2019 | 8 | Digging Into Self-Supervised Monocular Depth Estimation IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a set of improvements, which together result in both quantitatively and qualitatively improved depth maps compared to competing self-supervised methods. |
Clement Godard; Oisin Mac Aodha; Michael Firman; Gabriel J. Brostow; |
2019 | 9 | FaceForensics++: Learning to Detect Manipulated Facial Images IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To standardize the evaluation of detection methods, we propose an automated benchmark for facial manipulation detection. |
ANDREAS ROSSLER et. al. |
2019 | 10 | Free-Form Image Inpainting With Gated Convolution IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a generative image inpainting system to complete images with free-form mask and guidance. |
JIAHUI YU et. al. |
2019 | 11 | Moment Matching for Multi-Source Domain Adaptation IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We make three major contributions towards addressing this problem. First, we collect and annotate by far the largest UDA dataset, called DomainNet, which contains six domains and about 0.6 million images distributed among 345 categories, addressing the gap in data availability for multi-source UDA research. Second, we propose a new deep learning approach, Moment Matching for Multi-Source Domain Adaptation (M3SDA), which aims to transfer knowledge learned from multiple labeled source domains to an unlabeled target domain by dynamically aligning moments of their feature distributions. |
XINGCHAO PENG et. al. |
2019 | 12 | SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we introduce a large dataset to propel research on laser-based semantic segmentation. |
JENS BEHLEY et. al. |
2019 | 13 | TSM: Temporal Shift Module for Efficient Video Understanding IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a generic and effective Temporal Shift Module (TSM) that enjoys both high efficiency and high performance. |
Ji Lin; Chuang Gan; Song Han; |
2019 | 14 | YOLACT: Real-Time Instance Segmentation IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a simple, fully-convolutional model for real-time instance segmentation that achieves 29.8 mAP on MS COCO at 33.5 fps evaluated on a single Titan Xp, which is significantly faster than any previous competitive approach. |
Daniel Bolya; Chong Zhou; Fanyi Xiao; Yong Jae Lee; |
2019 | 15 | DeepGCNs: Can GCNs Go As Deep As CNNs? IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we present new ways to successfully train very deep GCNs. |
Guohao Li; Matthias Muller; Ali Thabet; Bernard Ghanem; |
2017 | 1 | Mask R-CNN IF:10 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a conceptually simple, flexible, and general framework for object instance segmentation. |
Kaiming He; Georgia Gkioxari; Piotr Dollar; Ross Girshick; |
2017 | 2 | Focal Loss For Dense Object Detection IF:10 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we investigate why this is the case. |
Tsung-Yi Lin; Priya Goyal; Ross Girshick; Kaiming He; Piotr Dollar; |
2017 | 3 | Grad-CAM: Visual Explanations From Deep Networks Via Gradient-Based Localization IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a technique for producing ‘visual explanations’ for decisions from a large class of Convolutional Neural Network (CNN)-based models, making them more transparent. |
RAMPRASAATH R. SELVARAJU et. al. |
2017 | 4 | Unpaired Image-To-Image Translation Using Cycle-Consistent Adversarial Networks IF:10 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present an approach for learning to translate an image from a source domain X to a target domain Y in the absence of paired examples. |
Jun-Yan Zhu; Taesung Park; Phillip Isola; Alexei A. Efros; |
2017 | 5 | Deformable Convolutional Networks IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we introduce two new modules to enhance the transformation modeling capacity of CNNs, namely, deformable convolution and deformable RoI pooling. |
JIFENG DAI et. al. |
2017 | 6 | Least Squares Generative Adversarial Networks IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To overcome such a problem, we propose in this paper the Least Squares Generative Adversarial Networks (LSGANs) which adopt the least squares loss function for the discriminator. |
XUDONG MAO et. al. |
2017 | 7 | Arbitrary Style Transfer In Real-Time With Adaptive Instance Normalization IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we present a simple yet effective approach that for the first time enables arbitrary style transfer in real-time. |
Xun Huang; Serge Belongie; |
2017 | 8 | StackGAN: Text To Photo-Realistic Image Synthesis With Stacked Generative Adversarial Networks IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose Stacked Generative Adversarial Networks (StackGAN) to generate 256×256 photo-realistic images conditioned on text descriptions. |
HAN ZHANG et. al. |
2017 | 9 | Channel Pruning For Accelerating Very Deep Neural Networks IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we introduce a new channel pruning method to accelerate very deep convolutional neural networks.Given a trained CNN model, we propose an iterative two-step algorithm to effectively prune each layer, by a LASSO regression based channel selection and least square reconstruction. |
Yihui He; Xiangyu Zhang; Jian Sun; |
2017 | 10 | Learning Efficient Convolutional Networks Through Network Slimming IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a novel learning scheme for CNNs to simultaneously 1) reduce the model size; 2) decrease the run-time memory footprint; and 3) lower the number of computing operations, without compromising accuracy. |
ZHUANG LIU et. al. |
2017 | 11 | Revisiting Unreasonable Effectiveness Of Data In Deep Learning Era IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper takes a step towards clearing the clouds of mystery surrounding the relationship between `enormous data’ and visual deep learning. |
Chen Sun; Abhinav Shrivastava; Saurabh Singh; Abhinav Gupta; |
2017 | 12 | DualGAN: Unsupervised Dual Learning For Image-To-Image Translation IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Inspired by dual learning from natural language translation, we develop a novel mechanism, which enables image translators to be trained from two sets of images from two domains. |
Zili Yi; Hao Zhang; Ping Tan; Minglun Gong; |
2017 | 13 | Unlabeled Samples Generated By GAN Improve The Person Re-Identification Baseline In Vitro IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The main contribution of this paper is a simple semi-supervised pipeline that only uses the original training set without collecting extra data. |
Zhedong Zheng; Liang Zheng; Yi Yang; |
2017 | 14 | ThiNet: A Filter Level Pruning Method For Deep Neural Network Compression IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose an efficient and unified framework, namely ThiNet, to simultaneously accelerate and compress CNN models in both training and inference stages. |
Jian-Hao Luo; Jianxin Wu; Weiyao Lin; |
2017 | 15 | Learning Spatio-Temporal Representation With Pseudo-3D Residual Networks IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we devise multiple variants of bottleneck building blocks in a residual learning framework by simulating 3*3*3 convolutions with 1*3*3 convolutional filters on spatial domain (equivalent to 2D CNN) plus 3*1*1 convolutions to construct temporal connections on adjacent feature maps in time. |
Zhaofan Qiu; Ting Yao; Tao Mei; |
2015 | 1 | Fast R-CNN IF:10 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper proposes a Fast Region-based Convolutional Network method (Fast R-CNN) for object detection. |
Ross Girshick; |
2015 | 2 | Delving Deep Into Rectifiers: Surpassing Human-Level Performance On ImageNet Classification IF:10 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we study rectifier neural networks for image classification from two aspects. |
Kaiming He; Xiangyu Zhang; Shaoqing Ren; Jian Sun; |
2015 | 3 | Learning Spatiotemporal Features With 3D Convolutional Networks IF:10 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a simple, yet effective approach for spatiotemporal feature learning using deep 3-dimensional convolutional networks (3D ConvNets) trained on a large scale supervised video dataset. |
Du Tran; Lubomir Bourdev; Rob Fergus; Lorenzo Torresani; Manohar Paluri; |
2015 | 4 | Deep Learning Face Attributes In The Wild IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a novel deep learning framework for attribute prediction in the wild. |
Ziwei Liu; Ping Luo; Xiaogang Wang; Xiaoou Tang; |
2015 | 5 | VQA: Visual Question Answering IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose the task of free-form and open-ended Visual Question Answering (VQA). We provide a dataset containing 0.25M images, 0.76M questions, and 10M answers (www.visualqa.org), and discuss the information it provides. |
STANISLAW ANTOL et. al. |
2015 | 6 | Learning Deconvolution Network For Semantic Segmentation IF:10 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a novel semantic segmentation algorithm by learning a deep deconvolution network. |
Hyeonwoo Noh; Seunghoon Hong; Bohyung Han; |
2015 | 7 | FlowNet: Learning Optical Flow With Convolutional Networks IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we construct CNNs which are capable of solving the optical flow estimation problem as a supervised learning task. Since existing ground truth data sets are not sufficiently large to train a CNN, we generate a large synthetic Flying Chairs dataset. |
ALEXEY DOSOVITSKIY et. al. |
2015 | 8 | Scalable Person Re-Identification: A Benchmark IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: As a minor contribution, inspired by recent advances in large-scale image search, this paper proposes an unsupervised Bag-of-Words descriptor. |
LIANG ZHENG et. al. |
2015 | 9 | Holistically-Nested Edge Detection IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We develop a new edge detection algorithm that addresses two critical issues in this long-standing vision problem: (1) holistic image training; and (2) multi-scale feature learning. |
Saining Xie; Zhuowen Tu; |
2015 | 10 | Multi-View Convolutional Neural Networks For 3D Shape Recognition IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We address this question in the context of learning to recognize 3D shapes from a collection of their rendered views on 2D images. |
Hang Su; Subhransu Maji; Evangelos Kalogerakis; Erik Learned-Miller; |
2015 | 11 | Unsupervised Visual Representation Learning By Context Prediction IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This work explores the use of spatial context as a source of free and plentiful supervisory signal for training a rich visual representation. |
Carl Doersch; Abhinav Gupta; Alexei A. Efros; |
2015 | 12 | Predicting Depth, Surface Normals And Semantic Labels With A Common Multi-Scale Convolutional Architecture IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we address three different computer vision tasks using a single basic architecture: depth prediction, surface normal estimation, and semantic labeling. |
David Eigen; Rob Fergus; |
2015 | 13 | Conditional Random Fields As Recurrent Neural Networks IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To solve this problem, we introduce a new form of convolutional neural network that combines the strengths of Convolutional Neural Networks (CNNs) and Conditional Random Fields (CRFs)-based probabilistic graphical modelling. |
SHUAI ZHENG et. al. |
2015 | 14 | Aligning Books And Movies: Towards Story-Like Visual Explanations By Watching Movies And Reading Books IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To align movies and books we propose a neural sentence embedding that is trained in an unsupervised way from a large corpus of books, as well as a video-text neural embedding for computing similarities between movie clips and sentences in the book. |
YUKUN ZHU et. al. |
2015 | 15 | PoseNet: A Convolutional Network For Real-Time 6-DOF Camera Relocalization IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a robust and real-time monocular six degree of freedom relocalization system. |
Alex Kendall; Matthew Grimes; Roberto Cipolla; |
2013 | 1 | Action Recognition With Improved Trajectories IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper improves their performance by taking into account camera motion to correct them. |
Heng Wang; Cordelia Schmid; |
2013 | 2 | Transfer Feature Learning With Joint Distribution Adaptation IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we put forward a novel transfer learning approach, referred to as Joint Distribution Adaptation (JDA). |
Mingsheng Long; Jianmin Wang; Guiguang Ding; Jiaguang Sun; Philip S. Yu; |
2013 | 3 | Unsupervised Visual Domain Adaptation Using Subspace Alignment IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we introduce a new domain adaptation (DA) algorithm where the source and target domains are represented by subspaces described by eigenvectors. |
Basura Fernando; Amaury Habrard; Marc Sebban; Tinne Tuytelaars; |
2013 | 4 | Anchored Neighborhood Regression For Fast Example-Based Super-Resolution IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper proposes fast super-resolution methods while making no compromise on quality. |
Radu Timofte; Vincent De Smet; Luc Van Gool; |
2013 | 5 | DeepFlow: Large Displacement Optical Flow With Deep Matching IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a descriptor matching algorithm, tailored to the optical flow problem, that allows to boost performance on fast motions. |
Philippe Weinzaepfel; Jerome Revaud; Zaid Harchaoui; Cordelia Schmid; |
2013 | 6 | Abnormal Event Detection At 150 FPS In MATLAB IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Based on inherent redundancy of video structures, we propose an efficient sparse combination learning framework. |
Cewu Lu; Jianping Shi; Jiaya Jia; |
2013 | 7 | Structured Forests For Fast Edge Detection IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we take advantage of the structure present in local image patches to learn both an accurate and computationally efficient edge detector. |
Piotr Dollar; C. L. Zitnick; |
2013 | 8 | Efficient Image Dehazing With Boundary Constraint And Contextual Regularization IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose an efficient regularization method to remove hazes from a single input image. |
Gaofeng Meng; Ying Wang; Jiangyong Duan; Shiming Xiang; Chunhong Pan; |
2013 | 9 | Towards Understanding Action Recognition IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We evaluate current methods using this dataset and systematically replace the output of various algorithms with ground truth. |
Hueihan Jhuang; Juergen Gall; Silvia Zuffi; Cordelia Schmid; Michael J. Black; |
2013 | 10 | Robust Face Landmark Estimation Under Occlusion IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a novel method, called Robust Cascaded Pose Regression (RCPR) which reduces exposure to outliers by detecting occlusions explicitly and using robust shape-indexed features. |
Xavier P. Burgos-Artizzu; Pietro Perona; Piotr Dollar; |
2013 | 11 | SUN3D: A Database Of Big Spaces Reconstructed Using SfM And Object Labels IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we introduce SUN3D, a large-scale RGB-D video database with camera pose and object labels, capturing the full 3D extent of many places. |
Jianxiong Xiao; Andrew Owens; Antonio Torralba; |
2013 | 12 | Saliency Detection Via Dense And Sparse Reconstruction IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a visual saliency detection algorithm from the perspective of reconstruction errors. |
Xiaohui Li; Huchuan Lu; Lihe Zhang; Xiang Ruan; Ming-Hsuan Yang; |
2013 | 13 | Joint Deep Learning For Pedestrian Detection IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper proposes that they should be jointly learned in order to maximize their strengths through cooperation. |
Wanli Ouyang; Xiaogang Wang; |
2013 | 14 | Depth From Combining Defocus And Correspondence Using Light-Field Cameras IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we present a novel simple and principled algorithm that computes dense depth estimation by combining both defocus and correspondence depth cues. |
Michael W. Tao; Sunil Hadap; Jitendra Malik; Ravi Ramamoorthi; |
2013 | 15 | Fast Object Segmentation In Unconstrained Video IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a technique for separating foreground objects from the background in a video. |
Anestis Papazoglou; Vittorio Ferrari; |
2011 | 1 | ORB: An Efficient Alternative To SIFT Or SURF IF:10 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a very fast binary descriptor based on BRIEF, called ORB, which is rotation invariant and resistant to noise. |
E. Rublee; V. Rabaud; K. Konolige and G. Bradski; |
2011 | 2 | HMDB: A Large Video Database For Human Motion Recognition IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To address this issue we collected the largest action video database to-date with 51 action categories, which in total contain around 7,000 manually annotated clips extracted from a variety of sources ranging from digitized movies to YouTube. |
H. Kuehne; H. Jhuang; E. Garrote; T. Poggio and T. Serre; |
2011 | 3 | BRISK: Binary Robust Invariant Scalable Keypoints IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we propose BRISK, a novel method for keypoint detection, description and matching. |
S. Leutenegger; M. Chli and R. Y. Siegwart; |
2011 | 4 | DTAM: Dense Tracking And Mapping In Real-time IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We use the hundreds of images available in a video stream to improve the quality of a simple photometric data term, and minimise a global spatially regularised energy functional in a novel non-convex optimisation framework. |
R. A. Newcombe; S. J. Lovegrove and A. J. Davison; |
2011 | 5 | Sparse Representation Or Collaborative Representation: Which Helps Face Recognition? IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Consequently, we propose a very simple yet much more efficient face classification scheme, namely CR based classification with regularized least square (CRC_RLS). |
L. Zhang; M. Yang and Xiangchu Feng; |
2011 | 6 | Struck: Structured Output Tracking With Kernels IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we present a framework for adaptive visual object tracking based on structured output prediction. |
S. Hare; A. Saffari and P. H. S. Torr; |
2011 | 7 | Semantic Contours From Inverse Detectors IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: For this purpose, we present a simple yet effective method for combining generic object detectors with bottom-up contours to identify object contours. In order to study the problem and evaluate quantitatively our approach, we present a dataset of semantic exterior boundaries on more than 20, 000 object instances belonging to 20 categories, using the images from the VOC2011 PASCAL challenge [7]. |
B. Hariharan; P. Arbel�ez; L. Bourdev; S. Maji and J. Malik; |
2011 | 8 | From Learning Models Of Natural Image Patches To Whole Image Restoration IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work we answer these questions. |
D. Zoran and Y. Weiss; |
2011 | 9 | Adaptive Deconvolutional Networks For Mid And High Level Feature Learning IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a hierarchical model that learns image decompositions via alternating layers of convolutional sparse coding and max pooling. |
M. D. Zeiler; G. W. Taylor and R. Fergus; |
2011 | 10 | End-to-end Scene Text Recognition IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper focuses on the problem of word detection and recognition in natural images. |
Kai Wang; B. Babenko and S. Belongie; |
2011 | 11 | Domain Adaptation For Object Recognition: An Unsupervised Approach IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we present one of the first studies on unsupervised domain adaptation in the context of object recognition, where we have labeled data only from the source domain (and therefore do not have correspondences between object categories across domains). |
R. Gopalan; Ruonan Li and R. Chellappa; |
2011 | 12 | Relative Attributes IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose to model relative attributes. |
D. Parikh and K. Grauman; |
2011 | 13 | Fisher Discrimination Dictionary Learning For Sparse Representation IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents a novel dictionary learning (DL) method to improve the pattern classification performance. |
M. Yang; L. Zhang; X. Feng and D. Zhang; |
2011 | 14 | Ensemble Of Exemplar-SVMs For Object Detection And Beyond IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper proposes a conceptually simple but surprisingly powerful method which combines the effectiveness of a discriminative object detector with the explicit correspondence offered by a nearest-neighbor approach. |
T. Malisiewicz; A. Gupta and A. A. Efros; |
2011 | 15 | Segmentation As Selective Search For Object Recognition IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Therefore, we adapt segmentation as a selective search by reconsidering segmentation: We propose to generate many approximate locations over few and precise object delineations because (1) an object whose location is never generated can not be recognised and (2) appearance and immediate nearby context are most effective for object recognition. |
K. E. A. van de Sande; J. R. R. Uijlings; T. Gevers and A. W. M. Smeulders; |
2009 | 1 | What Is The Best Multi-stage Architecture For Object Recognition? IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper addresses three questions: 1. |
K. Jarrett; K. Kavukcuoglu; M. Ranzato and Y. LeCun; |
2009 | 2 | Building Rome In A Day IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a system that can match and reconstruct 3D scenes from extremely large collections of photographs such as those found by searching for a given city (e.g., Rome) on Internet photo sharing sites. |
S. Agarwal; N. Snavely; I. Simon; S. M. Seitz and R. Szeliski; |
2009 | 3 | Learning To Predict Where Humans Look IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To address this problem, we collected eye tracking data of 15 viewers on 1003 images and use this database as training and testing examples to learn a model of saliency based on low, middle and high-level image features. |
T. Judd; K. Ehinger; F. Durand and A. Torralba; |
2009 | 4 | Super-resolution From A Single Image IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we propose a unified framework for combining these two families of methods. |
D. Glasner; S. Bagon and M. Irani; |
2009 | 5 | Non-local Sparse Models For Image Restoration IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose in this paper to unify two different approaches to image restoration: On the one hand, learning a basis set (dictionary) adapted to sparse signal descriptions has proven to be very effective in image reconstruction and classification tasks. |
J. Mairal; F. Bach; J. Ponce; G. Sapiro and A. Zisserman; |
2009 | 6 | An HOG-LBP Human Detector With Partial Occlusion Handling IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: By combining Histograms of Oriented Gradients (HOG) and Local Binary Pattern (LBP) as the feature set, we propose a novel human detection approach capable of handling partial occlusion. |
X. Wang; T. X. Han and S. Yan; |
2009 | 7 | Tensor Completion For Estimating Missing Values In Visual Data IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we propose an algorithm to estimate missing values in tensors of visual data. |
Ji Liu; P. Musialski; P. Wonka and Jieping Ye; |
2009 | 8 | Attribute And Simile Classifiers For Face Verification IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present two novel methods for face verification. For further testing across pose, illumination, and expression, we introduce a new data set – termed PubFig – of real-world images of public figures (celebrities and politicians) acquired from the internet. |
N. Kumar; A. C. Berg; P. N. Belhumeur and S. K. Nayar; |
2009 | 9 | You’ll Never Walk Alone: Modeling Social Behavior For Multi-target Tracking IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we introduce a model of dynamic social behavior, inspired by models developed for crowd simulation. |
S. Pellegrini; A. Ess; K. Schindler and L. van Gool; |
2009 | 10 | Fast Visibility Restoration From A Single Color Or Gray Level Image IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce a novel algorithm and variants for visibility restoration from a single image. |
J. Tarel and N. Hauti�re; |
2009 | 11 | Poselets: Body Part Detectors Trained Using 3D Human Pose Annotations IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We address the classic problems of detection, segmentation and pose estimation of people in images with a novel definition of a part, a poselet. To permit this we have built a new dataset, H3D, of annotations of humans in 2D photographs with 3D joint information, inferred using anthropometric constraints. |
L. Bourdev and J. Malik; |
2009 | 12 | Kernelized Locality-sensitive Hashing For Scalable Image Search IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Recent work has explored ways to embed high-dimensional features or complex distance functions into a low-dimensional Hamming space where items can be efficiently searched. |
B. Kulis and K. Grauman; |
2009 | 13 | On Feature Combination For Multiclass Object Classification IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we study several models that aim at learning the correct weighting of different features from training data. |
P. Gehler and S. Nowozin; |
2009 | 14 | Fast And Robust Earth Mover’s Distances IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a new algorithm for a robust family of Earth Mover’s Distances – EMDs with thresholded ground distances. |
O. Pele and M. Werman; |
2009 | 15 | Is That You? Metric Learning Approaches For Face Identification IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we present two methods for learning robust distance measures: (a) a logistic discriminant approach which learns the metric from a set of labelled image pairs (LDML) and (b) a nearest neighbour approach which computes the probability for two images to belong to the same class (MkNN). |
M. Guillaumin; J. Verbeek and C. Schmid; |
2007 | 1 | A Database And Evaluation Methodology For Optical Flow IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Our goal is to establish a new set of benchmarks and evaluation methods for the next generation of optical flow algorithms. |
S. Baker; S. Roth; D. Scharstein; M. J. Black; J. P. Lewis and R. Szeliski; |
2007 | 2 | Image Classification Using Random Forests And Ferns IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We explore the problem of classifying images by the object categories they contain in the case of a large number of object categories. |
A. Bosch; A. Zisserman and X. Munoz; |
2007 | 3 | Probabilistic Linear Discriminant Analysis For Inferences About Identity IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we present a novel algorithm designed for these conditions. |
S. J. D. Prince and J. H. Elder; |
2007 | 4 | Total Recall: Automatic Query Expansion With A Generative Feature Model For Object Retrieval IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we bring query expansion into the visual domain via two novel contributions. |
O. Chum; J. Philbin; J. Sivic; M. Isard and A. Zisserman; |
2007 | 5 | Multi-View Stereo For Community Photo Collections IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a multi-view stereo algorithm that addresses the extreme changes in lighting, scale, clutter, and other effects in large online community photo collections. |
M. Goesele; N. Snavely; B. Curless; H. Hoppe and S. M. Seitz; |
2007 | 6 | What, Where And Who? Classifying Events By Scene And Object Recognition IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we use a number of sport games such as snow boarding, rock climbing or badminton to demonstrate event classification. We have assembled a highly challenging database of 8 widely varied sport events. |
L. Li and Li Fei-Fei; |
2007 | 7 | A Biologically Inspired System For Action Recognition IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a biologically-motivated system for the recognition of actions from video sequences. |
H. Jhuang; T. Serre; L. Wolf and T. Poggio; |
2007 | 8 | Objects In Context IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work we propose to incorporate semantic object context as a post-processing step into any off-the-shelf object categorization model. |
A. Rabinovich; A. Vedaldi; C. Galleguillos; E. Wiewiora and S. Belongie; |
2007 | 9 | Semi-supervised Discriminant Analysis IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a novel method, called Semi- supervised Discriminant Analysis (SDA), which makes use of both labeled and unlabeled samples. |
D. Cai; X. He and J. Han; |
2007 | 10 | Eyeblink-based Anti-Spoofing In Face Recognition From A Generic Webcamera IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a real-time liveness detection approach against photograph spoofing in face recognition, by recognizing spontaneous eyeblinks, which is a non-intrusive manner. |
G. Pan; L. Sun; Z. Wu and S. Lao; |
2007 | 11 | Learning The Discriminative Power-Invariance Trade-Off IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Our focus, in this paper, is on learning the optimal tradeoff for classification given a particular training set and prior constraints. |
M. Varma and D. Ray; |
2007 | 12 | Learning 3-D Scene Structure From A Single Still Image IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Our goal is to create 3D models which are both quantitatively accurate as well as visually pleasing. |
A. Saxena; M. Sun and A. Y. Ng; |
2007 | 13 | A Geodesic Framework For Fast Interactive Image And Video Segmentation And Matting IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: An interactive framework for soft segmentation and matting of natural images and videos is presented in this paper. |
X. Bai and G. Sapiro; |
2007 | 14 | Non-homogeneous Content-driven Video-retargeting IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: An efficient algorithm for video retargeting is introduced. |
L. Wolf; M. Guttmann and D. Cohen-Or; |
2007 | 15 | Shape And Appearance Context Modeling IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work we develop appearance models for computing the similarity between image regions containing deformable objects of a given class in realtime. |
X. Wang; G. Doretto; T. Sebastian; J. Rittscher and P. Tu; |
2005 | 1 | Actions As Space-time Shapes IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We adopt a recent approach by Gorelick et al. (2004) for analyzing 2D shapes and generalize it to deal with volumetric space-time action shapes. |
M. Blank; L. Gorelick; E. Shechtman; M. Irani and R. Basri; |
2005 | 2 | The Pyramid Match Kernel: Discriminative Classification With Sets Of Image Features IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a new fast kernel function which maps unordered feature sets to multi-resolution histograms and computes a weighted histogram intersection in this space. |
K. Grauman and T. Darrell; |
2005 | 3 | Neighborhood Preserving Embedding IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a novel subspace learning algorithm called neighborhood preserving embedding (NPE). |
Xiaofei He; Deng Cai; Shuicheng Yan and Hong-Jiang Zhang; |
2005 | 4 | Fusing Points And Lines For High Performance Tracking IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In particular, we present a method for integrating the two systems and robustly combining the pose estimates they produce. |
E. Rosten and T. Drummond; |
2005 | 5 | A Spectral Technique For Correspondence Problems Using Pairwise Constraints IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present an efficient spectral method for finding consistent correspondences between two sets of features. |
M. Leordeanu and M. Hebert; |
2005 | 6 | Discovering Objects And Their Location In Images IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Here we treat object categories as topics, so that an image containing instances of several categories is modeled as a mixture of topics. |
J. Sivic; B. C. Russell; A. A. Efros; A. Zisserman and W. T. Freeman; |
2005 | 7 | Local Gabor Binary Pattern Histogram Sequence (LGBPHS): A Novel Non-statistical Model For Face Representation And Recognition IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper proposes a novel non-statistics based face representation approach, local Gabor binary pattern histogram sequence (LGBPHS), in which training procedure is unnecessary to construct the face model, so that the generalizability problem is naturally avoided. |
Wenchao Zhang; Shiguang Shan; Wen Gao; Xilin Chen and Hongming Zhang; |
2005 | 8 | Object Categorization By Learned Universal Visual Dictionary IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents a new algorithm for the automatic recognition of object classes from images (categorization). |
J. Winn; A. Criminisi and T. Minka; |
2005 | 9 | Detection Of Multiple, Partially Occluded Humans In A Single Image By Bayesian Combination Of Edgelet Part Detectors IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper proposes a method for human detection in crowded scene from static images. |
Bo Wu and R. Nevatia; |
2005 | 10 | Creating Efficient Codebooks For Visual Recognition IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We describe a scalable acceptance-radius based clusterer that generates better codebooks and study its performance on several image classification tasks. |
F. Jurie and B. Triggs; |
2005 | 11 | Learning Object Categories From Google’s Image Search IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present an approach that can learn an object category from just its name, by utilizing the raw output of image search engines available on the Internet. |
R. Fergus; L. Fei-Fei; P. Perona and A. Zisserman; |
2005 | 12 | Geometric Context From A Single Image IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We provide a multiple-hypothesis framework for robustly estimating scene structure from a single image and obtaining confidences for each geometric label. |
D. Hoiem; A. A. Efros and M. Hebert; |
2005 | 13 | Detecting Irregularities In Images And In Video IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We address the problem of detecting irregularities in visual data, e.g., detecting suspicious behaviors in video sequences, or identifying salient patterns in images. We pose the problem of determining the validity of visual data as a process of constructing a puzzle: We try to compose a new observed image region or a new video segment (the query) using chunks of data (pieces of puzzle) extracted from previous visual examples (the database ). |
O. Boiman and M. Irani; |
2005 | 14 | Efficient Visual Event Detection Using Volumetric Features IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper studies the use of volumetric features as an alternative to popular local descriptor approaches for event detection in video sequences. |
Yan Ke; R. Sukthankar and M. Hebert; |
2005 | 15 | Evaluation Of Features Detectors And Descriptors Based On 3D Objects IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To this end we design a method, based on intersecting epipolar constraints, for providing ground truth correspondence automatically. We collect a database of 100 objects viewed from 144 calibrated viewpoints under three different lighting conditions. |
P. Moreels and P. Perona; |
2003 | 1 | Video Google: A Text Retrieval Approach To Object Matching In Videos IF:10 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We describe an approach to object and scene retrieval which searches for and localizes all the occurrences of a user outlined object in a video. |
Sivic and Zisserman; |
2003 | 2 | Detecting Pedestrians Using Patterns Of Motion And Appearance IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Novel contributions of this paper include: i) development of a representation of image motion which is extremely efficient, and ii) implementation of a state of the art pedestrian detection system which operates on low resolution images under difficult conditions (such as rain and snow). |
Jones and Snow; |
2003 | 3 | Real-time Simultaneous Localisation And Mapping With A Single Camera IF:10 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a top-down Bayesian framework for single-camera localisation via mapping of a sparse set of natural features using motion modelling and an information-guided active measurement strategy, in particular addressing the difficult issue of real-time feature initialisation via a factored sampling approach. |
|
2003 | 4 | Learning A Classification Model For Segmentation IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a two-class classification model for grouping. |
Ren and Malik; |
2003 | 5 | On-line Selection Of Discriminative Tracking Features IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a method for evaluating multiple feature spaces while tracking, and for adjusting the set of features used to improve tracking performance. |
Collins and Liu; |
2003 | 6 | Recognizing Action At A Distance IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Our goal is to recognize human action at a distance, at resolutions where a whole person may be, say, 30 pixels tall. |
Mori and Malik; |
2003 | 7 | Recognising Panoramas IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The problem considered in this paper is the fully automatic construction of panoramas. |
Brown and Lowe; |
2003 | 8 | Multiclass Spectral Clustering IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a principled account on multiclass spectral clustering. |
Yu and Shi; |
2003 | 9 | Context-based Vision System For Place And Object Recognition IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a context-based vision system for place and object recognition. |
Freeman and Rubin; |
2003 | 10 | Fast Pose Estimation With Parameter-sensitive Hashing IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce a new algorithm that learns a set of hashing functions that efficiently index examples relevant to a particular estimation task. |
Viola and Darrell; |
2003 | 11 | Preemptive RANSAC For Live Structure And Motion Estimation IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: A system capable of performing robust live ego-motion estimation for perspective cameras is presented. The system is powered by random sample consensus with preemptive scoring of … |
|
2003 | 12 | Image Parsing: Unifying Segmentation, Detection, And Recognition IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a general framework for parsing images into regions and objects. |
Zhuowen Tu; Xiangrong Chen; Yuille and Zhu; |
2003 | 13 | Computing Geodesics And Minimal Surfaces Via Graph Cuts IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce a new segmentation method combining some of their benefits. |
Boykov and Kolmogorov; |
2003 | 14 | A Bayesian Network Framework For Relational Shape Matching IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The new Bethe free energy approach is used to estimate the pairwise correspondences between links of the template graphs and the data. |
Coughlan and Yuille; |
2003 | 15 | Natural Image Statistics For Natural Image Segmentation IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Building on recent progress in modeling filter response statistics of natural images we integrate a statistical model into a variational framework for image segmentation. |
Heiler and Schnorr; |
2001 | 1 | Robust Real-time Face Detection IF:10 Related Papers Related Patents Related Grants Related Venues Related Experts View |
P. Viola and M. Jones; |
2001 | 2 | A Database Of Human Segmented Natural Images And Its Application To Evaluating Segmentation Algorithms And Measuring Ecological Statistics IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents a database containing ‘ground truth’ segmentations produced by humans for images of a wide variety of natural scenes. |
D. Martin; C. Fowlkes; D. Tal and J. Malik; |
2001 | 3 | Interactive Graph Cuts For Optimal Boundary & Region Segmentation Of Objects In N-D Images IF:10 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we describe a new technique for general purpose interactive segmentation of N-dimensional images. |
Y. Y. Boykov and M. -. Jolly; |
2001 | 4 | Lambertian Reflectance And Linear Subspaces IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We prove that the set of all reflectance functions (the mapping from surface normals to intensities) produced by Lambertian objects under distant, isotropic lighting lies close to a 9D linear subspace. |
R. Basri and D. Jacobs; |
2001 | 5 | Indexing Based On Scale Invariant Interest Points IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents a new method for detecting scale invariant interest points. |
K. Mikolajczyk and C. Schmid; |
2001 | 6 | Computing Visual Correspondence With Occlusions Using Graph Cuts IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we present a new method which properly addresses occlusions, while preserving the advantages of graph cut algorithms. |
V. Kolmogorov and R. Zabih; |
2001 | 7 | Dynamic Textures IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a novel characterization of dynamic textures that poses the problems of modelling, learning, recognizing and synthesizing dynamic textures on a firm analytical footing. |
S. Soatto; G. Doretto and Ying Nian Wu; |
2001 | 8 | BraMBLe: A Bayesian Multiple-blob Tracker IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents two theoretical advances which address this limitation and lead to a robust multiple-person tracking system suitable for single-camera real-time surveillance applications. |
M. Isard and J. MacCormick; |
2001 | 9 | Deriving Intrinsic Images From Image Sequences IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We focus on a slightly, easier problem: given a sequence of T images where the reflectance is constant and the illumination changes, can we recover T illumination images and a single reflectance image? |
Y. Weiss; |
2001 | 10 | Learning The Semantics Of Words And Pictures IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a statistical model for organizing image collections which integrates semantic information provided by associate text and visual information provided by image features. |
K. Barnard and D. Forsyth; |
2001 | 11 | The Earth Mover’s Distance Is The Mallows Distance: Some Insights From Statistics IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We discuss the advantages and disadvantages of both distances, and statistical issues involved in computing them from data. |
E. Levina and P. Bickel; |
2001 | 12 | Face Recognition With Support Vector Machines: Global Versus Component-based Approach IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a component-based method and two global methods for face recognition and evaluate them with respect to robustness against pose changes. |
B. Heisele; P. Ho and T. Poggio; |
2001 | 13 | The Variable Bandwidth Mean Shift And Data-driven Scale Selection IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present two solutions for the scale selection problem in computer vision. |
D. Comaniciu; V. Ramesh and P. Meer; |
2001 | 14 | Flux Maximizing Geometric Flows IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Several geometric active contour models have been proposed for segmentation in computer vision. |
A. Vasilevskiy and K. Siddiqi; |
2001 | 15 | Matching Shapes IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a novel approach to measuring similarity between shapes and exploit it for object recognition. |
S. Belongie; J. Malik and J. Puzicha; |
1999 | 1 | Object Recognition From Local Scale-invariant Features IF:10 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Final verification of each match is achieved by finding a low residual least squares solution for the unknown model parameters. |
D. G. Lowe; |
1999 | 2 | Fast Approximate Energy Minimization Via Graph Cuts IF:10 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we address the problem of minimizing a large class of energy functions that occur in early vision. |
Y. Boykov; O. Veksler and R. Zabih; |
1999 | 3 | Texture Synthesis By Non-parametric Sampling IF:10 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: A non-parametric method for texture synthesis is proposed. |
A. A. Efros and T. K. Leung; |
1999 | 4 | Flexible Camera Calibration By Viewing A Plane From Unknown Orientations IF:10 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Proposes a flexible new technique to easily calibrate a camera. |
Zhengyou Zhang; |
1999 | 5 | Wallflower: Principles And Practice Of Background Maintenance IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We compare our system with 8 other background subtraction algorithms. |
K. Toyama; J. Krumm; B. Brumitt and B. Meyers; |
1999 | 6 | Learning Low-level Vision IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We show a learning-based method for low-level vision problems-estimating scenes from images. |
W. T. Freeman and E. C. Pasztor; |
1999 | 7 | A Theory Of Shape By Space Carving IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we consider the problem of computing the 3D shape of an unknown, arbitrarily-shaped scene from multiple photographs taken at known but arbitrarily-distributed viewpoints. |
K. N. Kutulakos and S. M. Seitz; |
1999 | 8 | Mean Shift Analysis And Applications IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: A nonparametric estimator of density gradient, the mean shift, is employed in the joint, spatial-range (value) domain of gray level and color images for discontinuity preserving … |
D. Comaniciu and P. Meer; |
1999 | 9 | Segmentation Using Eigenvectors: A Unifying View IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we give a unified treatment of these algorithms, and show the close connections between them while highlighting their distinguishing features. |
Y. Weiss; |
1999 | 10 | Single View Metrology IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We describe how 3D affine measurements may be computed from a single perspective view of a scene given only minimal geometric information determined from the image. |
A. Criminisi; I. Reid and A. Zisserman; |
1999 | 11 | Vision In Bad Weather IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Based on this observation, we develop models and methods for recovering pertinent scene properties, such as three-dimensional structure, from images taken under poor weather conditions. |
S. K. Nayar and S. G. Narasimhan; |
1999 | 12 | Real-time Object Detection For smart Vehicles IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents an efficient shape-based object detection method based on Distance Transforms and describes its use for real-time vision on-board vehicles. |
D. M. Gavrila and V. Philomin; |
1999 | 13 | Empirical Evaluation Of Dissimilarity Measures For Color And Texture IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper empirically compares nine image dissimilarity measures that are based on distributions of color and texture features summarizing over 1,000 CPU hours of computational experiments. |
J. Puzicha; J. M. Buhmann; Y. Rubner and C. Tomasi; |
1999 | 14 | Three-dimensional Scene Flow IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a framework for the computation of dense, non-rigid scene flow from optical flow. |
S. Vedula; S. Baker; P. Rander; R. Collins and T. Kanade; |
1999 | 15 | A Probabilistic Exclusion Principle For Tracking Multiple Objects IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Another important contribution of the paper is the presentation of partitioned sampling, a new sampling method for multiple object tracking. |
J. MacCormick and A. Blake; |
1998 | 1 | Bilateral Filtering For Gray And Color Images IF:10 Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Bilateral filtering smooths images while preserving edges, by means of a nonlinear combination of nearby image values. The method is noniterative, local, and simple. It combines … |
C. Tomasi and R. Manduchi; |
1998 | 2 | A Metric For Distributions With Applications To Image Databases IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we focus on applications to image databases, especially color and texture. |
Y. Rubner; C. Tomasi and L. J. Guibas; |
1998 | 3 | A General Framework For Object Detection IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents a general trainable framework for object detection in static images of cluttered scenes. |
C. P. Papageorgiou; M. Oren and T. Poggio; |
1998 | 4 | Shock Graphs And Shape Matching IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce a novel tree matching algorithm which finds the best set of corresponding nodes between two shock trees in polynomial time. |
K. Siddiqi; A. Shokoufandeh; S. J. Dickenson and S. W. Zucker; |
1998 | 5 | Depth Discontinuities By Pixel-to-pixel Stereo IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: An algorithm to detect depth discontinuities from a stereo pair of images is presented. |
S. Birchfield and C. Tomasi; |
1998 | 6 | A Maximum-flow Formulation Of The N-camera Stereo Correspondence Problem IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper describes a new algorithm for solving the N-camera stereo correspondence problem by transforming it into a maximum-flow problem. |
S. Roy and I. J. Cox; |
1998 | 7 | Color- And Texture-based Image Segmentation Using EM And Its Application To Content-based Image Retrieval IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we present a new image representation which provides a transformation from the raw pixel data to a small set of image regions which are coherent in color and texture space. |
S. Belongie; C. Carson; H. Greenspan and J. Malik; |
1998 | 8 | Parameterized Modeling And Recognition Of Activities IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: A framework for modeling and recognition of temporal activities is proposed. |
Y. Yacoob and M. J. Black; |
1998 | 9 | Motion Segmentation And Tracking Using Normalized Cuts IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a motion segmentation algorithm that aims to break a scene into its most prominent moving groups. |
Jianbo Shi and J. Malik; |
1998 | 10 | A Theory Of Catadioptric Image Formation IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we derive the complete class of single-lens single-mirror catadioptric sensors which have a single viewpoint and an expression for the spatial resolution of a catadioptric sensor in terms of the resolution of the camera used to construct it. |
S. Baker and S. K. Nayar; |
1998 | 11 | Thresholding For Change Detection IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We describe four different methods for selecting thresholds that work on very different principles. |
P. Rosin; |
1998 | 12 | A Mixed-state Condensation Tracker With Automatic Model-switching IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents a significant development of random sampling methods to allow automatic switching between multiple motion models as a natural extension of the tracking process. |
M. Isard and A. Blake; |
1998 | 13 | Spatial Color Indexing And Applications IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We suggest the use of the color correlogram as a generic indexing tool to tackle various computer vision problems. |
Jing Huang; S. R. Kumar; M. Mitra and Wei-Jing Zhu; |
1998 | 14 | ASL Recognition Based On A Coupling Between HMMs And 3D Motion Analysis IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a framework for recognizing isolated and continuous American Sign Language (ASL) sentences from three-dimensional data. |
C. Vogler and D. Metaxas; |
1998 | 15 | Wide Baseline Stereo Matching IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The objective of this work is to enlarge the class of camera motions for which epipolar geometry and image correspondences can be computed automatically. |
P. Pritchett and A. Zisserman; |
1995 | 1 | Geodesic Active Contours IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Previous models of geometric active contours are improved as showed by a number of examples. |
V. Caselles; R. Kimmel and G. Sapiro; |
1995 | 2 | Alignment By Maximization Of Mutual Information IF:10 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: As applied in this paper, the technique is intensity-based, rather than feature-based. |
P. Viola and W. M. Wells; |
1995 | 3 | In Defence Of The 8-point Algorithm IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The fundamental matrix is a basic tool in the analysis of scenes taken with two uncalibrated cameras, and the 8 point algorithm is a frequently cited method for computing the … |
R. I. Hartley; |
1995 | 4 | Gradient Flows And Geometric Active Contour Models IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we analyze the geometric active contour models discussed previously from a curve evolution point of view and propose some modifications based on gradient flows relative to certain new feature-based Riemannian metrics. |
S. Kichenassamy; A. Kumar; P. Olver; A. Tannenbaum and A. Yezzi; |
1995 | 5 | Estimating The Tensor Of Curvature Of A Surface From A Polyhedral Approximation IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We describe a method to estimate the tensor of curvature of a surface at the vertices of a polyhedral approximation. |
G. Taubin; |
1995 | 6 | Tracking And Recognizing Rigid And Non-rigid Facial Motions Using Local Parametric Models Of Image Motion IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper explores the use of local parametrized models of image motion for recovering and recognizing the non-rigid and articulated motion of human faces. |
M. J. Black and Y. Yacoob; |
1995 | 7 | Model-based Tracking Of Self-occluding Articulated Objects IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We describe a framework for local trading of self occluding motion, in which one part of an object obstructs the visibility of another. |
J. M. Rehg and T. Kanade; |
1995 | 8 | Probabilistic Visual Learning For Object Detection IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present an unsupervised technique for visual learning which is based on density estimation in high-dimensional spaces using an eigenspace decomposition. |
B. Moghaddam and A. Pentland; |
1995 | 9 | Curve And Surface Smoothing Without Shrinkage IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we introduce a new method for smoothing piecewise linear shapes of arbitrary dimension and topology. |
G. Taubin; |
1995 | 10 | Face Recognition From One Example View IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We develop example-based techniques for applying the rotation seen in the prototypes to essentially rotate the single real view which is available. |
D. Beymer and T. Poggio; |
1995 | 11 | Topologically Adaptable Snakes IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The paper presents a typologically adaptable snakes model for image segmentation and object representation. |
T. McInerney and D. Terzopoulos; |
1995 | 12 | Stochastic Completion Fields: A Neural Model Of Illusory Contour Shape And Salience IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We describe an algorithm and representation level theory of illusory contour shape and salience. |
L. R. Williams and D. W. Jacobs; |
1995 | 13 | Recognition Of Human Body Motion Using Phase Space Constraints IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: A new method for representing and recognizing human body movements is presented. |
L. W. Campbell and A. F. Bobick; |
1995 | 14 | Finding Faces In Cluttered Scenes Using Random Labeled Graph Matching IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: An algorithm for locating quasi-frontal views of human faces in cluttered scenes is presented. |
T. K. Leung; M. C. Burl and P. Perona; |
1995 | 15 | Mosaic Based Representations Of Video Sequences And Their Applications IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We describe techniques for the basic elements of the mosaic construction process, namely alignment, integration, and residual analysis. |
M. Irani; P. Anandan and S. Hsu; |
1993 | 1 | Enhanced Image Capture Through Fusion IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The authors present an extension to the pyramid approach to image fusion. |
P. J. Burt and R. J. Kolczynski; |
1993 | 2 | A Framework For The Robust Estimation Of Optical Flow IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: A graduated non-convexity algorithm is presented for recovering optical flow and motion discontinuities. |
M. J. Black and P. Anandan; |
1993 | 3 | Robust Computation Of Optical Flow In A Multi-scale Differential Framework IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The authors developed an algorithm for computing optical flow in a differential framework. |
J. Weber and J. Malik; |
1993 | 4 | Tracking Non-rigid Objects In Complex Scenes IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The authors describe a model-based method for tracking nonrigid objects moving in a complex scene. |
D. P. Huttenlocher; J. J. Noh and W. J. Rucklidge; |
1993 | 5 | A Finite Element Model For 3D Shape Reconstruction And Nonrigid Motion Tracking IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The authors present a physics-based approach for recovering the 3-D shape and tracking the motion of nonrigid objects using a 3-D elastically deformable balloon model. |
T. McInerney and D. Terzopoulos; |
1993 | 6 | A Computational Model Of Neural Contour Processing: Figure-ground Segregation And Illusory Contours IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The authors present a computational model of contour processing that was suggested by neurophysiological recordings from the monkey visual cortex. |
F. Heitger and R. von der Heydt; |
1993 | 7 | Extracting Projective Structure From Single Perspective Views Of 3D Point Sets IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: A number of recent papers have argued that invariants do not exist for three-dimensional point sets in general position, which has often been misinterpreted to mean that … |
C. A. Rothwell; D. A. Forsyth; A. Zisserman and J. L. Mundy; |
1993 | 8 | Linear And Incremental Acquisition Of Invariant Shape Models From Image Sequences IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The authors show how to automatically acquire similarity-invariant shape representations of objects from noisy image sequences under a weak perspective. The incremental nature of … |
D. Weinshall and C. Tomas; |
1993 | 9 | Robust Structure From Motion Using Motion Parallax IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: An efficient and geometrically intuitive algorithm for reliably interpreting the image velocities of moving objects in 3-D is presented. |
R. Cipolla; Y. Okamoto and Y. Kuno; |
1993 | 10 | Fast Segmentation, Tracking, And Analysis Of Deformable Objects IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The authors present a physically based deformable model which can be used to track and analyze non-rigid motion of dynamic structures in time sequences of 2-D or 3-D medical images. |
C. Nastar and N. Ayache; |
1993 | 11 | A Generalized Brightness Change Model For Computing Optical Flow IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Using this model, they describe a method for the computation of optical flow and investigate its performance in a variety of conditions involving brightness variations of scene points, due to illumination nonuniformity, light source motion, specular reflection, and/or interreflection. |
S. Negahdaripour and C. -. Yu; |
1993 | 12 | Recovering Reflectance And Illumination In A World Of Painted Polyhedra IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Such approaches prove inadequate in a 3-D world of painted polyhedra which allows for the existence of discontinuities in both the reflectance and illumination distributions. |
P. Sinha and E. Adelson; |
1993 | 13 | Learning Recognition And Segmentation Of 3-D Objects From 2-D Images IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: A framework called Cresceptron is introduced for automatic algorithm design through learning of concepts and rules, thus deviating from the traditional mode in which humans specify the rules constituting a vision algorithm. |
J. J. Weng; N. Ahuja and T. S. Huang; |
1993 | 14 | Shape From Texture From A Multi-scale Perspective IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The problem of scale in shape from texture is addressed. The need for two scale parameters is emphasized: a local scale, for describing the amount of smoothing used for … |
T. Lindeberg and J. Garding; |
1993 | 15 | Diagonal Transforms Suffice For Color Constancy IF:5 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The overall goal is to present a theoretical analysis connecting many established theories of color constancy. |
G. D. Finlayson; M. S. Drew and B. V. Funt; |
1990 | 1 | Dynamic 3D Models With Local And Global Deformations: Deformable Superquadrics IF:9 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: A physically-based approach is presented to fitting complex 3D shapes using a novel class of dynamic models. |
D. Terzopoulos and D. Metaxas; |
1990 | 2 | Indexing Via Color Histograms IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The authors introduce a technique called histogram intersection for efficiently matching model and image histograms. |
M. J. Swain and D. H. Ballard; |
1990 | 3 | Shape From Interreflections IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: An iterative algorithm is presented that simultaneously recovers the actual shape and the actual reflectance from the pseudo estimates. |
S. K. Nayar; K. Ikeuchi and T. Kanade; |
1990 | 4 | Detecting And Localizing Edges Composed Of Steps, Peaks And Roofs IF:5 Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The projection of depth or orientation discontinuities in a physical scene results in image intensity edges which are not ideal step edges but are more typically a combination of … |
P. Perona and J. Malik; |
1990 | 5 | Matching Range Images Of Human Faces IF:5 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To establish the optimal correspondence, a graph matching algorithm is applied. |
J. C. Lee and E. Milios; |
1990 | 6 | A Locally Adaptive Window For Signal Matching IF:5 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The authors presents a signal matching algorithm that can select an appropriate window size adaptively so as to obtain both precise and stable estimation of correspondences. |
M. Okutomi and T. Kanade; |
1990 | 7 | Pose Determination From Line-to-plane Correspondences: Existence Condition And Closed-form Solutions IF:5 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The author describes a polynomial method that, unlike previous methods, does not require prior knowledge about the location of the object. |
H. H. Chen; |
1990 | 8 | A Fast Algorithm For Active Contours IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: A method of controlling snakes that combines speed, flexibility, and simplicity is presented. |
D. J. Williams and M. Shah; |
1990 | 9 | An Estimation-theoretic Framework For Image-flow Computation IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: A novel framework for computing image flow from time-varying imagery is described. |
A. Singh; |
1990 | 10 | BONSAI: 3D Object Recognition Using Constrained Search IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: A description is presented of BONSAI, a model-based 3-D object recognition system, which identifies and localizes 3-D objects in range images of one or more parts which have been … |
P. J. Flynn and A. K. Jain; |
1990 | 11 | The 2.1-D Sketch IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: A model is described for image segmentation that tries to capture the low-level depth reconstruction exhibited in early human vision, giving an important role to edge terminations. |
M. Nitzberg and D. Mumford; |
1990 | 12 | From Uncertainty To Visual Exploration IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The question posed is what can be inferred from ambiguity in processes of visual interpretation? Much emphasis is naturally placed on the form of constraints used to minimize … |
P. Whaite and F. P. Ferrie; |
1990 | 13 | A Finite Element Method Applied To New Active Contour Models And 3D Reconstruction From Cross Sections IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The authors present a model of deformation which solves some of the problems encountered with the original method such as instability and initial data while reducing the computational complexity. |
L. D. Cohen and I. Cohen; |
1990 | 14 | The Dynamic Analysis Of Apparent Contours IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The authors develop previous theories of the analysis of deformation of apparent contours under viewer motion. |
R. Cipolla and A. Blake; |
1990 | 15 | Vanishing Point Calculation As A Statistical Inference On The Unit Sphere IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: An examination is made of vanishing point calculation as a statistical estimation problem. It is assumed that image line segments have been previously clustered into groups of … |
R. T. Collins and R. S. Weiss; |
1988 | 1 | Geometric Hashing: A General And Efficient Model-based Recognition Scheme IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts View |
Y. Lamdan and H. J. Wolfson; |
1988 | 2 | Structural Saliency: The Detection Of Globally Salient Structures Using A Locally Connected Network IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View |
A. Sha’asua and S. Ullman; |
1988 | 3 | An Adaptive Clustering Algorithm For Image Segmentation IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View |
T. N. Pappas and N. S. Jayant; |
1988 | 4 | On The Sensitivity Of The Hough Transform For Object Recognition IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts View |
W. E. L. Grimson and D. P. Huttenlocher; |
1988 | 5 | Using Dynamic Programming For Minimizing The Energy Of Active Contours In The Presence Of Hard Constraints IF:5 Related Papers Related Patents Related Grants Related Venues Related Experts View |
A. A. Amini; S. Tehrani and T. E. Weymouth; |
1988 | 6 | Parallel Depth Recovery By Changing Camera Parameters IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts View |
M. Subbarao; |
1988 | 7 | Efficiently Computing And Representing Aspect Graphs Of Polyhedral Objects IF:5 Related Papers Related Patents Related Grants Related Venues Related Experts View |
Z. Gigus; J. Canny and R. Seidel; |
1988 | 8 | Modal Control Of An Attentive Vision System IF:5 Related Papers Related Patents Related Grants Related Venues Related Experts View |
J. J. Clark and N. J. Ferrier; |
1988 | 9 | Shape Information From Shading: A Theory About Human Perception IF:5 Related Papers Related Patents Related Grants Related Venues Related Experts View |
A. Pentland; |
1988 | 10 | Organization Of Smooth Image Curves At Multiple Scales IF:5 Related Papers Related Patents Related Grants Related Venues Related Experts View |
D. G. Lowe; |
1988 | 11 | Optimal Corner Detector IF:5 Related Papers Related Patents Related Grants Related Venues Related Experts View |
K. Rangarajan; M. Shah and D. van Brackle; |
1988 | 12 | The Motion Coherence Theory IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts View |
A. L. Yuille and N. M. Grzywacz; |
1988 | 13 | The Combinatorics Of Object Recognition In Cluttered Environments Using Constrained Search IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts View |
W. E. L. Grimson; |
1988 | 14 | The Organization Of Curve Detection: Coarse Tangent Fields And Fine Spline Coverings IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts View |
S. W. Zucker; C. David; A. Dobbins and L. Iverson; |
1988 | 15 | Robust Window Operators IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts View |
P. J. Besl; J. B. Birch and L. T. Watson; |