Computer vision

  • 3D Object Reconstruction from a Single Depth View with Adversarial Learning. arxiv code
  • Abnormal Event Detection in Videos using Spatiotemporal Autoencoder. arxiv tensorflow
  • Accurate Single Stage Detector Using Recurrent Rolling Convolution. arxiv code
  • Active Convolution: Learning the Shape of Convolution for Image Classification. arxiv caffe
  • A generative vision model that trains with high data efficiency and breaks text-based CAPTCHAs. url
  • [AENet] Learning Deep Audio Features for Video Analysis. arxiv code
  • A Neural Representation of Sketch Drawings. arxiv pytorch
  • A network of deep neural networks for distant speech recognition. arxiv
  • A New Convolutional Network-in-Network Structure and Its Applications in Skin Detection, Semantic Segmentation, and Artifact Reduction. arxiv
  • Annotating Object Instances with a Polygon-RNN. arxiv
  • Building Detection from Satellite Images on a Global Scale. arxiv
  • Cascade R-CNN: Delving into High Quality Object Detection. arxiv code
  • Class-Weighted Convolutional Features for Visual Instance Search. arxiv code
  • Convolutional 2D Knowledge Graph Embeddings. arxiv code
  • CortexNet: a Generic Network Family for Robust Visual Temporal Representations. arxiv code
  • CSVideoNet: A Real-time End-to-end Learning Framework for High-frame-rate Video Compressive Sensing. arxiv caffe
  • Cost-Effective Active Learning for Deep Image Classification. arxiv
  • Convolutional Oriented Boundaries: From Image Segmentation to High-Level Tasks. arxiv caffe
  • DCT-like Transform for Image Compression Requires 14 Additions Only. arxiv
  • Deep Alignment Network: A convolutional neural network for robust face alignment. arxiv code
  • Deep Bayesian Active Learning with Image Data. arxiv keras
  • Deep Convolutional Neural Networks for Pairwise Causality. arxiv
  • DeepFix: Fixing Common C Language Errors by Deep Learning. pdf code
  • DeepFM: A Factorization-Machine based Neural Network for CTR Prediction. arxiv tensorflow
  • Deep Image Prior. pdf code
  • Deep Learning Based Large-Scale Automatic Satellite Crosswalk Classification. arxiv code
  • Deep Learning Features at Scale for Visual Place Recognition. arxiv
  • Deep learning for predicting refractive error from retinal fundus images. arxiv
  • Deep Learning for Tumor Classification in Imaging Mass Spectrometry. arxiv
  • Deep Hybrid Similarity Learning for Person Re-identification. arxiv
  • Deep Photo Style Transfer. arxiv code
  • DenseReg: Fully Convolutional Dense Shape Regression In-the-Wild. arxiv code
  • Detecting Curve Text in the Wild: New Dataset and New Solution. arxiv code
  • Detecting Oriented Text in Natural Images by Linking Segments. arxiv tensorflow
  • Disentangled Person Image Generation. arxiv
  • Disentangling Motion, Foreground and Background Features in Videos. arxiv code
  • Disguised Face Identification (DFI) with Facial KeyPoints using Spatial Fusion Convolutional Network. arxiv
  • DR2-Net: Deep Residual Reconstruction Network for Image Compressive Sensing. arxiv
  • Dual-Path Convolutional Image-Text Embedding. arxiv code
  • End-to-end Recovery of Human Shape and Pose. arxiv code
  • End-to-end Training for Whole Image Breast Cancer Diagnosis using An All Convolutional Design. arxiv code
  • End-to-end weakly-supervised semantic alignment. arxiv pytorch
  • Estimated Depth Map Helps Image Classification. arxiv code
  • Exercise Motion Classification from Large-Scale Wearable Sensor Data Using Convolutional Neural Networks. arxiv
  • Extreme 3D Face Reconstruction: Looking Past Occlusions. arxiv code
  • Explaining How a Deep Neural Network Trained with End-to-End Learning Steers a Car. arxiv
  • FaceBoxes: A CPU Real-time Face Detector with High Accuracy. arxiv code
  • Face Detection using Deep Learning: An Improved Faster RCNN Approach. arxiv
  • Fader Networks: Manipulating Images by Sliding Attributes. arxiv pytorch
  • Fast Image Processing with Fully-Convolutional Networks. arxiv code
  • FCN-rLSTM: Deep Spatio-Temporal Neural Networks for Vehicle Counting in City Cameras. arxiv
  • Focal Loss for Dense Object Detection. arxiv mxnet tensorflow
  • Im2Pano3D: Extrapolating 360 Structure and Semantics Beyond the Field of View. arxiv
  • Image-Grounded Conversations: Multimodal Context for Natural Question and Response Generation. arxiv
  • Improving Smiling Detection with Race and Gender Diversity. arxiv
  • Improved Texture Networks: Maximizing Quality and Diversity in Feed-forward Stylization and Texture Synthesis. arxiv code
  • Joint auto-encoders: a flexible multi-task learning framework. arxiv
  • Knowledge Concentration: Learning 100K Object Classifiers in a Single CNN. arxiv
  • Large-Scale Evolution of Image Classifiers. arxiv pytorch
  • Learning a Mixture of Deep Networks for Single Image Super-Resolution. arxiv code
  • Learning a time-dependent master saliency map from eye-tracking data in videos. arxiv code
  • Learning Deep Representations for Scene Labeling with Semantic Context Guided Supervision. arxiv
  • Learning Deep ResNet Blocks Sequentially using Boosting Theory. arxiv
  • Learning Feature Pyramids for Human Pose Estimation. arxiv code
  • Learning to Estimate 3D Hand Pose from Single RGB Images. arxiv tensorflow
  • Learning to Estimate Pose by Watching Videos. arxiv
  • Learning to Generate Posters of Scientific Papers by Probabilistic Graphical Models. arxiv
  • Learning to Learn from Noisy Web Videos. arxiv
  • Learning to Segment Every Thing. arxiv
  • Lifting from the Deep: Convolutional 3D Pose Estimation from a Single Image. arxiv
  • Light-Head R-CNN: In Defense of Two-Stage Object Detector. arxiv code
  • Linear Disentangled Representation Learning for Facial Actions. arxiv code
  • Loss Max-Pooling for Semantic Image Segmentation. arxiv pytorch
  • Mask R-CNN. arxiv caffe mxnet
  • MentorNet: Regularizing Very Deep Neural Networks on Corrupted Labels. arxiv
  • Mix-and-Match Tuning for Self-Supervised Semantic Segmentation. arxiv code
  • MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. arxiv pytorch keras tensorflow
  • Modeling Relational Data with Graph Convolutional Networks. arxiv
  • MonoCap: Monocular Human Motion Capture using a CNN Coupled with a Geometric Prior. arxiv
  • Multi-Scale Dense Networks for Resource Efficient Image Classification. arxiv code
  • Negative Results in Computer Vision: A Perspective. arxiv
  • Neural Motifs: Scene Graph Parsing with Global Context. arxiv code
  • Object Detection Using Deep CNNs Trained on Synthetic Images. arxiv code
  • Octree Generating Networks: Efficient Convolutional Architectures for High-resolution 3D Outputs. arxiv code
  • Optimizing Deep CNN-Based Queries over Video Streams at Scale. arxiv tensorflow
  • Pedestrian Alignment Network for Large-scale Person Re-identification. arxiv code
  • Perceptually Optimized Image Rendering. arxiv
  • PersonRank: Detecting Important People in Images. arxiv
  • Photographic Image Synthesis with Cascaded Refinement Networks. arxiv tensorflow
  • Pixel Recursive Super Resolution. arxiv
  • Putting a Face to the Voice: Fusing Audio and Visual Signals Across a Video to Determine Speakers. arxiv
  • Receptive Field Block Net for Accurate and Fast Object Detection. arxiv code
  • Recurrent Scale Approximation for Object Detection in CNN. arxiv code
  • Rethinking Atrous Convolution for Semantic Image Segmentation. arxiv
  • S^3FD: Single Shot Scale-invariant Face Detector. arxiv pytorch
  • SfM-Net: Learning of Structure and Motion from Video. arxiv
  • Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner. arxiv code
  • Single-Shot Refinement Neural Network for Object Detection. arxiv caffe
  • SLAM with Objects using a Nonparametric Pose Graph. arxiv code
  • Smart, Sparse Contours to Represent and Edit Images. arxiv
  • STN-OCR: A single Neural Network for Text Detection and Text Recognition. arxiv code
  • Soft + Hardwired Attention: An LSTM Framework for Human Trajectory Prediction and Abnormal Event Detection. arxiv
  • Sparse-to-Dense: Depth Prediction from Sparse Depth Samples and a Single Image. arxiv code
  • SphereFace: Deep Hypersphere Embedding for Face Recognition. arxiv code
  • Spinal cord gray matter segmentation using deep dilated convolutions. arxiv code
  • SSPP-DAN: Deep Domain Adaptation Network for Face Recognition with Single Sample Per Person. arxiv
  • StreetStyle: Exploring world-wide clothing styles from millions of photos. arxiv
  • SuperPoint: Self-Supervised Interest Point Detection and Description. arxiv
  • Supervised Multilayer Sparse Coding Networks for Image Classification. arxiv
  • SurfaceNet: An End-to-end 3D Neural Network for Multiview Stereopsis. arxiv code
  • SwGridNet: A Deep Convolutional Neural Network based on Grid Topology for Image Classification. arxiv code
  • [Tacotron] Towards End-to-End Speech Synthesis. arxiv code
  • Tangent: Automatic Differentiation Using Source Code Transformation in Python. arxiv code
  • Time-Contrastive Networks: Self-Supervised Learning from Video. arxiv
  • Towards a Principled Integration of Multi-Camera Re-Identification and Tracking through Optimal Bayes Filters. arxiv
  • Toward Geometric Deep SLAM. arxiv
  • Towards perspective-free object counting with deep learning. pdf code
  • Training object class detectors with click supervision. arxiv
  • TransFlow: Unsupervised Motion Flow by Joint Geometric and Pixel-level Estimation. arxiv code
  • Using Deep Learning and Google Street View to Estimate the Demographic Makeup of the US. arxiv
  • Unsupervised Image-to-Image Translation Networks. arxiv tensorflow
  • Unsupervised Learning by Predicting Noise. arxiv
  • Unsupervised Learning of Long-Term Motion Dynamics for Videos. arxiv
  • Variational Approaches for Auto-Encoding Generative Adversarial Networks. arxiv
  • Video-based Person Re-identification with Accumulative Motion Context. arxiv
  • Video Frame Interpolation via Adaptive Separable Convolution. pdf pytorch
  • Video Frame Synthesis using Deep Voxel Flow. arxiv code
  • ViP-CNN: A Visual Phrase Reasoning Convolutional Neural Network for Visual Relationship Detection. arxiv
  • Visualizing LSTM decisions. arxiv
  • Visual Attribute Transfer through Deep Image Analogy. arxiv pytorch
  • Visual Discovery at Pinterest. arxiv
  • Visualizing Residual Networks. arxiv
  • VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection. arxiv
  • Wide-Residual-Inception Networks for Real-time Object Detection. arxiv
  • YouTube-BoundingBoxes: A Large High-Precision Human-Annotated Data Set for Object Detection in Video. arxiv code
  • Zoom Out-and-In Network with Recursive Training for Object Proposal. arxiv