These are my reading notes; some are comprehensive and some are brief. Some of them also combine my own opinions with other people's public notes. The main areas covered are object detection, image-to-image translation, and video processing. The papers are listed below.
- Deformable_Convolutional_Networks
- DSSD
- Faster_RCNN
- Mask-RCNN
- MultiBox
- Object_detection_at_200_Frames_Per_Second
- R-FCN
- S3FD_Single_Shot_Scale-invariant_Face_Detector
- soft_nms
- SSD
- YOLO
- YOLOv3
- Zero-Shot_Detection
- [BPN]_Single-Shot_Bidirectional_Pyramid_Networks_for_High-Quality_Object_Detection
- [M-FRCNN]Object_Detection_with_Mask-based_Feature_Encoding
- [RefineDet]_Single-Shot_Refinement_Neural_Network_for_Object_Detection
- [RFB]_Receptive_Field_Block_Net_for_Accurate_and_Fast_Object_Detection
- Attention-GAN_for_Object_Transfiguration_in_Wild_Images
- Composable_Unpaired_Image_to_Image_Translation
- DualGAN_Unsupervised_Dual_Learning_for_Image-to-Image_Translation
- Generating_a_Fusion_Image_One’s_Identity_and_Another’s_Shape
- Learning_to_Deblur_Images_with_Exemplars
- WaterGAN_Unsupervised_Generative_Network_to_Enable_Real-time_Color_Correction_of_Monocular_Underwater_Images
- [cd-GAN]_Conditional_Image-to-Image_Translation
- [CGAN]_Conditional_Generative_Adversarial_Nets
- [CycleGAN]_Unpaired_Image-to-Image_Translation_using_Cycle-Consistent_Adversarial_Networks
- [DCGAN]_Unsupervised_Representation_Learning_with_Deep_Convolutional_Generative_Adversarial_Networks
- [GAN_AC]_Connecting_Generative_Adversarial_Networks_and_Actor-Critic_Methods
- [ITGAN]_Improved_Techniques_for_Training_GANs
- [IWGAN]_Improved_Training_of_Wasserstein_GANs
- [OpticalFlow]_Semi-Supervised_Learning_for_Optical_Flow_with_Generative_Adversarial_Networks
- [pix2pix]_Image-to-Image_Translation_with_Conditional_Adversarial_Networks
- [SRGAN]_Photo-Realistic_Single_Image_Super-Resolution_Using_a_Generative_Adversarial_Network
- [SSGAN]_Semi-supervised_Conditional_GANs
- [STN]_Spatial_Transformer_Networks
- [STSR]_Perceptual_Losses_for_Real-Time_Style_Transfer_and_Super-Resolution
- [UNIT]_Unsupervised_Image-to-Image_Translation_Networks
- Dynamic_Video_Segmentation_Network
- Primary_Video_Object_Segmentation_via_Complementary_CNNs_and_Neighborhood_Reversible_Flow
- Video_Object_Segmentation_with_Language_Referring_Expressions
- AboutGAN
- Auto-Directed_Video_Stabilization_with_Robust_L1_Optimal_Camera_Paths
- A_Fast_Orientation_Estimation_Approach_of_Natural_Images
- Deep_multi-scale_video_prediction_beyond_mean_square_error
- Deep_video_deblurring
- Generating_Videos_with_Scene_Dynamics
- Temporal_generative_adversarial_nets_with_singular_value_clipping
- Unsupervised_Learning_for_Physical_Interaction_through_Video_Prediction
- Unsupervised_Learning_of_Video_Representations_using_LSTMs
- [C-RNN-GAN]_Continuous_recurrent_neural_networks_with_adversarial_training
- [CodingFlow]_Enable_Video_Coding_for_Video_Stabilization
- [DBLRGAN]_Adversarial_Spatio-Temporal_Learning_for_Video_Deblurring
- [FFNet]_Video_Fast-Forwarding_via_Reinforcement_Learning
- [GRAN]_Generating_images_with_recurrent_adversarial_networks
- [LAPGAN]_Deep_Generative_Image_Models_using_a_Laplacian_Pyramid_of_Adversarial_Networks
- [MeshFlow]_Minimum_Latency_Online_Video_Stabilization
- [MoCoGAN]_Decomposing_Motion_and_Content_for_Video_Generation
- [SeqGAN]_Sequence_Generative_Adversarial_Nets_with_Policy_Gradient
- [SteadyFlow]_Spatially_Smooth_Optical_Flow_for_Video_Stabilization
- A_Cascaded_Convolutional_Neural_Network_for_Single_Image_Dehazing
- Concern
- Mix_and_match_networks_encoder-decoder_alignment_for_zero-pair_image_translation
- Multimodal_Unsupervised_Image-to-Image_Translation
- Towards_High_Performance_Video_Object_Detection_for_Mobiles
- [S-LSTM-GAN]_Shared_recurrent_neural_networks_with_adversarial_training
- Deep_Reinforcement_Learning_for_Visual_Object_Tracking_in_Videos
- Learning_Dynamic_Memory_Networks_for_Object_Tracking
- Learning_to_Track_Online_Multi-Object_Tracking_by_Decision_Making
- Re3_Real-Time_Recurrent_Regression_Networks_for_Visual_Tracking_of_Generic_Objects
- [ADN]_Action-Decision_Networks_for_Visual_Tracking_with_Deep_Reinforcement_Learning
- [ROLO]_Spatially_Supervised_Recurrent_Convolutional_Neural_Networks_for_Visual_Object_Tracking
- End-to-End_Detection_and_Re-identification_Integrated_Net_for_Person_Search
- Object_Detection_in_Videos_by_Short_and_Long_Range_Object_Linking
- On_The_Stability_of_Video_Detection_and_Tracking
- Recurrent_Neural_Network_Regularization
- Seq-NMS_for_Video_Object_Detection
- T-CNN_Tubelets_with_Convolutional_Neural_Networks_for_Object_Detection_from_Videos
- [A-LSTM]_Online_Video_Object_Detection_using_Association_LSTM
- [AOD]_Attentional_Network_for_Visual_Object_Detection
- [ATW]_Attention-based_Temporal_Weighted_Convolutional_Neural_Network_for_Action_Recognition
- [Bottleneck-LSTM]_Mobile_Video_Object_Detection_with_Temporally-Aware_Feature_Maps
- [ClosedLoop]_Spatio-Temporal_Closed-Loop_Object_Detection
- [D&T]_Detect_to_Track_and_Track_to_Detect
- [DuATM]_Dual_Attention_Matching_Network_for_Context-Aware_Feature_Sequence_based_Person_Re-Identification
- [MGN]_Learning_Discriminative_Features_with_Multiple_Granularities_for_Person_ReID
- [RAM]_Recurrent_Models_of_Visual_Attention
- [Re-id]_An_Improved_Deep_Learning_Architecture_for_Person_Re-Identification
- [ResAtt]_Residual_Attention_Network_for_Image_Classification
- [SoftAtt]_Recurrent_Soft_Attention_Model_for_Common_Object_Recognition
- [STMN]_Spatial-Temporal_Memory_Networks_for_Video_Object_Detection
- [STSN]_Object_Detection_in_Video_with_Spatiotemporal_Sampling_Networks
- [TCN]_Object_Detection_from_Video_Tubelets_with_Convolutional_Neural_Networks
- [TPN]_Object_Detection_in_Videos_with_Tubelet_Proposal_Networks