Skip to content

isLinXu/paper-list

Repository files navigation

paper-listGitHub starsGitHub forksGitHub watchersBuild StatusimgGitHub repo sizeGitHub language countGitHub last commitGitHubimg


Paper-List-DAILY
Automatically Update Papers Daily in list

Updated on 2024.11.26

paper_list

Table of Contents
  1. Classification
  2. Object Detection
  3. Semantic Segmentation
  4. Object Tracking
  5. Action Recognition
  6. Pose Estimation
  7. Image Generation
  8. LLM
  9. Scene Understanding
  10. Depth Estimation
  11. Audio Processing
  12. Multimodal
  13. Anomaly Detection
  14. Transfer Learning
  15. Optical Flow
  16. Reinforcement Learning
  17. Graph Neural Networks

Classification

Publish Date Title Authors PDF Code
2024-11-22 FOCUS: Knowledge-enhanced Adaptive Visual Compression for Few-shot Whole Slide Image Classification Zhengrui Guo et.al. 2411.14743 link
2024-11-21 Adaptable Embeddings Network (AEN) Stan Loosmore et.al. 2411.13786 null
2024-11-20 Hierarchical Text Classification (HTC) vs. eXtreme Multilabel Classification (XML): Two Sides of the Same Medal Nerijus Bertalis et.al. 2411.13687 link
2024-11-20 Combining Autoregressive and Autoencoder Language Models for Text Classification João Gonçalves et.al. 2411.13282 link
2024-11-20 MEGL: Multimodal Explanation-Guided Learning Yifei Zhang et.al. 2411.13053 null
2024-11-19 Problem-dependent convergence bounds for randomized linear gradient compression Thomas Flynn et.al. 2411.12898 null
2024-11-19 Enhancing Multi-Class Disease Classification: Neoplasms, Cardiovascular, Nervous System, and Digestive Disorders Using Advanced LLMs Ahmed Akib Jawad Karim et.al. 2411.12712 null
2024-11-22 STREAM: A Universal State-Space Model for Sparse Geometric Data Mark Schöne et.al. 2411.12603 null
2024-11-19 AdaCM $^2$ : On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction Yuanbin Man et.al. 2411.12593 null
2024-11-19 Zero-Shot Crate Digging: DJ Tool Retrieval Using Speech Activity, Music Structure And CLAP Embeddings Iroro Orife et.al. 2411.12209 link
2024-11-19 Invariant Shape Representation Learning For Image Classification Tonmoy Hossain et.al. 2411.12201 link
2024-11-19 Self-Supervised Learning in Deep Networks: A Pathway to Robust Few-Shot Classification Yuyang Xiao et.al. 2411.12151 null
2024-11-18 Just Leaf It: Accelerating Diffusion Classifiers with Hierarchical Class Pruning Arundhati S. Shanbhag et.al. 2411.12073 link
2024-11-18 Vision Language Models Are Few-Shot Audio Spectrogram Classifiers Satvik Dixit et.al. 2411.12058 null
2024-11-18 Fair Distillation: Teaching Fairness from Biased Teachers in Medical Imaging Milad Masroor et.al. 2411.11939 null
2024-11-18 Exploring Emerging Trends and Research Opportunities in Visual Place Recognition Antonios Gasteratos et.al. 2411.11481 null
2024-11-16 MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map Yuhong Chou et.al. 2411.10741 null
2024-11-16 Diagnostic Text-guided Representation Learning in Hierarchical Classification for Pathological Whole Slide Image Jiawen Li et.al. 2411.10709 null
2024-11-16 Multi-perspective Contrastive Logit Distillation Qi Wang et.al. 2411.10693 null
2024-11-15 Vision Eagle Attention: A New Lens for Advancing Image Classification Mahmudul Hasan et.al. 2411.10564 link
2024-11-15 On the Cost of Model-Serving Frameworks: An Experimental Evaluation Pasquale De Rosa et.al. 2411.10337 null
2024-11-15 Embedding Byzantine Fault Tolerance into Federated Learning via Virtual Data-Driven Consistency Scoring Plugin Youngjoon Lee et.al. 2411.10212 link
2024-11-15 Outliers resistant image classification by anomaly detection Anton Sergeev et.al. 2411.10150 null
2024-11-15 Adapting the Biological SSVEP Response to Artificial Neural Networks Emirhan Böge et.al. 2411.10084 null
2024-11-15 Evidential Federated Learning for Skin Lesion Image Classification Rutger Hendrix et.al. 2411.10071 null
2024-11-14 Adversarial Attacks Using Differentiable Rendering: A Survey Matthew Hull et.al. 2411.09749 null
2024-11-14 ResidualDroppath: Enhancing Feature Reuse over Residual Connections Sejik Park et.al. 2411.09475 null
2024-11-14 SAG-ViT: A Scale-Aware, High-Fidelity Patching Approach with Graph Attention for Vision Transformers Shravan Venkatraman et.al. 2411.09420 null
2024-11-14 Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing Imagery Ashim Dahal et.al. 2411.09101 link
2024-11-13 Computed tomography using meta-optics Maksym Zhelyeznuyakov et.al. 2411.08995 null
2024-11-13 CoCoP: Enhancing Text Classification with LLM through Code Completion Prompt Mohammad Mahdi Mohajeri et.al. 2411.08979 null
2024-11-13 ScaleNet: Scale Invariance Learning in Directed Graphs Qin Jiang et.al. 2411.08758 link
2024-11-13 Efficient Whole Slide Image Classification through Fisher Vector Representation Ravi Kant Gupta et.al. 2411.08530 null
2024-11-12 HMIL: Hierarchical Multi-Instance Learning for Fine-Grained Whole Slide Image Classification Cheng Jin et.al. 2411.07660 null
2024-11-12 Semantic segmentation on multi-resolution optical and microwave data using deep learning Jai G Singla et.al. 2411.07581 null
2024-11-11 The Inherent Adversarial Robustness of Analog In-Memory Computing Corey Lammie et.al. 2411.07023 null
2024-11-11 ScaleKD: Strong Vision Transformers Could Be Excellent Teachers Jiawei Fan et.al. 2411.06786 link
2024-11-11 A Text Classification Model Combining Adversarial Training with Pre-trained Language Model and neural networks: A Case Study on Telecom Fraud Incident Texts Liu Zhuoxian et.al. 2411.06772 null
2024-11-11 Can KAN Work? Exploring the Potential of Kolmogorov-Arnold Networks in Computer Vision Yueyang Cang et.al. 2411.06727 null
2024-11-10 Deep Active Learning in the Open World Tian Xie et.al. 2411.06353 null
2024-11-09 Clustering Algorithms and RAG Enhancing Semi-Supervised Text Classification with Large LLMs Shan Zhong et.al. 2411.06175 null
2024-11-09 AI-Compass: A Comprehensive and Effective Multi-module Testing Tool for AI Systems Zhiyu Zhu et.al. 2411.06146 null
2024-11-09 Exploring Structural Nonlinearity in Binary Polariton-Based Neuromorphic Architectures Evgeny Sedov et.al. 2411.06124 null
2024-11-09 Mutual-energy inner product optimization method for constructing feature coordinates and image classification in Machine Learning Yuanxiu Wang et.al. 2411.06100 null
2024-11-08 GUIDEQ: Framework for Guided Questioning for progressive informational collection and classification Priya Mishra et.al. 2411.05991 link
2024-11-08 FisherMask: Enhancing Neural Network Labeling Efficiency in Image Classification Using Fisher Information Shreen Gul et.al. 2411.05752 link
2024-11-08 Visual-TCAV: Concept-based Attribution and Saliency Maps for Post-hoc Explainability in Image Classification Antonio De Santis et.al. 2411.05698 null
2024-11-08 Efficient Audio-Visual Fusion for Video Classification Mahrukh Awan et.al. 2411.05603 null
2024-11-08 Training objective drives the consistency of representational similarity across datasets Laure Ciernik et.al. 2411.05561 link
2024-11-08 Estimating the Influence of Sequentially Correlated Literary Properties in Textual Classification: A Data-Centric Hypothesis-Testing Approach Gideon Yoffe et.al. 2411.04950 null
2024-11-07 Attention Masks Help Adversarial Attacks to Bypass Safety Detectors Yunfan Shi et.al. 2411.04772 link
2024-11-07 Zero-Shot Temporal Resolution Domain Adaptation for Spiking Neural Networks Sanja Karilanova et.al. 2411.04760 null
2024-11-07 Is network fragmentation a useful complexity measure? Coenraad Mouton et.al. 2411.04695 null
2024-11-07 DISCO: DISCovering Overfittings as Causal Rules for Text Classification Models Zijian Zhang et.al. 2411.04649 null
2024-11-07 Neural Fingerprints for Adversarial Attack Detection Haim Fisher et.al. 2411.04533 link
2024-11-06 Multimodal Structure-Aware Quantum Data Processing Hala Hawashin et.al. 2411.04242 null
2024-11-06 RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language Models Maya Varma et.al. 2411.04097 link
2024-11-06 Overcoming label shift in targeted federated learning Edvin Listo Zec et.al. 2411.03799 null
2024-11-06 Deferred Poisoning: Making the Model More Vulnerable via Hessian Singularization Yuhao He et.al. 2411.03752 null
2024-11-05 Judge Like a Real Doctor: Dual Teacher Sample Consistency Framework for Semi-supervised Medical Image Classification Zhang Qixiang et.al. 2411.03041 null
2024-11-06 Confidence Calibration of Classifiers with Many Classes Adrien LeCoz et.al. 2411.02988 link
2024-11-05 Domain Expansion and Boundary Growth for Open-Set Single-Source Domain Generalization Pengkun Jiao et.al. 2411.02920 null
2024-11-05 ADOPT: Modified Adam Can Converge with Any $β_2$ with the Optimal Rate Shohei Taniguchi et.al. 2411.02853 link
2024-11-05 Integrated lithium niobate photonic computing circuit based on efficient and high-speed electro-optic conversion Yaowen Hu et.al. 2411.02734 null
2024-11-06 Wave Network: An Ultra-Small Language Model Xin Zhang et.al. 2411.02674 null
2024-11-04 FUSECAPS: Investigating Feature Fusion Based Framework for Capsule Endoscopy Image Classification Bidisha Chakraborty et.al. 2411.02637 null
2024-11-04 TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives Maitreya Patel et.al. 2411.02545 null
2024-11-04 A Comparative Analysis of Instruction Fine-Tuning LLMs for Financial Text Classification Sorouralsadat Fatemi et.al. 2411.02476 null
2024-11-04 Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models Sharat Agarwal et.al. 2411.01925 null
2024-11-03 Optimizing Gastrointestinal Diagnostics: A CNN-Based Model for VCE Image Classification Vaneeta Ahlawat et.al. 2411.01652 null
2024-11-03 ParseCaps: An Interpretable Parsing Capsule Network for Medical Image Diagnosis Xinyu Geng et.al. 2411.01564 null
2024-11-03 Efficient Deep Learning Infrastructures for Embedded Computing Systems: A Comprehensive Survey and Future Envision Xiangzhong Luo et.al. 2411.01431 null
2024-11-02 Combining Financial Data and News Articles for Stock Price Movement Prediction Using Large Language Models Ali Elahi et.al. 2411.01368 null
2024-11-02 Optimizing Violence Detection in Video Classification Accuracy through 3D Convolutional Neural Networks Aarjav Kavathia et.al. 2411.01348 null
2024-11-02 MIC: Medical Image Classification Using Chest X-ray (COVID-19 and Pneumonia) Dataset with the Help of CNN and Customized CNN Nafiz Fahad et.al. 2411.01163 null
2024-11-02 Few-Class Arena: A Benchmark for Efficient Selection of Vision Models and Dataset Difficulty Measurement Bryan Bo Cao et.al. 2411.01099 link
2024-11-01 Towards Robust Text Classification: Mitigating Spurious Correlations with Causal Learning Yuqing Zhou et.al. 2411.01045 null
2024-11-01 FISHing in Uncertainty: Synthetic Contrastive Learning for Genetic Aberration Detection Simon Gutwein et.al. 2411.01025 link
2024-10-31 Video Token Merging for Long-form Video Understanding Seon-Ho Lee et.al. 2410.23782 null
2024-10-31 Neurobench: DCASE 2020 Acoustic Scene Classification benchmark on XyloAudio 2 Weijie Ke et.al. 2410.23776 null
2024-10-31 QUEST-A: Untrained Filtering with Trained Focusing led to Enhanced Quantum Architectures Lian-Hui Yu et.al. 2410.23560 link
2024-11-01 Large Language Models for Patient Comments Multi-Label Classification Hajar Sakai et.al. 2410.23528 null
2024-10-30 Multilingual Vision-Language Pre-training for the Remote Sensing Domain João Daniel Silva et.al. 2410.23370 null
2024-10-30 Domain-decomposed image classification algorithms using linear discriminant analysis and convolutional neural networks Axel Klawonn et.al. 2410.23359 null
2024-10-30 CLIPErase: Efficient Unlearning of Visual-Textual Associations in CLIP Tianyu Yang et.al. 2410.23330 null
2024-10-30 Don't Just Pay Attention, PLANT It: Transfer L2R Models to Fine-tune Attention in Extreme Multi-Label Text Classification Debjyoti Saharoy et.al. 2410.23066 null
2024-10-30 Automated Trustworthiness Oracle Generation for Machine Learning Text Classifiers Lam Nguyen Tung et.al. 2410.22663 null
2024-10-29 Developing Convolutional Neural Networks using a Novel Lamarckian Co-Evolutionary Algorithm Zaniar Sharifi et.al. 2410.22487 null
2024-10-29 EfficientNet with Hybrid Attention Mechanisms for Enhanced Breast Histopathology Classification: A Comprehensive Approach Naren Sengodan et.al. 2410.22392 null
2024-10-29 DISCERN: Decoding Systematic Errors in Natural Language for Text Classifiers Rakesh R. Menon et.al. 2410.22239 null
2024-10-29 Class-Aware Contrastive Optimization for Imbalanced Text Classification Grigorii Khvatskii et.al. 2410.22197 null
2024-10-29 Active Learning for Vision-Language Models Bardia Safaei et.al. 2410.22187 null
2024-10-29 Multi-Level Feature Distillation of Joint Teachers Trained on Distinct Image Datasets Adrian Iordache et.al. 2410.22184 link
2024-10-29 Natural Language Processing for Analyzing Electronic Health Records and Clinical Notes in Cancer Research: A Review Muhammad Bilal et.al. 2410.22180 null
2024-10-29 FakeFormer: Efficient Vulnerability-Driven Transformers for Generalisable Deepfake Detection Dat Nguyen et.al. 2410.21964 null
2024-10-29 Bayesian Optimization for Hyperparameters Tuning in Neural Networks Gabriele Onorato et.al. 2410.21886 null
2024-10-29 Advancing Efficient Brain Tumor Multi-Class Classification -- New Insights from the Vision Mamba Model in Transfer Learning Yinyi Lai et.al. 2410.21872 null
2024-10-28 Audio Classification of Low Feature Spectrograms Utilizing Convolutional Neural Networks Noel Elias et.al. 2410.21561 null
2024-10-30 A Novel Score-CAM based Denoiser for Spectrographic Signature Extraction without Ground Truth Noel Elias et.al. 2410.21557 null
2024-10-28 Attacking Misinformation Detection Using Adversarial Examples Generated by Language Models Piotr Przybyła et.al. 2410.20940 null
2024-10-28 Data-Efficient Low-Complexity Acoustic Scene Classification via Distilling and Progressive Pruning Bing Han et.al. 2410.20775 null
2024-10-28 Interpretable Image Classification with Adaptive Prototype-based Vision Transformers Chiyu Ma et.al. 2410.20722 null
2024-10-27 Graph Neural Networks on Discriminative Graphs of Words Yassine Abbahaddou et.al. 2410.20469 null
2024-10-27 Historical Test-time Prompt Tuning for Vision Foundation Models Jingyi Zhang et.al. 2410.20346 null
2024-10-27 Sequential Large Language Model-Based Hyper-Parameter Optimization Kanan Mahammadli et.al. 2410.20302 link
2024-10-26 Enhancing CNN Classification with Lamarckian Memetic Algorithms and Local Search Akhilbaran Ghosh et.al. 2410.20234 null
2024-10-26 Annotation Efficiency: Identifying Hard Samples via Blocked Sparse Linear Bandits Adit Jain et.al. 2410.20041 null
2024-10-26 Attacks against Abstractive Text Summarization Models through Lead Bias and Influence Functions Poojitha Thota et.al. 2410.20019 null
2024-10-26 Vulnerability of LLMs to Vertically Aligned Text Manipulations Zhecheng Li et.al. 2410.20016 null
2024-10-25 Learning the Regularization Strength for Deep Fine-Tuning via a Data-Emphasized Variational Objective Ethan Harvey et.al. 2410.19675 null
2024-10-24 Noise Adaption Network for Morse Code Image Classification Xiaxia Wang et.al. 2410.19180 link
2024-10-24 Hybrid Quantum-Classical Feature Extraction approach for Image Classification using Autoencoders and Quantum SVMs Donovan Slabbert et.al. 2410.18814 null
2024-10-24 Spatial-Temporal Search for Spiking Neural Networks Kaiwei Che et.al. 2410.18580 null
2024-10-25 Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks Lehan Wang et.al. 2410.18387 null
2024-10-23 Using Cartesian slice plots of a cosmological simulation as input of a convolutional neural network Guillermo Arreaga-Garcia et.al. 2410.18320 null
2024-10-25 Backdoor in Seconds: Unlocking Vulnerabilities in Large Pre-trained Models via Model Editing Dongliang Guo et.al. 2410.18267 null
2024-10-23 Future Token Prediction -- Causal Language Modelling with Per-Token Semantic State Vector for Multi-Token Prediction Nicholas Walker et.al. 2410.18160 null
2024-10-23 Deep Learning for Active Region Classification: A Systematic Study from Convolutional Neural Networks to Vision Transformers Edoardo Legnaro et.al. 2410.17816 null
2024-10-23 New Insight in Cervical Cancer Diagnosis Using Convolution Neural Network Architecture Ach. Khozaimi et.al. 2410.17735 null
2024-10-24 Advancing Interpretability in Text Classification through Prototype Learning Bowen Wei et.al. 2410.17546 null
2024-10-23 Enhancing Multimodal Medical Image Classification using Cross-Graph Modal Contrastive Learning Jun-En Ding et.al. 2410.17494 null
2024-10-22 Data Obfuscation through Latent Space Projection (LSP) for Privacy-Preserving AI Governance: Case Studies in Medical Diagnosis and Finance Fraud Detection Mahesh Vaijainthymala Krishnamoorthy et.al. 2410.17459 null
2024-10-22 Altogether: Image Captioning via Re-aligning Alt-text Hu Xu et.al. 2410.17251 null
2024-10-22 KANICE: Kolmogorov-Arnold Networks with Interactive Convolutional Elements Md Meftahul Ferdaus et.al. 2410.17172 link
2024-10-22 Development of CNN Architectures using Transfer Learning Methods for Medical Image Classification Ganga Prasad Basyal et.al. 2410.16711 null
2024-10-21 Efficient Neural Network Training via Subset Pretraining Jan Spörer et.al. 2410.16523 null
2024-10-21 1024m at SMM4H 2024: Tasks 3, 5 & 6 -- Ensembles of Transformers and Large Language Models for Medical Text Classification Ram Mohan Rao Kadiyala et.al. 2410.15998 null
2024-10-21 Visual Representation Learning Guided By Multi-modal Prior Knowledge Hongkuan Zhou et.al. 2410.15981 null
2024-10-21 AutoTrain: No-code training for state-of-the-art models Abhishek Thakur et.al. 2410.15735 link
2024-10-21 ViMoE: An Empirical Study of Designing Vision Mixture-of-Experts Xumeng Han et.al. 2410.15732 null
2024-10-21 P-YOLOv8: Efficient and Accurate Real-Time Detection of Distracted Driving Mohamed R. Elshamy et.al. 2410.15602 null
2024-10-20 Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability Yusuke Hosoya et.al. 2410.15315 link
2024-10-19 Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion Chaodong Xiao et.al. 2410.15091 link
2024-10-19 PAT: Parameter-Free Audio-Text Aligner to Boost Zero-Shot Audio Classification Ashish Seth et.al. 2410.15062 null
2024-10-19 Weakly-supervised diagnosis identification from Italian discharge letters Vittorio Torri et.al. 2410.15051 null
2024-10-19 Reflexive Guidance: Improving OoDD in Vision-Language Models via Self-Guided Image-Adaptive Concept Generation Seulbi Lee et.al. 2410.14975 null
2024-10-18 A Hybrid Feature Fusion Deep Learning Framework for Leukemia Cancer Detection in Microscopic Blood Sample Using Gated Recurrent Unit and Uncertainty Quantification Maksuda Akter et.al. 2410.14536 null
2024-10-18 Unlearning Backdoor Attacks for LLMs with Weak-to-Strong Knowledge Distillation Shuai Zhao et.al. 2410.14425 link
2024-10-18 A Novel Method to Metigate Demographic and Expert Bias in ICD Coding with Causal Inference Bin Zhang et.al. 2410.14236 null
2024-10-18 Comparative Evaluation of Clustered Federated Learning Method Michael Ben Ali et.al. 2410.14212 link
2024-10-17 Reproducibility study of "LICO: Explainable Models with Language-Image Consistency" Luan Fletcher et.al. 2410.13989 link
2024-10-17 LoLDU: Low-Rank Adaptation via Lower-Diag-Upper Decomposition for Parameter-Efficient Fine-Tuning Yiming Shi et.al. 2410.13618 link
2024-10-17 Augmentation Policy Generation for Image Classification Using Large Language Models Ant Duru et.al. 2410.13453 null
2024-10-17 Similarity-Dissimilarity Loss with Supervised Contrastive Learning for Multi-label Classification Guangming Huang et.al. 2410.13439 null
2024-10-16 Interpreting and Analyzing CLIP's Zero-Shot Image Classification via Mutual Knowledge Fawaz Sammani et.al. 2410.13016 link
2024-10-16 PND-Net: Plant Nutrition Deficiency and Disease Classification using Graph Convolutional Network Asish Bera et.al. 2410.12742 null
2024-10-16 Beyond Speech and More: Investigating the Emergent Ability of Speech Foundation Models for Classifying Physiological Time-Series Signals Orchid Chetia Phukan et.al. 2410.12645 null
2024-10-17 From Measurement Instruments to Data: Leveraging Theory-Driven Synthetic Training Data for Classifying Social Constructs Lukas Birkenmaier et.al. 2410.12622 null
2024-10-16 Feature Augmentation for Self-supervised Contrastive Learning: A Closer Look Yong Zhang et.al. 2410.12396 null
2024-10-15 Clustering doc2vec output for topic-dimensionality reduction: A MITRE ATT&CK calibration Nathan Monnet et.al. 2410.11573 null
2024-10-15 LoKO: Low-Rank Kalman Optimizer for Online Fine-Tuning of Large Models Hossein Abdi et.al. 2410.11551 null
2024-10-15 Reducing Labeling Costs in Sentiment Analysis via Semi-Supervised Learning Minoo Jafarlou et.al. 2410.11355 null
2024-10-14 Towards a More Complete Theory of Function Preserving Transforms Michael Painter et.al. 2410.11038 null
2024-10-14 Enhancing JEPAs with Spatial Conditioning: Robust and Efficient Representation Learning Etai Littwin et.al. 2410.10773 null
2024-10-15 Ensemble of ConvNeXt V2 and MaxViT for Long-Tailed CXR Classification with View-Based Aggregation Yosuke Yamagishi et.al. 2410.10710 link
2024-10-14 Queryable Prototype Multiple Instance Learning with Vision-Language Models for Incremental Whole Slide Image Classification Jiaxiang Gou et.al. 2410.10573 null
2024-10-14 Dynamic Power Control in a Hardware Neural Network with Error-Configurable MAC Units Maedeh Ghaderi et.al. 2410.10545 null
2024-10-14 Improve Meta-learning for Few-Shot Text Classification with All You Can Acquire from the Tasks Xinyue Liu et.al. 2410.10454 link
2024-10-14 GlobalMamba: Global Image Serialization for Vision Mamba Chengkun Wang et.al. 2410.10316 link
2024-10-14 A Multi-Task Text Classification Pipeline with Natural Language Explanations: A User-Centric Evaluation in Sentiment Analysis and Offensive Language Identification in Greek Tweets Nikolaos Mylonas et.al. 2410.10290 null
2024-10-14 big.LITTLE Vision Transformer for Efficient Visual Recognition He Guo et.al. 2410.10267 null
2024-10-14 SkillAggregation: Reference-free LLM-Dependent Aggregation Guangzhi Sun et.al. 2410.10215 null
2024-10-14 Will the Inclusion of Generated Data Amplify Bias Across Generations in Future Image Classification Models? Zeliang Zhang et.al. 2410.10160 null
2024-10-11 Efficient Hyperparameter Importance Assessment for CNNs Ruinan Wang et.al. 2410.08920 null
2024-10-11 Parameter-Efficient Fine-Tuning of Large Language Models using Semantic Knowledge Tuning Nusrat Jahan Prottasha et.al. 2410.08598 null
2024-10-11 DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention Nguyen Huu Bao Long et.al. 2410.08582 link
2024-10-11 Accelerated Distributed Stochastic Non-Convex Optimization over Time-Varying Directed Networks Yiyue Chen et.al. 2410.08508 null
2024-10-11 Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP Eunji Kim et.al. 2410.08469 null
2024-10-10 Bilinear MLPs enable weight-based mechanistic interpretability Michael T. Pearce et.al. 2410.08417 null
2024-10-10 What is Left After Distillation? How Knowledge Transfer Impacts Fairness and Bias Aida Mohammadshahi et.al. 2410.08407 null
2024-10-10 Time Traveling to Defend Against Adversarial Example Attacks in Image Classification Anthony Etim et.al. 2410.08338 null
2024-10-10 More Experts Than Galaxies: Conditionally-overlapping Experts With Biologically-Inspired Fixed Routing Sagi Shaier et.al. 2410.08003 null
2024-10-10 When the Small-Loss Trick is Not Enough: Multi-Label Image Classification with Noisy Labels Applied to CCTV Sewer Inspections Keryan Chelouche et.al. 2410.07689 null
2024-10-10 Invisibility Cloak: Disappearance under Human Pose Estimation via Backdoor Attacks Minxing Zhang et.al. 2410.07670 null
2024-10-10 StablePrompt: Automatic Prompt Tuning using Reinforcement Learning for Large Language Models Minchan Kwon et.al. 2410.07652 null
2024-10-10 Explainability of Deep Neural Networks for Brain Tumor Detection S. Park et.al. 2410.07613 link
2024-10-10 CSA: Data-efficient Mapping of Unimodal Features to Multimodal Features Po-han Li et.al. 2410.07610 null
2024-10-09 One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation Fabian Paischer et.al. 2410.07170 link
2024-10-09 JPEG Inspired Deep Learning Ahmed H. Salamah et.al. 2410.07081 link
2024-10-09 Optimizing Estimators of Squared Calibration Errors in Classification Sebastian G. Gruber et.al. 2410.07014 null
2024-10-09 Spectral and Rhythm Features for Audio Classification with Deep Convolutional Neural Networks Friedrich Wolf-Monheim et.al. 2410.06927 null
2024-10-09 QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model Fei Xie et.al. 2410.06806 null
2024-10-09 Convex Distillation: Efficient Compression of Deep Networks via Convex Optimization Prateek Varshney et.al. 2410.06567 null
2024-10-08 A Comparative Study of Hybrid Models in Health Misinformation Text Classification Mkululi Sikosana et.al. 2410.06311 null
2024-10-08 Conformal Structured Prediction Botong Zhang et.al. 2410.06296 link
2024-10-08 TEOChat: A Large Vision-Language Assistant for Temporal Earth Observation Data Jeremy Andrew Irvin et.al. 2410.06234 null
2024-10-08 Manual Verbalizer Enrichment for Few-Shot Text Classification Quang Anh Nguyen et.al. 2410.06173 null
2024-10-07 LoTLIP: Improving Language-Image Pre-training for Long Text Understanding Wei Wu et.al. 2410.05249 null
2024-10-07 Variable Resolution Pixel Quantization for Low Power Machine Vision Application on Edge Senorita Deb et.al. 2410.05189 null
2024-10-07 IGroupSS-Mamba: Interval Group Spatial-Spectral Mamba for Hyperspectral Image Classification Yan He et.al. 2410.05100 null
2024-10-07 Explanation sensitivity to the randomness of large language models: the case of journalistic text classification Jeremie Bogaert et.al. 2410.05085 null
2024-10-07 Control-oriented Clustering of Visual Latent Representation Han Qi et.al. 2410.05063 null
2024-10-07 SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Classification Benjamin Feuer et.al. 2410.05057 link
2024-10-07 Art Forgery Detection using Kolmogorov Arnold and Convolutional Neural Networks Sandro Boccuzzo et.al. 2410.04866 null
2024-10-06 MECFormer: Multi-task Whole Slide Image Classification with Expert Consultation Network Doanh C. Bui et.al. 2410.04507 null
2024-10-06 Interpret Your Decision: Logical Reasoning Regularization for Generalization in Visual Classification Zhaorui Tan et.al. 2410.04492 link
2024-10-05 IT $^3$ : Idempotent Test-Time Training Nikita Durasov et.al. 2410.04201 null
2024-10-04 Classification-Denoising Networks Louis Thiry et.al. 2410.03505 null
2024-10-04 A Multimodal Framework for Deepfake Detection Kashish Gandhi et.al. 2410.03487 null
2024-10-04 On Uncertainty In Natural Language Processing Dennis Ulmer et.al. 2410.03446 link
2024-10-04 Comparing zero-shot self-explanations with human rationales in multilingual text classification Stephanie Brandl et.al. 2410.03296 null
2024-10-04 Sm: enhanced localization in Multiple Instance Learning for medical imaging classification Francisco M. Castro-Macías et.al. 2410.03276 null
2024-10-04 Selective Transformer for Hyperspectral Image Classification Yichu Xu et.al. 2410.03171 null
2024-10-03 CPFD: Confidence-aware Privileged Feature Distillation for Short Video Classification Jinghao Shi et.al. 2410.03038 null
2024-10-03 On Expert Estimation in Hierarchical Mixture of Experts: Beyond Softmax Gating Functions Huy Nguyen et.al. 2410.02935 null
2024-10-03 Lie Algebra Canonicalization: Equivariant Neural Operators under arbitrary Lie Groups Zakhar Shumaylov et.al. 2410.02698 null
2024-10-03 LoGra-Med: Long Context Multi-Graph Alignment for Medical Vision-Language Model Duy M. H. Nguyen et.al. 2410.02615 null
2024-10-03 Personalized Quantum Federated Learning for Privacy Image Classification Jinjing Shi et.al. 2410.02547 null
2024-10-03 BiSSL: Bilevel Optimization for Self-Supervised Pre-Training and Fine-Tuning Gustav Wagner Zakarias et.al. 2410.02387 null
2024-10-03 CTARR: A fast and robust method for identifying anatomical regions on CT images via atlas registration Thomas Buddenkotte et.al. 2410.02316 link
2024-10-03 Hard Negative Sample Mining for Whole Slide Image Classification Wentao Huang et.al. 2410.02212 link
2024-10-02 Kolmogorov-Arnold Network Autoencoders Mohammadamin Moradi et.al. 2410.02077 link
2024-10-02 Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data Sreyan Ghosh et.al. 2410.02056 null
2024-10-02 FLAG: Financial Long Document Classification via AMR-based GNN Bolun et.al. 2410.02024 link
2024-10-02 MONICA: Benchmarking on Long-tailed Medical Image Classification Lie Ju et.al. 2410.02010 null
2024-10-02 Revisiting Hierarchical Text Classification: Inference and Metrics Roman Plaud et.al. 2410.01305 link
2024-10-02 Automatic deductive coding in discourse analysis: an application of large language models in learning analytics Lishan Zhang et.al. 2410.01240 null
2024-10-01 Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-Time Chiao-An Yang et.al. 2410.01083 link
2024-10-01 Local-to-Global Self-Supervised Representation Learning for Diabetic Retinopathy Grading Mostafa Hajighasemloua et.al. 2410.00779 null
2024-10-01 NECOMIMI: Neural-Cognitive Multimodal EEG-informed Image Generation with Diffusion Models Chi-Sheng Chen et.al. 2410.00712 null
2024-10-01 TikGuard: A Deep Learning Transformer-Based Solution for Detecting Unsuitable TikTok Content for Kids Mazen Balat et.al. 2410.00403 null
2024-09-30 KPCA-CAM: Visual Explainability of Deep Computer Vision Models using Kernel PCA Sachin Karmani et.al. 2410.00267 null
2024-09-30 A Methodology for Explainable Large Language Models with Integrated Gradients and Linguistic Analysis in Text Classification Marina Ribeiro et.al. 2410.00250 null
2024-09-30 Evaluating the performance of state-of-the-art esg domain-specific pre-trained large language models in text classification against existing models and traditional machine learning techniques Tin Yuet Chung et.al. 2410.00207 null
2024-10-02 Evaluating the fairness of task-adaptive pretraining on unlabeled test data before few-shot text classification Kush Dubey et.al. 2410.00179 link
2024-09-30 POMONAG: Pareto-Optimal Many-Objective Neural Architecture Generator Eugenio Lomurno et.al. 2409.20447 null
2024-09-30 Satellite image classification with neural quantum kernels Pablo Rodriguez-Grasa et.al. 2409.20356 null
2024-09-30 All-optical autoencoder machine learning framework using diffractive processors Peijie Feng et.al. 2409.20346 null
2024-09-30 Fine-Tuning Personalization in Federated Learning to Mitigate Adversarial Clients Youssef Allouah et.al. 2409.20329 null
2024-09-30 Classroom-Inspired Multi-Mentor Distillation with Adaptive Learning Strategies Shalini Sarode et.al. 2409.20237 null
2024-09-30 Classification of Radiological Text in Small and Imbalanced Datasets in a Non-English Language Vincent Beliveau et.al. 2409.20147 null
2024-09-30 SATA: Spatial Autocorrelation Token Analysis for Enhancing the Robustness of Vision Transformers Nick Nikzad et.al. 2409.19850 null
2024-09-29 Adversarial Examples for DNA Classification Hyunwoo Yoo et.al. 2409.19788 null
2024-09-29 FAST: A Dual-tier Few-Shot Learning Paradigm for Whole Slide Image Classification Kexue Fu et.al. 2409.19720 null
2024-09-29 Vision-Language Models are Strong Noisy Label Detectors Tong Wei et.al. 2409.19696 link
2024-09-27 Unconditional stability of a recurrent neural circuit implementing divisive normalization Shivang Rawat et.al. 2409.18946 null
2024-09-27 Subspace Preserving Quantum Convolutional Neural Network Architectures Léo Monbroussou et.al. 2409.18918 null
2024-09-27 Med-IC: Fusing a Single Layer Involution with Convolutions for Enhanced Medical Image Classification and Segmentation Md. Farhadul Islam et.al. 2409.18506 null
2024-09-26 Towards the Mitigation of Confirmation Bias in Semi-supervised Learning: a Debiased Training Perspective Yu Wang et.al. 2409.18316 null
2024-09-26 Realistic Evaluation of Model Merging for Compositional Generalization Derek Tam et.al. 2409.18314 null
2024-09-26 DARE: Diverse Visual Question Answering with Robustness Evaluation Hannah Sterz et.al. 2409.18023 null
2024-09-26 The Lou Dataset -- Exploring the Impact of Gender-Fair Language in German Text Classification Andreas Waldis et.al. 2409.17929 null
2024-09-26 Cascade Prompt Learning for Vision-Language Model Adaptation Ge Wu et.al. 2409.17805 null
2024-09-26 Byzantine-Robust Aggregation for Securing Decentralized Federated Learning Diego Cajaraville-Aboy et.al. 2409.17754 null
2024-09-26 Let the Quantum Creep In: Designing Quantum Neural Network Models by Gradually Swapping Out Classical Components Peiyong Wang et.al. 2409.17583 link
2024-09-26 Leveraging Annotator Disagreement for Text Classification Jin Xu et.al. 2409.17577 null
2024-09-26 Uni-Med: A Unified Medical Generalist Foundation Model For Multi-Task Learning Via Connector-MoE Xun Zhu et.al. 2409.17508 null
2024-09-26 Reducing and Exploiting Data Augmentation Noise through Meta Reweighting Contrastive Learning for Text Classification Guanyi Mou et.al. 2409.17474 null
2024-09-26 Navigating the Shortcut Maze: A Comprehensive Analysis of Shortcut Learning in Text Classification by Language Models Yuqing Zhou et.al. 2409.17455 null
2024-09-25 Block Expanded DINORET: Adapting Natural Domain Foundation Models for Retinal Imaging Without Catastrophic Forgetting Jay Zoellin et.al. 2409.17332 null
2024-09-25 BitQ: Tailoring Block Floating Point Precision for Improved DNN Efficiency on Resource-Constrained Devices Yongqi Xu et.al. 2409.17093 link
2024-09-25 Accumulator-Aware Post-Training Quantization Ian Colbert et.al. 2409.17092 null
2024-09-26 HVT: A Comprehensive Vision Framework for Learning in Non-Euclidean Space Jacob Fein-Ashley et.al. 2409.16897 link
2024-09-25 Shifting from endangerment to rebirth in the Artificial Intelligence Age: An Ensemble Machine Learning Approach for Hawrami Text Classification Aram Khaksar et.al. 2409.16884 null
2024-09-25 Explicitly Modeling Pre-Cortical Vision with a Neuro-Inspired Front-End Improves CNN Robustness Lucas Piper et.al. 2409.16838 link
2024-09-24 Unleashing the Potential of Synthetic Images: A Study on Histopathology Image Classification Leire Benito-Del-Valle et.al. 2409.16002 link
2024-09-24 An ensemble framework approach of hybrid Quantum convolutional neural networks for classification of breast cancer images Dibyasree Guha et.al. 2409.15958 null
2024-09-24 iGAiVA: Integrated Generative AI and Visual Analytics in a Machine Learning Workflow for Text Classification Yuanzhe Jin et.al. 2409.15848 link
2024-09-23 Optimizing News Text Classification with Bi-LSTM and Attention Mechanism for Efficient Data Processing Bingyao Liu et.al. 2409.15576 null
2024-09-23 Critic Loss for Image Classification Brendan Hogan Rappazzo et.al. 2409.15565 null
2024-09-23 VLMine: Long-Tail Data Mining with Vision Language Models Mao Ye et.al. 2409.15486 null
2024-09-23 HydroVision: LiDAR-Guided Hydrometric Prediction with Vision Transformers and Hybrid Graph Learning Naghmeh Shafiee Roudbari et.al. 2409.15213 null
2024-09-23 Benchmarking Edge AI Platforms for High-Performance ML Inference Rakshith Jayanth et.al. 2409.14803 null
2024-09-23 Less yet robust: crucial region selection for scene recognition Jianqi Zhang et.al. 2409.14741 null
2024-09-22 Low-Light Enhancement Effect on Classification and Detection: An Empirical Study Xu Wu et.al. 2409.14461 null
2024-09-18 Unraveling the Hessian: A Key to Smooth Convergence in Loss Function Landscapes Nikita Kiselev et.al. 2409.11995 link
2024-09-18 Data Efficient Acoustic Scene Classification using Teacher-Informed Confusing Class Instruction Jin Jie Sean Yeo et.al. 2409.11964 null
2024-09-18 Agglomerative Token Clustering Joakim Bruslund Haurum et.al. 2409.11923 null
2024-09-18 Distillation-free Scaling of Large SSMs for Images and Videos Hamid Suleman et.al. 2409.11867 null
2024-09-18 Community Shaping in the Digital Age: A Temporal Fusion Framework for Analyzing Discourse Fragmentation in Online Social Networks Amirhossein Dezhboro et.al. 2409.11665 null
2024-09-18 Few-Shot Learning Approach on Tuberculosis Classification Based on Chest X-Ray Images A. A. G. Yogi Pramana et.al. 2409.11644 null
2024-09-18 Hyperspectral Image Classification Based on Faster Residual Multi-branch Spiking Neural Network Yang Liu et.al. 2409.11619 null
2024-09-17 Multi-Cohort Framework with Cohort-Aware Attention and Adversarial Mutual-Information Minimization for Whole Slide Image Classification Sharon Peled et.al. 2409.11119 null
2024-09-17 Anti-ESIA: Analyzing and Mitigating Impacts of Electromagnetic Signal Injection Attacks Denglin Kang et.al. 2409.10922 null
2024-09-16 Are Deep Learning Models Robust to Partial Object Occlusion in Visual Recognition Tasks? Kaleb Kassaw et.al. 2409.10775 null
2024-09-16 Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning Amin Karimi Monsefi et.al. 2409.10362 null
2024-09-16 InfoDisent: Explainability of Image Classification Models by Information Disentanglement Łukasz Struski et.al. 2409.10329 null
2024-09-16 Enhancing Image Classification in Small and Unbalanced Datasets through Synthetic Data Augmentation Neil De La Fuente et.al. 2409.10286 null
2024-09-15 Finetuning CLIP to Reason about Pairwise Differences Dylan Sam et.al. 2409.09721 null
2024-09-15 Compositional Audio Representation Learning Sripathi Sridhar et.al. 2409.09619 null
2024-09-14 One missing piece in Vision and Language: A Survey on Comics Understanding Emanuele Vivoli et.al. 2409.09502 link
2024-09-14 Real-world Adversarial Defense against Patch Attacks based on Diffusion Model Xingxing Wei et.al. 2409.09406 null
2024-09-14 Turbo your multi-modal classification with contrastive learning Zhiyu Zhang et.al. 2409.09282 null
2024-09-14 Leveraging Foundation Models for Efficient Federated Learning in Resource-restricted Edge Networks S. Kawa Atapour et.al. 2409.09273 null
2024-09-13 ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds Sreyan Ghosh et.al. 2409.09213 link
2024-09-13 Pushing the boundaries of event subsampling in event-based video classification using CNNs Hesam Araghi et.al. 2409.08953 link
2024-09-13 Pushing Joint Image Denoising and Classification to the Edge Thomas C Markhorst et.al. 2409.08943 null
2024-09-13 Byzantine-Robust and Communication-Efficient Distributed Learning via Compressed Momentum Filtering Changxin Liu et.al. 2409.08640 null
2024-09-13 Anytime Continual Learning for Open Vocabulary Classification Zhen Zhu et.al. 2409.08518 link
2024-09-12 Enhancing Few-Shot Image Classification through Learnable Multi-Scale Embedding and Attention Mechanisms Fatemeh Askari et.al. 2409.07989 link
2024-09-12 Microscopic-Mamba: Revealing the Secrets of Microscopic Images with Just 4M Parameters Shun Zou et.al. 2409.07896 link
2024-09-12 Classifying Images with CoLaNET Spiking Neural Network -- the MNIST Example Mikhail Kiselev et.al. 2409.07833 null
2024-09-12 Efficient Privacy-Preserving KAN Inference Using Homomorphic Encryption Zhizheng Lai et.al. 2409.07751 null
2024-09-12 DFDG: Data-Free Dual-Generator Adversarial Distillation for One-Shot Federated Learning Kangyang Luo et.al. 2409.07734 null
2024-09-12 Cooperative Inference with Interleaved Operator Partitioning for CNNs Zhibang Liu et.al. 2409.07693 null
2024-09-11 Token Turing Machines are Efficient Vision Models Purvish Jajal et.al. 2409.07613 null
2024-09-11 Minimizing Embedding Distortion for Robust Out-of-Distribution Performance Tom Shaked et.al. 2409.07582 null
2024-09-11 A Contrastive Symmetric Forward-Forward Algorithm (SFFA) for Continual Learning Tasks Erik B. Terres-Escudero et.al. 2409.07387 null
2024-09-11 Optimizing Neural Network Performance and Interpretability with Diophantine Equation Encoding Ronald Katende et.al. 2409.07310 null
2024-09-11 LLM-based feature generation from text for interpretable machine learning Vojtěch Balek et.al. 2409.07132 null
2024-09-11 Privacy-Preserving Federated Learning with Consistency via Knowledge Distillation Using Conditional Generator Kangyang Luo et.al. 2409.06955 null
2024-09-10 Dynamic Decoupling of Placid Terminal Attractor-based Gradient Descent Algorithm Jinwei Zhao et.al. 2409.06542 null
2024-09-10 Seam Carving as Feature Pooling in CNN Mohammad Imrul Jubair et.al. 2409.06311 null
2024-09-10 EntAugment: Entropy-Driven Adaptive Data Augmentation Framework for Image Classification Suorong Yang et.al. 2409.06290 link
2024-09-09 A Small Claims Court for the NLP: Judging Legal Text Classification Strategies With Small Datasets Mariana Yukari Noguti et.al. 2409.05972 null
2024-09-09 SVFit: Parameter-Efficient Fine-Tuning of Large Pre-Trained Models Using Singular Values Chengwei Sun et.al. 2409.05926 null
2024-09-09 Adversarial Attacks on Data Attribution Xinhe Wang et.al. 2409.05657 null
2024-09-09 Look One and More: Distilling Hybrid Order Relational Knowledge for Cross-Resolution Image Recognition Shiming Ge et.al. 2409.05384 null
2024-09-09 RexUniNLU: Recursive Method with Explicit Schema Instructor for Universal NLU Chengyuan Liu et.al. 2409.05275 null
2024-09-09 Scalable Frame Sampling for Video Classification: A Semi-Optimal Policy Approach with Reduced Search Space Junho Lee et.al. 2409.05260 null
2024-09-08 PatchAlign:Fair and Accurate Skin Disease Image Classification by Alignment with Clinical Labels Aayushman et.al. 2409.04975 link
2024-09-07 Activation Function Optimization Scheme for Image Classification Abdur Rahman et.al. 2409.04915 null
2024-09-07 LoCa: Logit Calibration for Knowledge Distillation Runming Yang et.al. 2409.04778 null
2024-09-07 Swin Transformer for Robust Differentiation of Real and Synthetic Images: Intra- and Inter-Dataset Analysis Preetu Mehta et.al. 2409.04734 null
2024-09-06 Connectivity-Inspired Network for Context-Aware Recognition Gianluca Carloni et.al. 2409.04360 null
2024-09-06 An optically accelerated extreme learning machine using hot atomic vapors Pierre Azam et.al. 2409.04312 null
2024-09-06 PlantSeg: A Large-Scale In-the-wild Dataset for Plant Disease Segmentation Tianqi Wei et.al. 2409.04038 null
2024-09-05 Deep Clustering of Remote Sensing Scenes through Heterogeneous Transfer Learning Isaac Ray et.al. 2409.03938 null
2024-09-05 WaterMAS: Sharpness-Aware Maximization for Neural Network Watermarking Carl De Sousa Trias et.al. 2409.03902 null
2024-09-05 On-board Satellite Image Classification for Earth Observation: A Comparative Study of Pre-Trained Vision Transformer Models Thanh-Dung Le et.al. 2409.03901 null
2024-09-05 Have Large Vision-Language Models Mastered Art History? Ombretta Strafforello et.al. 2409.03521 null
2024-09-05 Non-Uniform Illumination Attack for Fooling Convolutional Neural Networks Akshay Jain et.al. 2409.03458 link
2024-09-05 Training-free Conversion of Pretrained ANNs to SNNs for Low-Power and High-Performance Applications Tong Bu et.al. 2409.03368 null
2024-09-05 PEPL: Precision-Enhanced Pseudo-Labeling for Fine-Grained Image Classification in Semi-Supervised Learning Bowen Tian et.al. 2409.03192 null
2024-09-05 The AdEMAMix Optimizer: Better, Faster, Older Matteo Pagliardini et.al. 2409.03137 null
2024-09-04 iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation Hayeon Jo et.al. 2409.02838 null
2024-09-03 MedUnA: Language guided Unsupervised Adaptation of Vision-Language Models for Medical Image Classification Umaima Rahman et.al. 2409.02729 null
2024-09-05 OpenFact at CheckThat! 2024: Combining Multiple Attack Methods for Effective Adversarial Text Generation Włodzimierz Lewoniewski et.al. 2409.02649 null
2024-09-04 Boosting Generalizability towards Zero-Shot Cross-Dataset Single-Image Indoor Depth by Meta-Initialization Cho-Ying Wu et.al. 2409.02486 null
2024-09-03 Evaluation and Comparison of Visual Language Models for Transportation Engineering Problems Sanjita Prajapati et.al. 2409.02278 null
2024-09-05 Robust Clustering on High-Dimensional Data with Stochastic Quantization Anton Kozyriev et.al. 2409.02066 link
2024-09-03 Compressed learning based onboard semantic compression for remote sensing platforms Protim Bhattacharjee et.al. 2409.01988 null
2024-09-03 State-of-the-art Advances of Deep-learning Linguistic Steganalysis Research Yihao Wang et.al. 2409.01780 null
2024-09-03 Enhancing Fine-Grained Visual Recognition in the Low-Data Regime Through Feature Magnitude Regularization Avraham Chapman et.al. 2409.01672 null
2024-09-03 ReSpike: Residual Frames-based Hybrid Spiking Neural Networks for Efficient Action Recognition Shiting Xiao et.al. 2409.01564 null
2024-08-30 Assessing Generative Language Models in Classification Tasks: Performance and Self-Evaluation Capabilities in the Environmental and Climate Change Domain Francesca Grasso et.al. 2408.17362 link
2024-08-30 Covariance-corrected Whitening Alleviates Network Degeneration on Imbalanced Classification Zhiwei Zhang et.al. 2408.17197 null
2024-08-30 Improving Extraction of Clinical Event Contextual Properties from Electronic Health Records: A Comparative Study Shubham Agarwal et.al. 2408.17181 null
2024-09-02 Instant Adversarial Purification with Adversarial Consistency Distillation Chun Tong Lei et.al. 2408.17064 null
2024-08-30 Generative Modeling Perspective for Control and Reasoning in Robotics Takuma Yoneda et.al. 2408.17041 null
2024-08-29 Tex-ViT: A Generalizable, Robust, Texture-based dual-branch cross-attention deepfake detector Deepak Dagar et.al. 2408.16892 null
2024-08-29 SODAWideNet++: Combining Attention and Convolutions for Salient Object Detection Rohit Venkata Sai Dulam et.al. 2408.16645 null
2024-08-29 Android Malware Detection Based on RGB Images and Multi-feature Fusion Zhiqiang Wang et.al. 2408.16555 null
2024-08-29 SAU: A Dual-Branch Network to Enhance Long-Tailed Recognition via Generative Models Guangxi Li et.al. 2408.16273 link
2024-08-29 Improving Diffusion-based Data Augmentation with Inversion Spherical Interpolation Yanghao Wang et.al. 2408.16266 null
2024-08-29 Low Saturation Confidence Distribution-based Test-Time Adaptation for Cross-Domain Remote Sensing Image Classification Yu Liang et.al. 2408.16265 null
2024-08-28 EMP: Enhance Memory in Data Pruning Jinying Xiao et.al. 2408.16031 null
2024-08-28 Local Descriptors Weighted Adaptive Threshold Filtering For Few-Shot Learning Bingchen Yan et.al. 2408.15924 null
2024-08-28 ModalityMirror: Improving Audio Classification in Modality Heterogeneity Federated Learning with Multimodal Distillation Tiantian Feng et.al. 2408.15803 null
2024-08-28 Visual Prompt Engineering for Medical Vision Language Models in Radiology Stefan Denner et.al. 2408.15802 null
2024-08-28 Harnessing the Intrinsic Knowledge of Pretrained Language Models for Challenging Text Classification Settings Lingyu Gao et.al. 2408.15650 null
2024-08-27 DCT-CryptoNets: Scaling Private Inference in the Frequency Domain Arjun Roy et.al. 2408.15231 null
2024-08-27 A Review of Transformer-Based Models for Computer Vision Tasks: Capturing Global Context and Spatial Relationships Gracile Astlin Pereira et.al. 2408.15178 null
2024-08-28 AnomalousPatchCore: Exploring the Use of Anomalous Samples in Industrial Anomaly Detection Mykhailo Koshil et.al. 2408.15113 null
2024-08-27 Data downlink prioritization using image classification on-board a 6U CubeSat Keenan A. A. Chatar et.al. 2408.14865 null
2024-08-27 Leveraging Self-supervised Audio Representations for Data-Efficient Acoustic Scene Classification Yiqiang Cai et.al. 2408.14862 null
2024-08-27 Text-guided Foundation Model Adaptation for Long-Tailed Medical Image Classification Sirui Li et.al. 2408.14770 null
2024-08-26 On-Chip Learning with Memristor-Based Neural Networks: Assessing Accuracy and Efficiency Under Device Variations, Conductance Errors, and Input Noise M. Reza Eslami et.al. 2408.14680 null
2024-08-26 Attend-Fusion: Efficient Audio-Visual Fusion for Video Classification Mahrukh Awan et.al. 2408.14441 null
2024-08-26 Uncertainties of Latent Representations in Computer Vision Michael Kirchhof et.al. 2408.14281 null
2024-08-26 MSFMamba: Multi-Scale Feature Fusion State Space Model for Multi-Source Remote Sensing Image Classification Feng Gao et.al. 2408.14255 null
2024-08-26 Feature Aligning Few shot Learning Method Using Local Descriptors Weighted Rules Bingchen Yan et.al. 2408.14192 null
2024-08-26 GenFormer -- Generated Images are All You Need to Improve Robustness of Transformers on Small Datasets Sven Oehri et.al. 2408.14131 null
2024-08-25 Few-Shot Histopathology Image Classification: Evaluating State-of-the-Art Methods and Unveiling Performance Insights Ardhendu Sekhar et.al. 2408.13816 null
2024-08-25 On the Robustness of Kolmogorov-Arnold Networks: An Adversarial Perspective Tal Alter et.al. 2408.13809 null
2024-08-25 Enhancing Adaptive Deep Networks for Image Classification via Uncertainty-aware Decision Fusion Xu Zhang et.al. 2408.13744 link
2024-08-25 3D-RCNet: Learning from Transformer to Build a 3D Relational ConvNet for Hyperspectral Image Classification Haizhao Jing et.al. 2408.13728 null
2024-08-24 Enhanced Astronomical Source Classification with Integration of Attention Mechanisms and Vision Transformers Srinadh Reddy Bhavanam et.al. 2408.13634 null
2024-08-23 Domain-specific long text classification from sparse relevant information Célia D'Cruz et.al. 2408.13253 null
2024-08-23 EAViT: External Attention Vision Transformer for Audio Classification Aquib Iqbal et.al. 2408.13201 null
2024-08-23 A gradient system based on anisotropic monochrome image processing with orientation auto-adjustment Harbir Antil et.al. 2408.12847 null
2024-08-23 Underwater SONAR Image Classification and Analysis using LIME-based Explainable Artificial Intelligence Purushothaman Natarajan et.al. 2408.12837 null
2024-08-23 VALE: A Multimodal Visual and Language Explanation Framework for Image Classifiers using eXplainable AI and Language Models Purushothaman Natarajan et.al. 2408.12808 null
2024-08-23 BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks on Large Language Models Yige Li et.al. 2408.12798 null
2024-08-23 Semi-Supervised Variational Adversarial Active Learning via Learning to Rank and Agreement-Based Pseudo Labeling Zongyao Lyu et.al. 2408.12774 null
2024-08-23 Symmetric masking strategy enhances the performance of Masked Image Modeling Khanh-Binh Nguyen et.al. 2408.12772 null
2024-08-22 ssProp: Energy-Efficient Training for Convolutional Neural Networks with Scheduled Sparse Back Propagation Lujia Zhong et.al. 2408.12561 link
2024-08-22 The Russian-focused embedders' exploration: ruMTEB benchmark and Russian embedding model design Artem Snegirev et.al. 2408.12503 null
2024-08-22 Enhanced Infield Agriculture with Interpretable Machine Learning Approaches for Crop Classification Sudi Murindanyi et.al. 2408.12426 null
2024-08-22 AT-SNN: Adaptive Tokens for Vision Transformer on Spiking Neural Network Donghwa Kang et.al. 2408.12293 null
2024-08-22 Whole Slide Image Classification of Salivary Gland Tumours John Charlton et.al. 2408.12275 null
2024-08-22 Query-Efficient Video Adversarial Attack with Stylized Logo Duoxun Tang et.al. 2408.12099 null
2024-08-21 Approaching Deep Learning through the Spectral Dynamics of Weights David Yunis et.al. 2408.11804 link
2024-08-21 SBDet: A Symmetry-Breaking Object Detector via Relaxed Rotation-Equivariance Zhiqiang Wu et.al. 2408.11760 null
2024-08-21 Improving Calibration by Relating Focal Loss, Temperature Scaling, and Properness Viacheslav Komisarenko et.al. 2408.11598 link
2024-08-21 MSCPT: Few-shot Whole Slide Image Classification with Multi-scale and Context-focused Prompt Tuning Minghao Han et.al. 2408.11505 null
2024-08-21 Enabling Small Models for Zero-Shot Classification through Model Label Learning Jia Zhang et.al. 2408.11449 null
2024-08-21 Automatic Dataset Construction (ADC): Sample Collection, Data Curation, and Beyond Minghao Liu et.al. 2408.11338 null
2024-08-21 Towards Evaluating Large Language Models on Sarcasm Understanding Yazhou Zhang et.al. 2408.11319 null
2024-08-20 Privacy-preserving Universal Adversarial Defense for Black-box Models Qiao Li et.al. 2408.10647 null
2024-08-20 A Tutorial on Explainable Image Classification for Dementia Stages Using Convolutional Neural Network and Gradient-weighted Class Activation Mapping Kevin Kam Fung Yuen et.al. 2408.10572 null
2024-08-20 NoMatterXAI: Generating "No Matter What" Alterfactual Examples for Explaining Black-Box Text Classification Models Tuc Nguyen et.al. 2408.10528 null
2024-08-20 Cervical Cancer Detection Using Multi-Branch Deep Learning Model Tatsuhiro Baba et.al. 2408.10498 null
2024-08-19 HaSPeR: An Image Repository for Hand Shadow Puppet Recognition Syed Rifat Raiyan et.al. 2408.10360 link
2024-08-19 Leveraging Superfluous Information in Contrastive Representation Learning Xuechu Yu et.al. 2408.10292 null
2024-08-19 SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models Anke Tang et.al. 2408.10174 link
2024-08-19 Towards Robust Federated Image Classification: An Empirical Study of Weight Selection Strategies in Manufacturing Vinit Hegiste et.al. 2408.10024 null
2024-08-19 Detecting Adversarial Attacks in Semantic Segmentation via Uncertainty Estimation: A Deep Analysis Kira Maag et.al. 2408.10021 null
2024-08-19 Active Learning for Identifying Disaster-Related Tweets: A Comparison with Keyword Filtering and Generic Fine-Tuning David Hanny et.al. 2408.09914 null
2024-08-19 Ranking Generated Answers: On the Agreement of Retrieval Models with Humans on Consumer Health Questions Sebastian Heineking et.al. 2408.09831 null
2024-08-19 AutoML-guided Fusion of Entity and LLM-based representations Boshko Koloski et.al. 2408.09794 null
2024-08-19 Dataset Distillation for Histopathology Image Classification Cong Cong et.al. 2408.09709 null
2024-08-19 A Strategy to Combine 1stGen Transformers and Open LLMs for Automatic Text Classification Claudio M. V. de Andrade et.al. 2408.09629 null
2024-08-18 Attention Is Not What You Need: Revisiting Multi-Instance Learning for Whole Slide Image Classification Xin Liu et.al. 2408.09449 null
2024-08-17 Narrowing the Focus: Learned Optimizers for Pretrained Models Gus Kristiansen et.al. 2408.09310 null
2024-08-16 DPA: Dual Prototypes Alignment for Unsupervised Adaptation of Vision-Language Models Eman Ali et.al. 2408.08855 null
2024-08-16 LEVIS: Large Exact Verifiable Input Spaces for Neural Networks Mohamad Fares El Hajj Chehade et.al. 2408.08824 null
2024-08-16 Leveraging FourierKAN Classification Head for Pre-Trained Transformer-based Text Classification Abdullah Al Imran et.al. 2408.08803 null
2024-08-16 Xpikeformer: Hybrid Analog-Digital Hardware Acceleration for Spiking Transformers Zihang Song et.al. 2408.08794 null
2024-08-16 Quantum convolutional neural networks for jet images classification Hala Elhag et.al. 2408.08701 null
2024-08-16 MM-UNet: A Mixed MLP Architecture for Improved Ophthalmic Image Segmentation Zunjie Xiao et.al. 2408.08600 null
2024-08-16 Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs Jinming Liu et.al. 2408.08575 null
2024-08-16 Efficient Image-to-Image Diffusion Classifier for Adversarial Robustness Hefei Mei et.al. 2408.08502 link
2024-08-15 Beyond Uniform Query Distribution: Key-Driven Grouped Query Attention Zohaib Khan et.al. 2408.08454 null
2024-08-15 Predictive uncertainty estimation in deep learning for lung carcinoma classification in digital pathology under real dataset shifts Abdur R. Fayjie et.al. 2408.08432 null
2024-08-15 SLCA++: Unleash the Power of Sequential Fine-tuning for Continual Learning with Pre-training Gengwei Zhang et.al. 2408.08295 link
2024-08-15 Moving Healthcare AI-Support Systems for Visually Detectable Diseases onto Constrained Devices Tess Watt et.al. 2408.08215 null
2024-08-15 Towards flexible perception with visual memory Robert Geirhos et.al. 2408.08172 null
2024-08-15 Category-Prompt Refined Feature Learning for Long-Tailed Multi-Label Image Classification Jiexuan Yan et.al. 2408.08125 link
2024-08-15 HAIR: Hypernetworks-based All-in-One Image Restoration Jin Cao et.al. 2408.08091 link
2024-08-14 Large Language Models Prompting With Episodic Memory Dai Do et.al. 2408.07465 null
2024-08-14 Leveraging Perceptual Scores for Dataset Pruning in Computer Vision Tasks Raghavendra Singh et.al. 2408.07243 null
2024-08-13 Efficient Search for Customized Activation Functions with Gradient Descent Lukas Strack et.al. 2408.06820 link
2024-08-13 Do Vision-Language Foundational models show Robust Visual Perception? Shivam Chandhok et.al. 2408.06781 link
2024-08-13 Towards Cross-Domain Single Blood Cell Image Classification via Large-Scale LoRA-based Segment Anything Model Yongcheng Li et.al. 2408.06716 link
2024-08-13 Coherence Awareness in Diffractive Neural Networks Matan Kleiner et.al. 2408.06681 null
2024-08-12 Is it a work or leisure travel? Applying text classification to identify work-related travel on social networks Lucas Félix et.al. 2408.06341 null
2024-08-12 Audio Enhancement for Computer Audition -- An Iterative Training Paradigm Using Sample Importance Manuel Milling et.al. 2408.06264 null
2024-08-12 Deep Learning System Boundary Testing through Latent Space Style Mixing Amr Abdellatif et.al. 2408.06258 null
2024-08-12 Global-to-Local Support Spectrums for Language Model Explainability Lucas Agussurja et.al. 2408.05976 null
2024-08-12 A Simple Task-aware Contrastive Local Descriptor Selection Strategy for Few-shot Learning between inter class and intra class Qian Qiao et.al. 2408.05953 null
2024-08-12 Classifier Guidance Enhances Diffusion-based Adversarial Purification by Preserving Predictive Information Mingkun Zhang et.al. 2408.05900 null
2024-08-11 HiLight: A Hierarchy-aware Light Global Model with Hierarchical Local ConTrastive Learning Zhijian Chen et.al. 2408.05786 null
2024-08-11 PRECISe : Prototype-Reservation for Explainable Classification under Imbalanced and Scarce-Data Settings Vaibhav Ganatra et.al. 2408.05754 null
2024-08-11 Disposable-key-based image encryption for collaborative learning of Vision Transformer Rei Aso et.al. 2408.05737 null
2024-08-11 A Novel Momentum-Based Deep Learning Techniques for Medical Image Classification and Segmentation Koushik Biswas et.al. 2408.05692 null
2024-08-09 A conformalized learning of a prediction set with applications to medical imaging classification Roy Hirsch et.al. 2408.05037 null
2024-08-09 Generalisation First, Memorisation Second? Memorisation Localisation for Natural Language Classification Tasks Verna Dankers et.al. 2408.04965 null
2024-08-09 LiD-FL: Towards List-Decodable Federated Learning Hong Liu et.al. 2408.04963 null
2024-08-09 In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation Dahyun Kang et.al. 2408.04961 link
2024-08-08 Enhanced Prototypical Part Network (EPPNet) For Explainable Image Classification Via Prototypes Bhushan Atote et.al. 2408.04606 null
2024-08-08 SCENE: Evaluating Explainable AI Techniques Using Soft Counterfactuals Haoran Zheng et.al. 2408.04575 null
2024-08-08 An experimental comparative study of backpropagation and alternatives for training binary neural networks for image classification Ben Crulis et.al. 2408.04460 null
2024-08-08 Dual-branch PolSAR Image Classification Based on GraphMAE and Local Feature Extraction Yuchen Wang et.al. 2408.04294 null
2024-08-07 FMiFood: Multi-modal Contrastive Learning for Food Image Classification Xinyue Pan et.al. 2408.03922 null
2024-08-07 Leveraging Variation Theory in Counterfactual Data Augmentation for Optimized Active Learning Simret Araya Gebreegziabher et.al. 2408.03819 null
2024-08-07 Intuitionistic Fuzzy Cognitive Maps for Interpretable Image Classification Georgia Sovatzidi et.al. 2408.03745 null
2024-08-07 CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications Tianfang Zhang et.al. 2408.03703 link
2024-08-07 Designing Extremely Memory-Efficient CNNs for On-device Vision Tasks Jaewook Lee et.al. 2408.03663 null
2024-08-07 Making Robust Generalizers Less Rigid with Soft Ascent-Descent Matthew J. Holland et.al. 2408.03619 null
2024-08-06 AI Foundation Models in Remote Sensing: A Survey Siqi Lu et.al. 2408.03464 null
2024-08-06 Compress and Compare: Interactively Evaluating Efficiency and Behavior Across ML Model Compression Experiments Angie Boggust et.al. 2408.03274 null
2024-08-06 A Debiased Nearest Neighbors Framework for Multi-Label Text Classification Zifeng Cheng et.al. 2408.03202 null
2024-08-06 Leveraging Parameter Efficient Training Methods for Low Resource Text Classification: A Case Study in Marathi Pranita Deshmukh et.al. 2408.03172 null
2024-08-06 Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression Jonas Schmitt et.al. 2408.03046 null
2024-08-06 L3iTC at the FinLLM Challenge Task: Quantization for Financial Text Classification & Summarization Elvys Linhares Pontes et.al. 2408.03033 null
2024-08-06 Adversarial Robustness of Open-source Text Classification Models and Fine-Tuning Chains Hao Qin et.al. 2408.02963 null
2024-08-06 Dual-View Pyramid Pooling in Deep Neural Networks for Improved Medical Image Classification and Confidence Calibration Xiaoqing Zhang et.al. 2408.02906 null
2024-08-05 Interpretation of the Intent Detection Problem as Dynamics in a Low-dimensional Space Eduardo Sanchez-Karhunen et.al. 2408.02838 null
2024-08-05 Pre-trained Encoder Inference: Revealing Upstream Encoders In Downstream Machine Learning Services Shaopeng Fu et.al. 2408.02814 null
2024-08-05 FPT+: A Parameter and Memory Efficient Transfer Learning Method for High-resolution Medical Image Classification Yijin Huang et.al. 2408.02426 null
2024-08-05 On the Robustness of Malware Detectors to Adversarial Samples Muhammad Salman et.al. 2408.02310 null
2024-08-05 Low-Cost Self-Ensembles Based on Multi-Branch Transformation and Grouped Convolution Hojung Lee et.al. 2408.02307 null
2024-08-05 Network Fission Ensembles for Low-Cost Self-Ensembles Hojung Lee et.al. 2408.02301 null
2024-08-04 VidModEx: Interpretable and Efficient Black Box Model Extraction for High-Dimensional Spaces Somnath Sendhil Kumar et.al. 2408.02140 null
2024-08-04 DeMansia: Mamba Never Forgets Any Tokens Ricky Fang et.al. 2408.01986 null
2024-08-06 A Survey and Evaluation of Adversarial Attacks for Object Detection Khoi Nguyen Tiet Nguyen et.al. 2408.01934 null
2024-08-03 Safe Semi-Supervised Contrastive Learning Using In-Distribution Data as Positive Examples Min Gu Kwak et.al. 2408.01872 null
2024-08-03 LAM3D: Leveraging Attention for Monocular 3D Object Detection Diana-Alexandra Sas et.al. 2408.01739 null
2024-08-02 Counterfactual Explanations for Medical Image Classification and Regression using Diffusion Autoencoder Matan Atad et.al. 2408.01571 null
2024-08-02 Spatial-Spectral Morphological Mamba for Hyperspectral Image Classification Muhammad Ahmad et.al. 2408.01372 null
2024-08-02 WaveMamba: Spatial-Spectral Wavelet Mamba for Hyperspectral Image Classification Muhammad Ahmad et.al. 2408.01231 null
2024-08-02 Multi-head Spatial-Spectral Mamba for Hyperspectral Image Classification Muhammad Ahmad et.al. 2408.01224 null
2024-08-02 Rethinking Pre-trained Feature Extractor Selection in Multiple Instance Learning for Whole Slide Image Classification Bryan Wong et.al. 2408.01167 null
2024-08-01 CERT-ED: Certifiably Robust Text Classification for Edit Distance Zhuoqun Huang et.al. 2408.00728 null
2024-08-01 Deep Learning in Medical Image Classification from MRI-based Brain Tumor Images Xiaoyi Liu et.al. 2408.00636 null
2024-08-01 DECIDER: Leveraging Foundation Model Priors for Improved Model Failure Detection and Explanation Rakshith Subramanyam et.al. 2408.00331 null
2024-07-31 Vera Verto: Multimodal Hijacking Attack Minxing Zhang et.al. 2408.00129 null
2024-07-31 Learning Video Context as Interleaved Multimodal Sequences Kevin Qinghong Lin et.al. 2407.21757 null
2024-07-30 Contrasting Deep Learning Models for Direct Respiratory Insufficiency Detection Versus Blood Oxygen Saturation Estimation Marcelo Matheus Gauy et.al. 2407.20989 null
2024-07-30 Faithful and Plausible Natural Language Explanations for Image Classification: A Pipeline Approach Adam Wojciechowski et.al. 2407.20899 null
2024-08-01 DFE-IANet: A Method for Polyp Image Classification Based on Dual-domain Feature Extraction and Interaction Attention Wei Wang et.al. 2407.20843 null
2024-08-01 The Susceptibility of Example-Based Explainability Methods to Class Outliers Ikhtiyor Nematov et.al. 2407.20678 null
2024-07-30 Knowledge Fused Recognition: Fusing Hierarchical Knowledge for Image Recognition through Quantitative Relativity Modeling and Deep Metric Learning Yunfeng Zhao et.al. 2407.20600 null
2024-07-30 Exploring Liquid Neural Networks on Loihi-2 Wiktoria Agata Pawlak et.al. 2407.20590 null
2024-07-29 Graphite: A Graph-based Extreme Multi-Label Short Text Classifier for Keyphrase Recommendation Ashirbad Mishra et.al. 2407.20462 null
2024-07-29 Diffusion Feedback Helps CLIP See Better Wenxuan Wang et.al. 2407.20171 null
2024-07-29 Distilling High Diagnostic Value Patches for Whole Slide Image Classification Using Attention Mechanism Tianhang Nan et.al. 2407.19821 null
2024-07-28 Competition-based Adaptive ReLU for Deep Neural Networks Junjia Chen et.al. 2407.19441 null
2024-07-28 Depth-Wise Convolutions in Vision Transformers for Efficient Training on Small Datasets Tianxiao Zhang et.al. 2407.19394 link
2024-07-27 Inference-Time Selective Debiasing Gleb Kuzmin et.al. 2407.19345 null
2024-07-27 Stellar Blend Image Classification Using Computationally Efficient Gaussian Processes Chinedu Eleh et.al. 2407.19297 null
2024-07-27 Towards Robust Few-shot Class Incremental Learning in Audio Classification using Contrastive Representation Riyansha Singh et.al. 2407.19265 null
2024-07-27 A Survey of Malware Detection Using Deep Learning Ahmed Bensaoud et.al. 2407.19153 null
2024-07-26 UniForensics: Face Forgery Detection via General Facial Representation Ziyuan Fang et.al. 2407.19079 null
2024-07-26 A Scalable Quantum Non-local Neural Network for Image Classification Sparsh Gupta et.al. 2407.18906 link
2024-07-26 Unifying Visual and Semantic Feature Spaces with Diffusion Models for Enhanced Cross-Modal Alignment Yuze Zheng et.al. 2407.18854 null
2024-07-26 Local Binary Pattern(LBP) Optimization for Feature Extraction Zeinab Sedaghatjoo et.al. 2407.18665 null
2024-07-26 Topology Optimization of Random Memristors for Input-Aware Dynamic SNN Bo Wang et.al. 2407.18625 null
2024-07-26 Content-driven Magnitude-Derivative Spectrum Complementary Learning for Hyperspectral Image Classification Huiyan Bai et.al. 2407.18593 null
2024-07-26 VSSD: Vision Mamba with Non-Casual State Space Duality Yuheng Shi et.al. 2407.18559 link
2024-07-25 Self-supervised pre-training with diffusion model for few-shot landmark detection in x-ray images Roberto Di Via et.al. 2407.18125 null
2024-07-25 Mew: Multiplexed Immunofluorescence Image Analysis through an Efficient Multiplex Network Sukwon Yun et.al. 2407.17857 link
2024-07-25 SAM-MIL: A Spatial Contextual Aware Multiple Instance Learning Approach for Whole Slide Image Classification Heng Fang et.al. 2407.17689 link
2024-07-26 Unsqueeze [CLS] Bottleneck to Learn Rich Representations Qing Su et.al. 2407.17671 link
2024-07-24 Explaining the Model, Protecting Your Data: Revealing and Mitigating the Data Privacy Risks of Post-Hoc Model Explanations via Membership Inference Catherine Huang et.al. 2407.17663 null
2024-07-23 S-E Pipeline: A Vision Transformer (ViT) based Resilient Classification Pipeline for Medical Imaging Against Adversarial Attacks Neha A S et.al. 2407.17587 null
2024-07-24 A Novel Two-Step Fine-Tuning Pipeline for Cold-Start Active Learning in Text Classification Tasks Fabiano Belém et.al. 2407.17284 null
2024-07-24 Graph Neural Networks: A suitable Alternative to MLPs in Latent 3D Medical Image Classification? Johannes Kiechle et.al. 2407.17219 link
2024-07-24 Quanv4EO: Empowering Earth Observation by means of Quanvolutional Neural Networks Alessandro Sebastianelli et.al. 2407.17108 null
2024-07-24 An Adaptive Gradient Regularization Method Huixiu Jiang et.al. 2407.16944 null
2024-07-23 Lawma: The Power of Specialization for Legal Tasks Ricardo Dominguez-Olmedo et.al. 2407.16615 null
2024-07-23 Deep Bayesian segmentation for colon polyps: Well-calibrated predictions in medical imaging Daniela L. Ramos et.al. 2407.16608 null
2024-07-23 Designing robust diffractive neural networks with improved transverse shift tolerance Daniil V. Soshnikov et.al. 2407.16456 null
2024-07-23 Image Classification using Fuzzy Pooling in Convolutional Kolmogorov-Arnold Networks Ayan Igali et.al. 2407.16268 null
2024-07-23 HSVLT: Hierarchical Scale-Aware Vision-Language Transformer for Multi-Label Image Classification Shuyi Ouyang et.al. 2407.16244 null
2024-07-23 Improved Few-Shot Image Classification Through Multiple-Choice Questions Dipika Khullar et.al. 2407.16145 null
2024-07-22 Pavement Fatigue Crack Detection and Severity Classification Based on Convolutional Neural Network Zhen Wang et.al. 2407.16021 null
2024-07-22 AIDE: Antithetical, Intent-based, and Diverse Example-Based Explanations Ikhtiyor Nematov et.al. 2407.16010 null
2024-07-22 Comprehensive Study on Performance Evaluation and Optimization of Model Compression: Bridging Traditional Deep Learning and Large Language Models Aayush Saxena et.al. 2407.15904 null
2024-07-22 Beyond Size and Class Balance: Alpha as a New Dataset Quality Metric for Deep Learning Josiah Couch et.al. 2407.15724 null
2024-07-22 Retinomorphic Feature Detection and Machine Vision in a Network Laser Wai Kit Ng et.al. 2407.15558 null
2024-07-22 Learning deep illumination-robust features from multispectral filter array images Anis Amziane et.al. 2407.15472 null
2024-07-22 Is user feedback always informative? Retrieval Latent Defending for Semi-Supervised Domain Adaptation without Source Data Junha Song et.al. 2407.15383 null
2024-07-22 FMDNN: A Fuzzy-guided Multi-granular Deep Neural Network for Histopathological Image Classification Weiping Ding et.al. 2407.15312 null
2024-07-21 Assessing Sample Quality via the Latent Space of Generative Models Jingyi Xu et.al. 2407.15171 null
2024-07-21 A multi-level multi-label text classification dataset of 19th century Ottoman and Russian literary and critical texts Gokcen Gokceoglu et.al. 2407.15136 null
2024-07-20 Toward Efficient Convolutional Neural Networks With Structured Ternary Patterns Christos Kyrkou et.al. 2407.14831 link
2024-07-20 Subgraph Clustering and Atom Learning for Improved Image Classification Aryan Singh et.al. 2407.14772 null
2024-07-20 A Comprehensive Review of Few-shot Action Recognition Yuyang Wanyan et.al. 2407.14744 null
2024-07-19 DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks Sarah Jabbour et.al. 2407.14509 null
2024-07-19 Enhancing Zero-shot Audio Classification using Sound Attribute Knowledge from Large Language Models Xuenan Xu et.al. 2407.14355 null
2024-07-19 EmoCAM: Toward Understanding What Drives CNN-based Emotion Recognition Youssef Doulfoukar et.al. 2407.14314 null
2024-07-18 CoAPT: Context Attribute words for Prompt Tuning Gun Lee et.al. 2407.13808 null
2024-07-18 GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model Abdelrahman Shaker et.al. 2407.13772 link
2024-07-18 Addressing Imbalance for Class Incremental Learning in Medical Image Classification Xuze Hao et.al. 2407.13768 null
2024-07-18 Differential Privacy Mechanisms in Neural Tangent Kernel Regression Jiuxiang Gu et.al. 2407.13621 null
2024-07-18 CycleMix: Mixing Source Domains for Domain Generalization in Style-Dependent Data Aristotelis Ballas et.al. 2407.13421 link
2024-07-17 LookupViT: Compressing visual information to a limited number of tokens Rajat Koner et.al. 2407.12753 null
2024-07-17 Toward INT4 Fixed-Point Training via Exploring Quantization Error for Gradients Dohyung Kim et.al. 2407.12637 null
2024-07-17 Domain-specific or Uncertainty-aware models: Does it really make a difference for biomedical text classification? Aman Sinha et.al. 2407.12626 null
2024-07-18 Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks Antoni Kowalczuk et.al. 2407.12588 link
2024-07-17 Non-parametric regularization for class imbalance federated medical image classification Jeffry Wicaksana et.al. 2407.12446 link
2024-07-17 FETCH: A Memory-Efficient Replay Approach for Continual Learning in Image Classification Markus Weißflog et.al. 2407.12375 null
2024-07-17 Adaptive Cascading Network for Continual Test-Time Adaptation Kien X. Nguyen et.al. 2407.12240 null
2024-07-16 Generalized Coverage for More Robust Low-Budget Active Learning Wonho Bae et.al. 2407.12212 null
2024-07-18 A Closer Look at Benchmarking Self-Supervised Pre-training with Image Classification Markus Marks et.al. 2407.12210 null
2024-07-16 Novel Artistic Scene-Centric Datasets for Effective Transfer Learning in Fragrant Spaces Shumei Liu et.al. 2407.11701 null
2024-07-16 Probing the Efficacy of Federated Parameter-Efficient Fine-Tuning of Vision Transformers for Medical Image Classification Naif Alkhunaizi et.al. 2407.11573 null
2024-07-16 TCFormer: Visual Recognition via Token Clustering Transformer Wang Zeng et.al. 2407.11321 link
2024-07-16 PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer Pierre-David Letourneau et.al. 2407.11306 null
2024-07-15 Unconstrained Open Vocabulary Image Classification: Zero-Shot Transfer from Text to Image via CLIP Inversion Philipp Allgeuer et.al. 2407.11211 null
2024-07-16 DataDream: Few-shot Guided Dataset Generation Jae Myung Kim et.al. 2407.10910 link
2024-07-15 Pathology-knowledge Enhanced Multi-instance Prompt Learning for Few-shot Whole Slide Image Classification Linhao Qu et.al. 2407.10814 null
2024-07-15 Employing Sentence Space Embedding for Classification of Data Stream from Fake News Domain Paweł Zyblewski et.al. 2407.10807 null
2024-07-15 Anticipating Future Object Compositions without Forgetting Youssef Zahran et.al. 2407.10723 null
2024-07-15 GeoMix: Towards Geometry-Aware Data Augmentation Wentao Zhao et.al. 2407.10681 link
2024-07-15 Learning Natural Consistency Representation for Face Forgery Video Detection Daichi Zhang et.al. 2407.10550 null
2024-07-15 Improving Hyperbolic Representations via Gromov-Wasserstein Regularization Yifei Yang et.al. 2407.10495 null
2024-07-15 Backdoor Attacks against Image-to-Image Networks Wenbo Jiang et.al. 2407.10445 null
2024-07-14 Deep Learning Algorithms for Early Diagnosis of Acute Lymphoblastic Leukemia Dimitris Papaioannou et.al. 2407.10251 null
2024-07-14 Advancing Continual Learning for Robust Deepfake Audio Classification Feiyi Dong et.al. 2407.10108 null
2024-07-12 Evaluating the Adversarial Robustness of Semantic Segmentation: Trying Harder Pays Off Levente Halmosi et.al. 2407.09150 link
2024-07-12 Open Vocabulary Multi-Label Video Classification Rohit Gupta et.al. 2407.09073 null
2024-07-12 GPC: Generative and General Pathology Image Classifier Anh Tien Nguyen et.al. 2407.09035 null
2024-07-12 CAMP: Continuous and Adaptive Learning Model in Pathology Anh Tien Nguyen et.al. 2407.09030 null
2024-07-12 SlideGCD: Slide-based Graph Collaborative Training with Knowledge Distillation for Whole Slide Image Classification Tong Shu et.al. 2407.08968 null
2024-07-12 Domain-Hierarchy Adaptation via Chain of Iterative Reasoning for Few-shot Hierarchical Text Classification Ke Ji et.al. 2407.08959 null
2024-07-11 Local Clustering for Lung Cancer Image Classification via Sparse Solution Technique Jackson Hamel et.al. 2407.08800 null
2024-07-11 Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification Wenshuo Peng et.al. 2407.08787 null
2024-07-11 ElasticAST: An Audio Spectrogram Transformer for All Length and Resolutions Jiu Feng et.al. 2407.08691 link
2024-07-11 Histopathological Image Classification with Cell Morphology Aware Deep Neural Networks Andrey Ignatov et.al. 2407.08625 link
2024-07-11 BiasPruner: Debiased Continual Learning for Medical Image Classification Nourhan Bayasi et.al. 2407.08609 link
2024-07-11 GraphMamba: An Efficient Graph Structure Learning Vision Mamba for Hyperspectral Image Classification Aitao Yang et.al. 2407.08255 link
2024-07-11 Beyond Text: Leveraging Multi-Task Learning and Cognitive Appraisal Theory for Post-Purchase Intention Analysis Gerard Christopher Yeo et.al. 2407.08182 null
2024-07-11 Enrich the content of the image Using Context-Aware Copy Paste Qiushi Guo et.al. 2407.08151 null
2024-07-10 MambaVision: A Hybrid Mamba-Transformer Vision Backbone Ali Hatamizadeh et.al. 2407.08083 link
2024-07-10 The Misclassification Likelihood Matrix: Some Classes Are More Likely To Be Misclassified Than Others Daniel Sikar et.al. 2407.07818 null
2024-07-11 Trainable Highly-expressive Activation Functions Irit Chelly et.al. 2407.07564 null
2024-07-10 HDKD: Hybrid Data-Efficient Knowledge Distillation Network for Medical Image Classification Omar S. EL-Assiouti et.al. 2407.07516 null
2024-07-10 Towards a text-based quantitative and explainable histopathology image analysis Anh Tien Nguyen et.al. 2407.07360 null
2024-07-11 FALFormer: Feature-aware Landmarks self-attention for Whole-slide Image Classification Doanh C. Bui et.al. 2407.07340 link
2024-07-10 Dual-stage Hyperspectral Image Classification Model with Spectral Supertoken Peifu Liu et.al. 2407.07307 link
2024-07-09 Exploring Camera Encoder Designs for Autonomous Driving Perception Barath Lakshmanan et.al. 2407.07276 null
2024-07-09 CTRL-F: Pairing Convolution with Transformer for Image Classification via Multi-Level Feature Cross-Attention and Representation Learning Fusion Hosam S. EL-Assiouti et.al. 2407.06673 null
2024-07-09 NoisyAG-News: A Benchmark for Addressing Instance-Dependent Noise in Text Classification Hongfei Huang et.al. 2407.06579 null
2024-07-08 Hybrid Classical-Quantum architecture for vectorised image classification of hand-written sketches Y. Cordero et.al. 2407.06416 null
2024-07-08 GeoWATCH for Detecting Heavy Construction in Heterogeneous Time Series of Satellite Images Jon Crall et.al. 2407.06337 null
2024-07-08 Multi-Label Plant Species Classification with Self-Supervised Vision Transformers Murilo Gustineli et.al. 2407.06298 link
2024-07-08 Active Label Refinement for Robust Training of Imbalanced Medical Image Classification Tasks in the Presence of High Label Noise Bidur Khanal et.al. 2407.05973 null
2024-07-08 Wavelet Convolutions for Large Receptive Fields Shahaf E. Finder et.al. 2407.05848 link
2024-07-08 Evaluating the Fairness of Neural Collapse in Medical Image Classification Kaouther Mouheb et.al. 2407.05843 null
2024-07-08 Learning to Adapt Category Consistent Meta-Feature of CLIP for Few-Shot Classification Jiaying Shi et.al. 2407.05647 null
2024-07-08 New Directions in Text Classification Research: Maximizing The Performance of Sentiment Classification from Limited Data Surya Agustian et.al. 2407.05627 null
2024-07-08 Momentum Auxiliary Network for Supervised Local Learning Junhao Su et.al. 2407.05623 link
2024-07-08 Open-world Multi-label Text Classification with Extremely Weak Supervision Xintong Li et.al. 2407.05609 link
2024-07-08 FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance Jiedong Zhuang et.al. 2407.05578 null
2024-07-08 An accurate detection is not all you need to combat label noise in web-noisy datasets Paul Albert et.al. 2407.05528 null
2024-07-07 Leveraging Topological Guidance for Improved Knowledge Distillation Eun Som Jeon et.al. 2407.05316 link
2024-07-05 AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation Yuhan Zhu et.al. 2407.04603 null
2024-07-05 AMD: Automatic Multi-step Distillation of Large-scale Vision Models Cheng Han et.al. 2407.04208 null
2024-07-04 LeDNet: Localization-enabled Deep Neural Network for Multi-Label Radiography Image Classification Lalit Pant et.al. 2407.03931 null
2024-07-04 DocXplain: A Novel Model-Agnostic Explainability Method for Document Image Classification Saifullah Saifullah et.al. 2407.03830 null
2024-07-04 reBEN: Refined BigEarthNet Dataset for Remote Sensing Image Analysis Kai Norman Clasen et.al. 2407.03653 link
2024-07-04 Resampled Datasets Are Not Enough: Mitigating Societal Bias Beyond Single Attributes Yusuke Hirota et.al. 2407.03623 null
2024-07-04 Self Adaptive Threshold Pseudo-labeling and Unreliable Sample Contrastive Loss for Semi-supervised Image Classification Xuerong Zhang et.al. 2407.03596 null
2024-07-04 DGR-MIL: Exploring Diverse Global Representation in Multiple Instance Learning for Whole Slide Image Classification Wenhui Zhu et.al. 2407.03575 link
2024-07-03 A multicategory jet image classification framework using deep neural network Jairo Orozco Sandoval et.al. 2407.03524 null
2024-07-03 Model Guidance via Explanations Turns Image Classifiers into Segmentation Models Xiaoyan Yu et.al. 2407.03009 null
2024-07-03 ShiftAddAug: Augment Multiplication-Free Tiny Neural Network with Hybrid Computation Yipin Guo et.al. 2407.02881 null
2024-07-03 Fine-Grained Scene Image Classification with Modality-Agnostic Adapter Yiqun Wang et.al. 2407.02769 link
2024-07-03 ADFQ-ViT: Activation-Distribution-Friendly Post-Training Quantization for Vision Transformers Yanfeng Jiang et.al. 2407.02763 null
2024-07-02 Spectral Graph Reasoning Network for Hyperspectral Image Classification Huiling Wang et.al. 2407.02647 null
2024-07-01 CGRclust: Chaos Game Representation for Twin Contrastive Clustering of Unlabelled DNA Sequences Fatemeh Alipour et.al. 2407.02538 link
2024-07-02 Exploring the Role of Transliteration in In-Context Learning for Low-resource Languages Written in Non-Latin Scripts Chunlan Ma et.al. 2407.02320 null
2024-07-03 Federated Distillation for Medical Image Classification: Towards Trustworthy Computer-Aided Diagnosis Sufen Ren et.al. 2407.02261 null
2024-07-02 Hybrid Feature Collaborative Reconstruction Network for Few-Shot Fine-Grained Image Classification Shulei Qiu et.al. 2407.02123 null
2024-07-01 Optimized Learning for X-Ray Image Classification for Multi-Class Disease Diagnoses with Accelerated Computing Strategies Sebastian A. Cruz Romero et.al. 2407.01705 null
2024-07-02 xLSTM-UNet can be an Effective 2D & 3D Medical Image Segmentation Backbone with Vision-LSTM (ViL) better than its Mamba Counterpart Tianrun Chen et.al. 2407.01530 link
2024-07-01 Scarecrow monitoring system:employing mobilenet ssd for enhanced animal supervision Balaji VS et.al. 2407.01435 null
2024-07-01 Semantic Compositions Enhance Vision-Language Contrastive Learning Maxwell Aladago et.al. 2407.01408 null
2024-07-01 GalLoP: Learning Global and Local Prompts for Vision-Language Models Marc Lafon et.al. 2407.01400 null
2024-07-01 Protecting Privacy in Classifiers by Token Manipulation Re'em Harel et.al. 2407.01334 null
2024-07-01 Gradient-based Class Weighting for Unsupervised Domain Adaptation in Dense Prediction Visual Tasks Roberto Alcover-Couso et.al. 2407.01327 null
2024-06-28 Extract More from Less: Efficient Fine-Grained Visual Recognition in Low-Data Regimes Dmitry Demidov et.al. 2406.19814 link
2024-06-27 Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads Ali Khaleghi Rahimian et.al. 2406.19391 link
2024-06-27 Learning Visual Conditioning Tokens to Correct Domain Shift for Fully Test-time Adaptation Yushun Tang et.al. 2406.19341 null
2024-06-27 Spiking Convolutional Neural Networks for Text Classification Changze Lv et.al. 2406.19230 link
2024-06-27 Adaptive Stochastic Weight Averaging Caglar Demir et.al. 2406.19092 link
2024-06-27 FedMLP: Federated Multi-Label Medical Image Classification under Task Heterogeneity Zhaobin Sun et.al. 2406.18995 link
2024-06-26 Detecting Machine-Generated Texts: Not Just "AI vs Humans" and Explainability is Complicated Jiazhou Ji et.al. 2406.18259 null
2024-06-26 ViT-1.58b: Mobile Vision Transformers in the 1-bit Era Zhengqing Yuan et.al. 2406.18051 null
2024-06-25 Benchmarking Deep Learning Models on NVIDIA Jetson Nano for Real-Time Systems: An Empirical Investigation Tushar Prasanna Swaminathan et.al. 2406.17749 link
2024-06-25 Structured Unrestricted-Rank Matrices for Parameter Efficient Fine-tuning Arijit Sehanobish et.al. 2406.17740 null
2024-06-25 BayTTA: Uncertainty-aware medical image classification with optimized test-time augmentation using Bayesian model averaging Zeinab Sherkatghanad et.al. 2406.17640 link
2024-06-26 Mitigate the Gap: Investigating Approaches for Improving Cross-Modal Alignment in CLIP Sedigheh Eslami et.al. 2406.17639 null
2024-06-25 Knowledge Distillation in Automated Annotation: Supervised Text Classification with LLM-Generated Training Labels Nicholas Pangakis et.al. 2406.17633 null
2024-06-25 Retrieval-style In-Context Learning for Few-shot Hierarchical Text Classification Huiyao Chen et.al. 2406.17534 link
2024-06-25 TSynD: Targeted Synthetic Data Generation for Enhanced Medical Image Classification Joshua Niemeijer et.al. 2406.17473 null
2024-06-25 Dynamic Scheduling for Vehicle-to-Vehicle Communications Enhanced Federated Learning Jintao Yan et.al. 2406.17470 null
2024-06-25 Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes Qi Ma et.al. 2406.17438 null
2024-06-25 Robustly Optimized Deep Feature Decoupling Network for Fatty Liver Diseases Detection Peng Huang et.al. 2406.17338 null
2024-06-24 Evaluation of Language Models in the Medical Context Under Resource-Constrained Settings Andrea Posada et.al. 2406.16611 link
2024-06-24 Improving robustness to corruptions with multiplicative weight perturbations Trung Trinh et.al. 2406.16540 null
2024-06-24 UNICAD: A Unified Approach for Attack Detection, Noise Reduction and Novel Class Identification Alvaro Lopez Pellicer et.al. 2406.16501 null
2024-06-24 Improving Quaternion Neural Networks with Quaternionic Activation Functions Johannes Pöppelbaum et.al. 2406.16481 null
2024-06-24 Learning in Wilson-Cowan model for metapopulation Raffaele Marino et.al. 2406.16453 link
2024-06-24 Context-augmented Retrieval: A Novel Framework for Fast Information Retrieval based Response Generation using Large Language Model Sai Ganesh et.al. 2406.16383 null
2024-06-24 Combining Supervised Learning and Reinforcement Learning for Multi-Label Classification Tasks with Partial Labels Zixia Jia et.al. 2406.16293 null
2024-06-23 Jacobian Descent for Multi-Objective Optimization Pierre Quinton et.al. 2406.16232 null
2024-06-23 Learning with Noisy Ground Truth: From 2D Classification to 3D Reconstruction Yangdi Lu et.al. 2406.15982 null
2024-06-22 PUDD: Towards Robust Multi-modal Prototype-based Deepfake Detection Alvaro Lopez Pellcier et.al. 2406.15921 null
2024-06-21 Retrieval Augmented Zero-Shot Text Classification Tassallah Abdullahi et.al. 2406.15241 null
2024-06-21 DiffExplainer: Unveiling Black Box Models Via Counterfactual Generation Yingying Fang et.al. 2406.15182 null
2024-06-21 This actually looks like that: Proto-BagNets for local and global interpretability-by-design Kerol Djoumessi et.al. 2406.15168 link
2024-06-21 Hierarchical thematic classification of major conference proceedings Arsentii Kuzmin et.al. 2406.14983 null
2024-06-21 Demonstrating the Efficacy of Kolmogorov-Arnold Networks in Vision Tasks Minjong Cheon et.al. 2406.14916 link
2024-06-21 MU-Bench: A Multitask Multimodal Benchmark for Machine Unlearning Jiali Cheng et.al. 2406.14796 null
2024-06-20 Depth $F_1$ : Improving Evaluation of Cross-Domain Text Classification by Measuring Semantic Generalizability Parker Seegmiller et.al. 2406.14695 null
2024-06-20 Automatic Labels are as Effective as Manual Labels in Biomedical Images Classification with Deep Learning Niccolò Marini et.al. 2406.14351 null
2024-06-20 Self-supervised Interpretable Concept-based Models for Text Classification Francesco De Santis et.al. 2406.14335 null
2024-06-20 Adaptive Adversarial Cross-Entropy Loss for Sharpness-Aware Minimization Tanapat Ratchatorn et.al. 2406.14329 null
2024-06-20 Boosting Hyperspectral Image Classification with Gate-Shift-Fuse Mechanisms in a Novel CNN-Transformer Approach Mohamed Fadhlallah Guerri et.al. 2406.14120 null
2024-06-20 Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images Qinfeng Zhu et.al. 2406.14086 link
2024-06-21 CMTNet: Convolutional Meets Transformer Network for Hyperspectral Images Classification Faxu Guo et.al. 2406.14080 null
2024-06-20 Communication-Efficient Adaptive Batch Size Strategies for Distributed Local Gradient Methods Tim Tsz-Kit Lau et.al. 2406.13936 null
2024-06-19 WATT: Weight Average Test-Time Adaption of CLIP David Osowiechi et.al. 2406.13875 link
2024-06-19 CNN Based Flank Predictor for Quadruped Animal Species Vanessa Suessle et.al. 2406.13588 null
2024-06-19 Online Domain-Incremental Learning Approach to Classify Acoustic Scenes in All Locations Manjunath Mulimani et.al. 2406.13386 null
2024-06-18 LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging Jinuk Kim et.al. 2406.12837 link
2024-06-18 Privacy Preserving Federated Learning in Medical Imaging with Uncertainty Estimation Nikolas Koutsoubis et.al. 2406.12815 link
2024-06-18 Online Anchor-based Training for Image Classification Tasks Maria Tzelepi et.al. 2406.12662 null
2024-06-18 Fighting Randomness with Randomness: Mitigating Optimisation Instability of Fine-Tuning using Delayed Ensemble and Noisy Interpolation Branislav Pecher et.al. 2406.12471 null
2024-06-18 GW-MoE: Resolving Uncertainty in MoE Router with Global Workspace Theory Haoze Wu et.al. 2406.12375 null
2024-06-18 What Did I Do Wrong? Quantifying LLMs' Sensitivity and Consistency to Prompt Engineering Federico Errica et.al. 2406.12334 null
2024-06-18 Unleashing the Potential of Open-set Noisy Samples Against Label Noise for Medical Image Classification Zehui Liao et.al. 2406.12293 null
2024-06-18 Advancing Cross-Domain Generalizability in Face Anti-Spoofing: Insights, Design, and Metrics Hyojin Kim et.al. 2406.12258 null
2024-06-19 MiSuRe is all you need to explain your image segmentation Syed Nouman Hasany et.al. 2406.12173 null
2024-06-17 Enhancing Text Classification through LLM-Driven Active Learning and Human Annotation Hamidreza Rouzegar et.al. 2406.12114 link
2024-06-17 Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99% Lei Zhu et.al. 2406.11837 link
2024-06-17 PrAViC: Probabilistic Adaptation Framework for Real-Time Video Classification Magdalena Trędowicz et.al. 2406.11443 null
2024-06-17 Cross-domain Open-world Discovery Shuo Wen et.al. 2406.11422 link
2024-06-17 BaFTA: Backprop-Free Test-Time Adaptation For Zero-Shot Vision-Language Models Xuefeng Hu et.al. 2406.11309 null
2024-06-17 An Empirical Investigation of Matrix Factorization Methods for Pre-trained Transformers Ashim Gupta et.al. 2406.11307 null
2024-06-17 Text Grafting: Near-Distribution Weak Supervision for Minority Classes in Text Classification Letian Peng et.al. 2406.11115 null
2024-06-16 Fine-grained Classes and How to Find Them Matej Grcić et.al. 2406.11070 link
2024-06-16 Leveraging Foundation Models for Multi-modal Federated Learning with Incomplete Modality Liwei Che et.al. 2406.11048 null
2024-06-16 Curating Stopwords in Marathi: A TF-IDF Approach for Improved Text Analysis and Information Retrieval Rohan Chavan et.al. 2406.11029 link
2024-06-16 Universal Cross-Lingual Text Classification Riya Savant et.al. 2406.11028 null
2024-06-14 UniAudio 1.5: Large Language Model-driven Audio Codec is A Few-shot Audio Task Learner Dongchao Yang et.al. 2406.10056 null
2024-06-14 Comparison of fine-tuning strategies for transfer learning in medical image classification Ana Davila et.al. 2406.10050 null
2024-06-14 Forgetting Order of Continual Learning: Examples That are Learned First are Forgotten Last Guy Hacohen et.al. 2406.09935 null
2024-06-13 MirrorCheck: Efficient Adversarial Defense for Vision-Language Models Samar Fares et.al. 2406.09250 null
2024-06-13 Self-Training for Sample-Efficient Active Learning for Text Classification with Pre-Trained Language Models Christopher Schröder et.al. 2406.09206 null
2024-06-13 Large-Scale Evaluation of Open-Set Image Classification Techniques Halil Bisgin et.al. 2406.09112 link
2024-06-13 LaCoOT: Layer Collapse through Optimal Transport Victor Quétu et.al. 2406.08933 null
2024-06-13 The Penalized Inverse Probability Measure for Conformal Classification Paul Melki et.al. 2406.08884 null
2024-06-13 Conceptual Learning via Embedding Approximations for Reinforcing Interpretability and Transparency Maor Dikter et.al. 2406.08840 link
2024-06-13 DenoiseReID: Denoising Model for Representation Learning of Person Re-Identification Zhengrui Xu et.al. 2406.08773 null
2024-06-12 Fine-Tuned 'Small' LLMs (Still) Significantly Outperform Zero-Shot Generative AI Models in Text Classification Martin Juan José Bucher et.al. 2406.08660 null
2024-06-12 Intelligent Multi-View Test Time Augmentation Efe Ozturk et.al. 2406.08593 null
2024-06-12 Transformation-Dependent Adversarial Attacks Yaoteng Tan et.al. 2406.08443 null
2024-06-12 AdaNCA: Neural Cellular Automata As Adaptors For More Robust Vision Transformer Yitao Xu et.al. 2406.08298 null
2024-06-12 DistilDoc: Knowledge Distillation for Visually-Rich Document Applications Jordy Van Landeghem et.al. 2406.08226 null
2024-06-12 Fully Few-shot Class-incremental Audio Classification Using Expandable Dual-embedding Extractor Yongjie Si et.al. 2406.08122 null
2024-06-12 Low-Complexity Acoustic Scene Classification Using Parallel Attention-Convolution Network Yanxiong Li et.al. 2406.08119 null
2024-06-12 A $^{2}$ -MAE: A spatial-temporal-spectral unified remote sensing pre-training method based on anchor-aware masked autoencoder Lixian Zhang et.al. 2406.08079 null
2024-06-12 Adversarial Evasion Attack Efficiency against Large Language Models João Vitorino et.al. 2406.08050 null
2024-06-12 Accurate Explanation Model for Image Classifiers using Class Association Embedding Ruitao Xie et.al. 2406.07961 link
2024-06-12 Multi-Teacher Multi-Objective Meta-Learning for Zero-Shot Hyperspectral Band Selection Jie Feng et.al. 2406.07949 null
2024-06-12 Small Scale Data-Free Knowledge Distillation He Liu et.al. 2406.07876 link
2024-06-11 fKAN: Fractional Kolmogorov-Arnold Networks with trainable Jacobi basis functions Alireza Afzal Aghaei et.al. 2406.07456 link
2024-06-11 Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach Challapalli Phanindra Revanth et.al. 2406.07332 null
2024-06-11 Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment Takuto Igarashi et.al. 2406.07280 null
2024-06-11 EEG-ImageNet: An Electroencephalogram Dataset and Benchmarks with Image Visual Stimuli of Multi-Granularity Labels Shuqi Zhu et.al. 2406.07151 link
2024-06-11 RS-Agent: Automating Remote Sensing Tasks through Intelligent Agents Wenjia Xu et.al. 2406.07089 null
2024-06-11 DualMamba: A Lightweight Spectral-Spatial Mamba-Convolution Network for Hyperspectral Image Classification Jiamu Sheng et.al. 2406.07050 null
2024-06-11 Fairness-Aware Meta-Learning via Nash Bargaining Yi Zeng et.al. 2406.07029 null
2024-06-11 Mitigating Boundary Ambiguity and Inherent Bias for Text Classification in the Era of Large Language Models Zhenyi Lu et.al. 2406.07001 link
2024-06-11 Scaling up masked audio encoder learning for general audio classification Heinrich Dinkel et.al. 2406.06992 null
2024-06-10 Multi-Objective Neural Architecture Search for In-Memory Computing Md Hasibul Amin et.al. 2406.06746 null
2024-06-10 Robust Latent Representation Tuning for Image-text Classification Hao Sun et.al. 2406.06048 null
2024-06-09 Contrastive Learning from Synthetic Audio Doppelgangers Manuel Cherep et.al. 2406.05923 null
2024-06-09 Scaling Graph Convolutions for Mobile Vision William Avery et.al. 2406.05850 link
2024-06-09 Evolution-aware VAriance (EVA) Coreset Selection for Medical Image Classification Yuxin Hong et.al. 2406.05677 null
2024-06-09 Which Backbone to Use: A Resource-efficient Domain Specific Comparison for Computer Vision Pranav Jeevan et.al. 2406.05612 link
2024-06-08 Aligning Human Knowledge with Visual Concepts Towards Explainable Medical Image Classification Yunhe Gao et.al. 2406.05596 null
2024-06-07 The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better Scott Geng et.al. 2406.05184 link
2024-06-07 A Novel Time Series-to-Image Encoding Approach for Weather Phenomena Classification Christian Giannetti et.al. 2406.05096 null
2024-06-07 Classification Metrics for Image Explanations: Towards Building Reliable XAI-Evaluations Benjamin Fresz et.al. 2406.05068 link
2024-06-07 REP: Resource-Efficient Prompting for On-device Continual Learning Sungho Jeon et.al. 2406.04772 null
2024-06-07 AICoderEval: Improving AI Domain Code Generation of Large Language Models Yinghui Xia et.al. 2406.04712 null
2024-06-07 Cooperative Meta-Learning with Gradient Augmentation Jongyun Shin et.al. 2406.04639 link
2024-06-06 OCCAM: Towards Cost-Efficient and Accuracy-Aware Image Classification Inference Dujian Ding et.al. 2406.04508 null
2024-06-06 Can Language Models Use Forecasting Strategies? Sarah Pratt et.al. 2406.04446 null
2024-06-06 Parameter-Inverted Image Pyramid Networks Xizhou Zhu et.al. 2406.04330 link
2024-06-07 BEADs: Bias Evaluation Across Domains Shaina Raza et.al. 2406.04220 null
2024-06-06 What Do Language Models Learn in Context? The Structured Task Hypothesis Jiaoda Li et.al. 2406.04216 null
2024-06-06 Pointer-Guided Pre-Training: Infusing Large Language Models with Paragraph-Level Contextual Awareness Lars Hillebrand et.al. 2406.04156 link
2024-06-07 ReDistill: Residual Encoded Distillation for Peak Memory Reduction Fang Chen et.al. 2406.03744 null
2024-06-06 LLMEmbed: Rethinking Lightweight LLM's Genuine Function in Text Classification Chun Liu et.al. 2406.03725 link
2024-06-05 Convolutional Neural Networks and Vision Transformers for Fashion MNIST Classification: A Literature Review Sonia Bbouzidi et.al. 2406.03478 null
2024-06-05 IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models David Ifeoluwa Adelani et.al. 2406.03368 null
2024-06-05 Audio Mamba: Bidirectional State Space Model for Audio Representation Learning Mehmet Hamza Erol et.al. 2406.03344 link
2024-06-05 FusionBench: A Comprehensive Benchmark of Deep Model Fusion Anke Tang et.al. 2406.03280 null
2024-06-05 VWise: A novel benchmark for evaluating scene classification for vehicular applications Pedro Azevedo et.al. 2406.03273 null
2024-06-05 Tiny models from tiny data: Textual and null-text inversion for few-shot distillation Erik Landolsi et.al. 2406.03146 link
2024-06-05 Exploiting LMM-based knowledge for image classification tasks Maria Tzelepi et.al. 2406.03071 null
2024-06-04 Randomized Geometric Algebra Methods for Convex Neural Networks Yifei Wang et.al. 2406.02806 null
2024-06-04 DL-KDD: Dual-Light Knowledge Distillation for Action Recognition in the Dark Chi-Jui Chang et.al. 2406.02468 null
2024-06-04 GrootVL: Tree Topology is All You Need in State Space Model Yicheng Xiao et.al. 2406.02395 link
2024-06-04 Hybrid Quantum-Classical Neural Network for LAB Color Space Image Classification Kwokho Ng et.al. 2406.02229 null
2024-06-03 Few-Shot Classification of Interactive Activities of Daily Living (InteractADL) Zane Durante et.al. 2406.01662 link
2024-06-03 CoLa-DCE -- Concept-guided Latent Diffusion Counterfactual Explanations Franz Motzkus et.al. 2406.01649 null
2024-06-03 Asynchronous Multi-Server Federated Learning for Geo-Distributed Clients Yuncong Zuo et.al. 2406.01439 null
2024-06-03 Compute-Efficient Medical Image Classification with Softmax-Free Transformers and Sequence Normalization Firas Khader et.al. 2406.01314 null
2024-06-03 Continuous Geometry-Aware Graph Diffusion via Hyperbolic Neural PDE Jiaxu Liu et.al. 2406.01282 null
2024-06-04 MultiMax: Sparse and Multi-Modal Attention Learning Yuxuan Zhou et.al. 2406.01189 link
2024-06-03 Synergizing Unsupervised and Supervised Learning: A Hybrid Approach for Accurate Natural Language Task Modeling Wrick Talukdar et.al. 2406.01096 null
2024-05-31 You Only Scan Once: Efficient Multi-dimension Sequential Modeling with LightNet Zhen Qin et.al. 2405.21022 null
2024-05-31 Investigating Calibration and Corruption Robustness of Post-hoc Pruned Perception CNNs: An Image Classification Benchmark Study Pallavi Mitra et.al. 2405.20876 null
2024-05-31 Improving Generalization and Convergence by Enhancing Implicit Regularization Mingze Wang et.al. 2405.20763 null
2024-05-31 Robust Stable Spiking Neural Networks Jianhao Ding et.al. 2405.20694 null
2024-05-31 Enhancing Counterfactual Image Generation Using Mahalanobis Distance with Distribution Preferences in Feature Space Yukai Zhang et.al. 2405.20685 null
2024-05-31 GenMix: Combining Generative and Mixture Data Augmentation for Medical Image Classification Hansang Lee et.al. 2405.20650 null
2024-05-31 ToxVidLLM: A Multimodal LLM-based Framework for Toxicity Detection in Code-Mixed Videos Krishanu Maity et.al. 2405.20628 null
2024-05-30 Mitigating the Impact of Labeling Errors on Training via Rockafellian Relaxation Louis L. Chen et.al. 2405.20531 null
2024-05-30 DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark Haoxing Chen et.al. 2405.19707 link
2024-05-30 A Novel Approach for Automated Design Information Mining from Issue Logs Jiuang Zhao et.al. 2405.19623 null
2024-05-29 I Bet You Did Not Mean That: Testing Semantic Importance via Betting Jacopo Teneggi et.al. 2405.19146 link
2024-05-29 Verifiably Robust Conformal Prediction Linus Jeary et.al. 2405.18942 null
2024-05-29 Leveraging Many-To-Many Relationships for Defending Against Visual-Language Adversarial Attacks Futa Waseda et.al. 2405.18770 null
2024-05-29 GIST: Greedy Independent Set Thresholding for Diverse Data Summarization Matthew Fahrbach et.al. 2405.18754 null
2024-05-29 LLM-based Hierarchical Concept Decomposition for Interpretable Fine-Grained Image Classification Renyi Qu et.al. 2405.18672 null
2024-05-28 Its Not a Modality Gap: Characterizing and Addressing the Contrastive Gap Abrar Fahim et.al. 2405.18570 null
2024-05-28 Why are Visually-Grounded Language Models Bad at Image Classification? Yuhui Zhang et.al. 2405.18415 link
2024-05-28 MSPE: Multi-Scale Patch Embedding Prompts Vision Transformers to Any Resolution Wenzhuo Liu et.al. 2405.18240 null
2024-05-28 Confidence-aware multi-modality learning for eye disease screening Ke Zou et.al. 2405.18167 link
2024-05-28 4-bit Shampoo for Memory-Efficient Network Training Sike Wang et.al. 2405.18144 null
2024-05-28 DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture Shentong Mo et.al. 2405.17995 null
2024-05-27 WASH: Train your Ensemble with Communication-Efficient Weight Shuffling, then Average Louis Fournier et.al. 2405.17517 null
2024-05-27 Model-Agnostic Zeroth-Order Policy Optimization for Meta-Learning of Ergodic Linear Quadratic Regulators Yunian Pan et.al. 2405.17370 null
2024-05-27 On the Noise Robustness of In-Context Learning for Text Generation Hongfu Gao et.al. 2405.17264 null
2024-05-27 Superpixelwise Low-rank Approximation based Partial Label Learning for Hyperspectral Image Classification Shujun Yang et.al. 2405.17110 link
2024-05-26 Demystify Mamba in Vision: A Linear Attention Perspective Dongchen Han et.al. 2405.16605 null
2024-05-26 AdaFisher: Adaptive Second Order Optimization via Fisher Information Damien Martins Gomes et.al. 2405.16397 null
2024-05-25 ModelLock: Locking Your Model With a Spell Yifeng Gao et.al. 2405.16285 null
2024-05-25 Accelerating Transformers with Spectrum-Preserving Token Merging Hoai-Chau Tran et.al. 2405.16148 null
2024-05-25 Breaking the False Sense of Security in Backdoor Defense through Re-Activation Attack Mingli Zhu et.al. 2405.16134 null
2024-05-24 Grounding Stylistic Domain Generalization with Quantitative Domain Shift Measures and Synthetic Scene Images Yiran Luo et.al. 2405.15961 null
2024-05-24 A Neurosymbolic Framework for Bias Correction in CNNs Parth Padalkar et.al. 2405.15886 null
2024-05-24 What Do You See? Enhancing Zero-Shot Image Classification with Multimodal Large Language Models Abdelrahman Abdelhamed et.al. 2405.15668 null
2024-05-24 Class Machine Unlearning for Complex Data via Concepts Inference and Data Poisoning Wenhan Chang et.al. 2405.15662 null
2024-05-24 Exposing Image Classifier Shortcuts with Counterfactual Frequency (CoF) Tables James Hinns et.al. 2405.15661 null
2024-05-24 Harnessing Increased Client Participation with Cohort-Parallel Federated Learning Akash Dhasade et.al. 2405.15644 null
2024-05-24 Transformer-based Federated Learning for Multi-Label Remote Sensing Image Classification Barış Büyüktaş et.al. 2405.15405 null
2024-05-24 CLIP model is an Efficient Online Lifelong Learner Leyuan Wang et.al. 2405.15155 null
2024-05-24 OptLLM: Optimal Assignment of Queries to Large Language Models Yueyue Liu et.al. 2405.15130 null
2024-05-23 A Lost Opportunity for Vision-Language Models: A Comparative Study of Online Test-time Adaptation for Vision-Language Models Mario Döbler et.al. 2405.14977 link
2024-05-23 Domain Wall Magnetic Tunnel Junction Reliable Integrate and Fire Neuron Can Cui1 et.al. 2405.14851 null
2024-05-23 Explaining Black-box Model Predictions via Two-level Nested Feature Attributions with Consistency Property Yuya Yoshikawa et.al. 2405.14522 null
2024-05-23 SIAVC: Semi-Supervised Framework for Industrial Accident Video Classification Zuoyong Li et.al. 2405.14506 null
2024-05-23 Scalable Visual State Space Model with Fractal Scanning Lv Tang et.al. 2405.14480 null
2024-05-23 Segformer++: Efficient Token-Merging Strategies for High-Resolution Semantic Segmentation Daniel Kienzle et.al. 2405.14467 null
2024-05-23 Boosting Robustness by Clipping Gradients in Distributed Learning Youssef Allouah et.al. 2405.14432 null
2024-05-23 Advancing Spiking Neural Networks for Sequential Modeling with Central Pattern Generators Changze Lv et.al. 2405.14362 null
2024-05-23 Simple Hamiltonian dynamics is a powerful quantum processing resource Akitada Sakurai et.al. 2405.14245 null
2024-05-23 ChronosLex: Time-aware Incremental Training for Temporal Generalization of Legal Classification Tasks T. Y. S. S Santosh et.al. 2405.14211 null
2024-05-22 Just rotate it! Uncertainty estimation in closed-source models via multiple queries Konstantinos Pitas et.al. 2405.13864 null
2024-05-21 Decentralized Federated Learning Over Imperfect Communication Channels Weicai Li et.al. 2405.12894 null
2024-05-21 Multimodal Adaptive Inference for Document Image Classification with Anytime Early Exiting Omar Hamed et.al. 2405.12705 null
2024-05-21 Exploration of Masked and Causal Language Modelling for Text Generation Nicolo Micheletti et.al. 2405.12630 null
2024-05-21 3DSS-Mamba: 3D-Spectral-Spatial Mamba for Hyperspectral Image Classification Yan He et.al. 2405.12487 null
2024-05-20 Alzheimer's Magnetic Resonance Imaging Classification Using Deep and Meta-Learning Models Nida Nasir et.al. 2405.12126 null
2024-05-20 Mamba-in-Mamba: Centralized Mamba-Cross-Scan in Tokenized Mamba Model for Hyperspectral Image Classification Weilian Zhou et.al. 2405.12003 link
2024-05-20 A Constraint-Enforcing Reward for Adversarial Attacks on Text Classifiers Tom Roth et.al. 2405.11904 null
2024-05-21 A Novel Cartography-Based Curriculum Learning Method Applied on RoNLI: The First Romanian Natural Language Inference Corpus Eduard Poesina et.al. 2405.11877 link
2024-05-20 SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model Siavash Shams et.al. 2405.11831 link
2024-05-20 Exploring Ordinality in Text Classification: A Comparative Study of Explicit and Implicit Techniques Siva Rajesh Kasa et.al. 2405.11775 null
2024-05-19 SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization Jialong Guo et.al. 2405.11582 link
2024-05-19 Reproducibility Study of CDUL: CLIP-Driven Unsupervised Learning for Multi-Label Image Classification Manan Shah et.al. 2405.11574 link
2024-05-19 An Invisible Backdoor Attack Based On Semantic Feature Yangming Chen et.al. 2405.11551 null
2024-05-19 Verification technology for finger vein biometric George Kumi Kyeremeh et.al. 2405.11540 null
2024-05-17 Reduced storage direct tensor ring decomposition for convolutional neural networks compression Mateusz Gabor et.al. 2405.10802 link
2024-05-17 Benchmarking Large Language Models on CFLUE -- A Chinese Financial Language Understanding Evaluation Dataset Jie Zhu et.al. 2405.10542 link
2024-05-17 Smart Expert System: Large Language Models as Text Classifiers Zhiqiang Wang et.al. 2405.10523 link
2024-05-16 Data-Efficient Low-Complexity Acoustic Scene Classification in the DCASE 2024 Challenge Florian Schmid et.al. 2405.10018 null
2024-05-16 ROCOv2: Radiology Objects in COntext Version 2, an Updated Multimodal Image Dataset Johannes Rückert et.al. 2405.10004 link
2024-05-15 Improving Label Error Detection and Elimination with Uncertainty Quantification Johannes Jakubik et.al. 2405.09602 null
2024-05-15 Tackling Distribution Shifts in Task-Oriented Communication with Information Bottleneck Hongru Li et.al. 2405.09514 null
2024-05-15 Feature-based Federated Transfer Learning: Communication Efficiency, Robustness and Privacy Feng Wang et.al. 2405.09014 link
2024-05-14 The Pitfalls and Promise of Conformal Inference Under Adversarial Attacks Ziquan Liu et.al. 2405.08886 link
2024-05-14 Harnessing the power of longitudinal medical imaging for eye disease prognosis using Transformer-based sequence modeling Gregory Holste et.al. 2405.08780 null
2024-05-14 FolkTalent: Enhancing Classification and Tagging of Indian Folk Paintings Nancy Hada et.al. 2405.08776 null
2024-05-14 The impact of Compositionality in Zero-shot Multi-label action recognition for Object-based tasks Carmela Calabrese et.al. 2405.08695 null
2024-05-14 Achieving Fairness Through Channel Pruning for Dermatological Disease Diagnosis Qingpeng Kong et.al. 2405.08681 link
2024-05-14 Investigating Design Choices in Joint-Embedding Predictive Architectures for General Audio Representation Learning Alain Riou et.al. 2405.08679 null
2024-05-14 Dual-Branch Network for Portrait Image Quality Assessment Wei Sun et.al. 2405.08555 null
2024-05-13 Who's in and who's out? A case study of multimodal CLIP-filtering in DataComp Rachel Hong et.al. 2405.08209 link
2024-05-14 MambaOut: Do We Really Need Mamba for Vision? Weihao Yu et.al. 2405.07992 link
2024-05-13 Constrained Exploration via Reflected Replica Exchange Stochastic Gradient Langevin Dynamics Haoyang Zheng et.al. 2405.07839 link
2024-05-13 Analysis of the rate of convergence of an over-parametrized convolutional neural network image classifier learned by gradient descent Michael Kohler et.al. 2405.07619 null
2024-05-13 On-device Online Learning and Semantic Management of TinyML Systems Haoyu Ren et.al. 2405.07601 null
2024-05-13 GLiRA: Black-Box Membership Inference Attack via Knowledge Distillation Andrey V. Galichin et.al. 2405.07562 null
2024-05-13 Fine-tuning the SwissBERT Encoder Model for Embedding Sentences and Documents Juri Grosjean et.al. 2405.07513 null
2024-05-13 MoVL:Exploring Fusion Strategies for the Domain-Adaptive Application of Pretrained Models in Medical Imaging Tasks Haijiang Tian et.al. 2405.07411 null
2024-05-12 Explainable Convolutional Neural Networks for Retinal Fundus Classification and Cutting-Edge Segmentation Models for Retinal Blood Vessels from Fundus Images Fatema Tuj Johora Faria et.al. 2405.07338 null
2024-05-12 Differentiable Model Scaling using Differentiable Topk Kai Liu et.al. 2405.07194 null
2024-05-11 A framework of text-dependent speaker verification for chinese numerical string corpus Litong Zheng et.al. 2405.07029 null
2024-05-10 Pseudo-Prompt Generating in Pre-trained Vision-Language Models for Multi-Label Medical Image Classification Yaoqin Ye et.al. 2405.06468 null
2024-05-10 Multi-level Personalized Federated Learning on Heterogeneous and Long-Tailed Data Rongyu Zhang et.al. 2405.06413 null
2024-05-10 SaudiBERT: A Large Language Model Pretrained on Saudi Dialect Corpora Faisal Qarah et.al. 2405.06239 null
2024-05-09 Deep Multi-Task Learning for Malware Image Classification Ahmed Bensaoud et.al. 2405.05906 null
2024-05-09 Enhancing Suicide Risk Detection on Social Media through Semi-Supervised Deep Label Smoothing Matthew Squires et.al. 2405.05795 null
2024-05-09 CSA-Net: Channel-wise Spatially Autocorrelated Attention Networks Nick et.al. 2405.05755 null
2024-05-09 How Quality Affects Deep Neural Networks in Fine-Grained Image Classification Joseph Smith et.al. 2405.05742 null
2024-05-09 End-to-End Generative Semantic Communication Powered by Shared Semantic Knowledge Base Shuling Li et.al. 2405.05738 null
2024-05-09 Using Machine Translation to Augment Multilingual Classification Adam King et.al. 2405.05478 null
2024-05-08 AFEN: Respiratory Disease Classification using Ensemble Learning Rahul Nadkarni et.al. 2405.05467 null
2024-05-08 XAMPLER: Learning to Retrieve Cross-Lingual In-Context Examples Peiqin Lin et.al. 2405.05116 link
2024-05-08 Explanation as a Watermark: Towards Harmless and Multi-bit Model Ownership Verification via Watermarking Feature Attribution Shuo Shao et.al. 2405.04825 null
2024-05-07 Exploring Explainable AI Techniques for Improved Interpretability in Lung and Colon Cancer Classification Mukaffi Bin Moin et.al. 2405.04610 link
2024-05-07 Pragmatist Intelligence: Where the Principle of Usefulness Can Take ANNs Antonio Bikić et.al. 2405.04386 null
2024-05-07 Semi-Supervised Disease Classification based on Limited Medical Image Data Yan Zhang et.al. 2405.04295 null
2024-05-07 DCNN: Dual Cross-current Neural Networks Realized Using An Interactive Deep Learning Discriminator for Fine-grained Objects Da Fu et.al. 2405.04093 null
2024-05-07 Feature Map Convergence Evaluation for Functional Module Ludan Zhang et.al. 2405.04041 null
2024-05-07 VMambaCC: A Visual State Space Model for Crowd Counting Hao-Yuan Ma et.al. 2405.03978 null
2024-05-06 On Adversarial Examples for Text Classification by Perturbing Latent Representations Korn Sooksatra et.al. 2405.03789 null
2024-05-06 CICA: Content-Injected Contrastive Alignment for Zero-Shot Document Image Classification Sankalp Sinha et.al. 2405.03660 null
2024-05-06 Deep Space Separable Distillation for Lightweight Acoustic Scene Classification ShuQi Ye et.al. 2405.03567 null
2024-05-06 Liberating Seen Classes: Boosting Few-Shot and Zero-Shot Text Classification via Anchor Generation and Classification Reframing Han Liu et.al. 2405.03565 null
2024-05-06 A Lightweight Neural Architecture Search Model for Medical Image Classification Lunchen Xie et.al. 2405.03462 null
2024-05-06 Interpretable Network Visualizations: A Human-in-the-Loop Approach for Post-hoc Explainability of CNN-based Image Classification Matteo Bianchi et.al. 2405.03301 null
2024-05-06 TED: Accelerate Model Training by Internal Generalization Jinying Xiao et.al. 2405.03228 null
2024-05-06 Advancing Multimodal Medical Capabilities of Gemini Lin Yang et.al. 2405.03162 null
2024-05-05 A scoping review of using Large Language Models (LLMs) to investigate Electronic Health Records (EHRs) Lingyao Li et.al. 2405.03066 null
2024-05-05 Parameter-Efficient Fine-Tuning with Discrete Fourier Transform Ziqi Gao et.al. 2405.03003 null
2024-05-04 MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning Vishal Nedungadi et.al. 2405.02771 null
2024-05-03 Multi-method Integration with Confidence-based Weighting for Zero-shot Image Classification Siqi Yin et.al. 2405.02155 null
2024-05-03 The Trade-off between Performance, Efficiency, and Fairness in Adapter Modules for Text Classification Minh Duc Bui et.al. 2405.02010 null
2024-05-03 Which Identities Are Mobilized: Towards an automated detection of social group appeals in political texts Felicia Riethmüller et.al. 2405.01904 null
2024-05-02 PVF (Parameter Vulnerability Factor): A Quantitative Metric Measuring AI Vulnerability and Resilience Against Parameter Corruptions Xun Jiao et.al. 2405.01741 null
2024-05-02 Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A Survey Guoping Xu et.al. 2405.01725 link
2024-05-02 SOAR: Advancements in Small Body Object Detection for Aerial Imagery Using State Space Models and Programmable Gradients Tushar Verma et.al. 2405.01699 null
2024-05-02 Explainable AI (XAI) in Image Segmentation in Medicine, Industry, and Beyond: A Survey Rokas Gipiškis et.al. 2405.01636 null
2024-05-02 Improving Intervention Efficacy via Concept Realignment in Concept Bottleneck Models Nishad Singhi et.al. 2405.01531 null
2024-05-03 Decoupling Feature Extraction and Classification Layers for Calibrated Neural Networks Mikkel Jordahn et.al. 2405.01196 null
2024-05-02 Uncertainty-aware self-training with expectation maximization basis transformation Zijia Wang et.al. 2405.01175 null
2024-05-02 Transformers Fusion across Disjoint Samples for Hyperspectral Image Classification Muhammad Ahmad et.al. 2405.01095 null
2024-05-02 Efficient and Flexible Method for Reducing Moderate-size Deep Neural Networks with Condensation Tianyi Chen et.al. 2405.01041 null
2024-05-02 Benchmarking Representations for Speech, Music, and Acoustic Events Moreno La Quatra et.al. 2405.00934 link
2024-05-01 Digital-analog quantum convolutional neural networks for image classification Anton Simen et.al. 2405.00548 null
2024-05-03 BiomedRAG: A Retrieval Augmented Large Language Model for Biomedicine Mingchen Li et.al. 2405.00465 null
2024-05-01 Visual and audio scene classification for detecting discrepancies in video: a baseline method and experimental protocol Konstantinos Apostolidis et.al. 2405.00384 null
2024-05-01 Data Augmentation Policy Search for Long-Term Forecasting Liran Nochumsohn et.al. 2405.00319 null
2024-04-30 Let's Focus: Focused Backdoor Attack against Federated Transfer Learning Marco Arazzi et.al. 2404.19420 null
2024-04-30 Large Language Model Informed Patent Image Retrieval Hao-Cheng Lo et.al. 2404.19360 null
2024-04-30 Enhancing Intrinsic Features for Debiasing via Investigating Class-Discerning Common Attributes in Bias-Contrastive Pair Jeonghoon Park et.al. 2404.19250 null
2024-04-29 Spectral-Spatial Mamba for Hyperspectral Image Classification Lingbo Huang et.al. 2404.18401 null
2024-04-28 TextGram: Towards a better domain-adaptive pretraining Sharayu Hiwarkhedkar et.al. 2404.18228 null
2024-04-28 L3Cube-MahaNews: News-based Short Text and Long Document Classification Datasets in Marathi Saloni Mittal et.al. 2404.18216 link
2024-04-28 S $^2$ Mamba: A Spatial-spectral State Space Model for Hyperspectral Image Classification Guanchun Wang et.al. 2404.18213 null
2024-04-27 Implicit Generative Prior for Bayesian Neural Networks Yijia Liu et.al. 2404.18008 link
2024-04-27 Towards Privacy-Preserving Audio Classification Systems Bhawana Chhaglani et.al. 2404.18002 null
2024-04-27 A Method of Moments Embedding Constraint and its Application to Semi-Supervised Learning Michael Majurski et.al. 2404.17978 null
2024-04-27 Spatial, Temporal, and Geometric Fusion for Remote Sensing Images Hessah Albanwan et.al. 2404.17851 null
2024-04-27 Leveraging Cross-Modal Neighbor Representation for Improved CLIP Classification Chao Yi et.al. 2404.17753 link
2024-04-26 SPLICE -- Streamlining Digital Pathology Image Processing Areej Alsaafin et.al. 2404.17704 null
2024-04-26 SDFD: Building a Versatile Synthetic Face Image Dataset with Diverse Attributes Georgia Baltsou et.al. 2404.17255 null
2024-04-25 Incorporating Lexical and Syntactic Knowledge for Unsupervised Cross-Lingual Transfer Jianyu Zheng et.al. 2404.16627 link
2024-04-25 IMWA: Iterative Model Weight Averaging Benefits Class-Imbalanced Learning Tasks Zitong Huang et.al. 2404.16331 null
2024-04-25 Lacunarity Pooling Layers for Plant Image Classification using Texture Analysis Akshatha Mohan et.al. 2404.16268 link
2024-04-24 MiMICRI: Towards Domain-centered Counterfactual Explanations of Cardiovascular Image Classification Models Grace Guo et.al. 2404.16174 null
2024-04-24 MoDE: CLIP Data Experts via Clustering Jiawei Ma et.al. 2404.16030 link
2024-04-26 A Survey on Visual Mamba Hanwei Zhang et.al. 2404.15956 null
2024-04-24 Vision Transformer-based Adversarial Domain Adaptation Yahan Li et.al. 2404.15817 link
2024-04-24 Rethinking Model Prototyping through the MedMNIST+ Dataset Collection Sebastian Doerrich et.al. 2404.15786 null
2024-04-24 Efficient Multi-Model Fusion with Adversarial Complementary Representation Learning Zuheng Kang et.al. 2404.15704 null
2024-04-24 Brain Storm Optimization Based Swarm Learning for Diabetic Retinopathy Image Classification Liang Qu et.al. 2404.15585 null
2024-04-23 An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models Yangchen Pan et.al. 2404.15518 null
2024-04-23 Deep multi-prototype capsule networks Saeid Abbassi et.al. 2404.15445 null
2024-04-23 A review of deep learning-based information fusion techniques for multimodal medical image classification Yihao Li et.al. 2404.15022 null
2024-04-23 Social Media and Artificial Intelligence for Sustainable Cities and Societies: A Water Quality Analysis Use-case Muhammad Asif Auyb et.al. 2404.14977 null
2024-04-23 Traditional to Transformers: A Survey on Current Trends and Future Prospects for Hyperspectral Image Classification Muhammad Ahmad et.al. 2404.14955 link
2024-04-23 Pyramid Hierarchical Transformer for Hyperspectral Image Classification Muhammad Ahmad et.al. 2404.14945 link
2024-04-23 Importance of Disjoint Sampling in Conventional and Transformer Models for Hyperspectral Image Classification Muhammad Ahmad et.al. 2404.14944 link
2024-04-23 CoProNN: Concept-based Prototypical Nearest Neighbors for Explaining Vision Models Teodor Chiaburu et.al. 2404.14830 link
2024-04-22 WangLab at MEDIQA-M3G 2024: Multimodal Medical Answer Generation using Large Language Models Ronald Xie et.al. 2404.14567 null
2024-04-22 CKD: Contrastive Knowledge Distillation from A Sample-wise Perspective Wencheng Zhu et.al. 2404.14109 null
2024-04-21 EncodeNet: A Framework for Boosting DNN Accuracy with Entropy-driven Generalized Converting Autoencoder Hasanul Mahmud et.al. 2404.13770 null
2024-04-21 PEACH: Pretrained-embedding Explanation Across Contextual and Hierarchical Structure Feiqi Cao et.al. 2404.13645 link
2024-04-21 I2CANSAY:Inter-Class Analogical Augmentation and Intra-Class Significance Analysis for Non-Exemplar Online Task-Free Continual Learning Songlin Dong et.al. 2404.13576 null
2024-04-21 IMO: Greedy Layer-Wise Sparse Representation Learning for Out-of-Distribution Text Classification with Pre-trained Models Tao Feng et.al. 2404.13504 null
2024-04-20 Nested-TNT: Hierarchical Vision Transformers with Multi-Scale Feature Processing Yuang Liu et.al. 2404.13434 null
2024-04-20 Evaluating Subword Tokenization: Alien Subword Composition and OOV Generalization Challenge Khuyagbaatar Batsuren et.al. 2404.13292 link
2024-04-20 3D-Convolution Guided Spectral-Spatial Transformer for Hyperspectral Image Classification Shyam Varahagiri et.al. 2404.13252 link
2024-04-19 On-board classification of underwater images using hybrid classical-quantum CNN based method Sreeraj Rajan Warrier et.al. 2404.13130 null
2024-04-19 Next Generation Loss Function for Image Classification Shakhnaz Akhmedova et.al. 2404.12948 null
2024-04-19 A Hybrid Generative and Discriminative PointNet on Unordered Point Sets Yang Ye et.al. 2404.12925 null
2024-04-19 Transformer-Based Classification Outcome Prediction for Multimodal Stroke Treatment Danqing Ma et.al. 2404.12634 null
2024-04-18 When LLMs are Unfit Use FastFit: Fast and Effective Text Classification with Many Classes Asaf Yehudai et.al. 2404.12365 null
2024-04-18 Observation, Analysis, and Solution: Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training Jin Gao et.al. 2404.12210 link
2024-04-18 Concept Induction using LLMs: a user experiment for assessment Adrita Barua et.al. 2404.11875 null
2024-04-17 Pretraining Billion-scale Geospatial Foundational Models on Frontier Aristeidis Tsaris et.al. 2404.11706 null
2024-04-17 AI-Enhanced Cognitive Behavioral Therapy: Deep Learning and Large Language Models for Extracting Cognitive Pathways from Social Media Texts Meng Jiang et.al. 2404.11449 null
2024-04-17 Achieving Rotation Invariance in Convolution Operations: Shifting from Data-Driven to Mechanism-Assured Hanlin Mo et.al. 2404.11309 null
2024-04-17 A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene Wenbo Zhang et.al. 2404.11249 null
2024-04-17 A Novel ICD Coding Framework Based on Associated and Hierarchical Code Description Distillation Bin Zhang et.al. 2404.11132 null
2024-04-17 Small Language Models are Good Too: An Empirical Study of Zero-Shot Classification Pierre Lepagnol et.al. 2404.11122 null
2024-04-18 Supervised Contrastive Vision Transformer for Breast Histopathological Image Classification Mohammad Shiri et.al. 2404.11052 null
2024-04-17 InfoMatch: Entropy Neural Estimation for Semi-Supervised Image Classification Qi Han et.al. 2404.11003 link
2024-04-16 Incubating Text Classifiers Following User Instruction with Nothing but LLM Letian Peng et.al. 2404.10877 null
2024-04-16 Vocabulary-free Image Classification and Semantic Segmentation Alessandro Conti et.al. 2404.10864 link
2024-04-16 Assessing The Impact of CNN Auto Encoder-Based Image Denoising on Image Classification Tasks Mohsen Hami et.al. 2404.10664 null
2024-04-16 Tree Bandits for Generative Bayes Sean O'Hagan et.al. 2404.10436 null
2024-04-16 AudioProtoPNet: An interpretable deep learning model for bird sound classification René Heinrich et.al. 2404.10420 null
2024-04-16 Lighter, Better, Faster Multi-Source Domain Adaptation with Gaussian Mixture Models and Optimal Transport Eduardo Fernandes Montesuma et.al. 2404.10261 null
2024-04-15 Distributed Federated Learning-Based Deep Learning Model for Privacy MRI Brain Tumor Detection Lisang Zhou et.al. 2404.10026 null
2024-04-15 Interaction as Explanation: A User Interaction-based Method for Explaining Image Classification Models Hyeonggeun Yun et.al. 2404.09828 null
2024-04-15 Quantization of Large Language Models with an Overdetermined Basis Daniil Merkulov et.al. 2404.09737 null
2024-04-15 Pseudo-label Learning with Calibrated Confidence Using an Energy-based Model Masahito Toba et.al. 2404.09585 null
2024-04-14 Breast Cancer Image Classification Method Based on Deep Transfer Learning Weimin Wang et.al. 2404.09226 null
2024-04-14 Coreset Selection for Object Detection Hojun Lee et.al. 2404.09161 null
2024-04-13 Exploring Explainability in Video Action Recognition Avinab Saha et.al. 2404.09067 null
2024-04-13 Fast Fishing: Approximating BAIT for Efficient and Scalable Deep Active Image Classification Denis Huseljic et.al. 2404.08981 link
2024-04-13 PM2: A New Prompting Multi-modal Model Paradigm for Few-shot Medical Image Classification Zhenwei Wang et.al. 2404.08915 null
2024-04-12 VertAttack: Taking advantage of Text Classifiers' horizontal vision Jonathan Rusert et.al. 2404.08538 null
2024-04-12 SpectralMamba: Efficient Mamba for Hyperspectral Image Classification Jing Yao et.al. 2404.08489 null
2024-04-12 OTTER: Improving Zero-Shot Classification via Optimal Transport Changho Shin et.al. 2404.08461 null
2024-04-12 A Survey of Neural Network Robustness Assessment in Image Recognition Jie Wang et.al. 2404.08285 null
2024-04-12 Convolutional neural network classification of cancer cytopathology images: taking breast cancer as an example MingXuan Xiao et.al. 2404.08279 null
2024-04-11 HGRN2: Gated Linear RNNs with State Expansion Zhen Qin et.al. 2404.07904 link
2024-04-11 Exploiting Object-based and Segmentation-based Semantic Features for Deep Learning-based Indoor Scene Classification Ricardo Pereira et.al. 2404.07739 null
2024-04-11 Contrastive-Based Deep Embeddings for Label Noise-Resilient Histopathology Image Classification Lucas Dedieu et.al. 2404.07605 link
2024-04-11 Learning to Classify New Foods Incrementally Via Compressed Exemplars Justin Yang et.al. 2404.07507 null
2024-04-11 Interactive Prompt Debugging with Sequence Salience Ian Tenney et.al. 2404.07498 null
2024-04-11 Privacy preserving layer partitioning for Deep Neural Network models Kishore Rajasekar et.al. 2404.07437 null
2024-04-11 CopilotCAD: Empowering Radiologists with Report Completion Models and Quantitative Evidence from Medical Image Foundation Models Sheng Wang et.al. 2404.07424 null
2024-04-11 Improving Shift Invariance in Convolutional Neural Networks with Translation Invariant Polyphase Sampling Sourajit Saha et.al. 2404.07410 null
2024-04-10 Lost in Translation: Modern Neural Networks Still Struggle With Small Realistic Image Transformations Ofir Shifman et.al. 2404.07153 null
2024-04-10 Learning of deep convolutional network image classifiers via stochastic gradient descent and over-parametrization Michael Kohler et.al. 2404.07128 null
2024-04-10 Accelerating Cardiac MRI Reconstruction with CMRatt: An Attention-Driven Approach Anam Hashmi et.al. 2404.06941 null
2024-04-10 Multi-Label Continual Learning for the Medical Domain: A Novel Benchmark Marina Ceccon et.al. 2404.06859 null
2024-04-10 Neural Optimizer Equation, Decay Function, and Learning Rate Schedule Joint Evolution Brandon Morgan et.al. 2404.06679 null
2024-04-09 Variational Stochastic Gradient Descent for Deep Neural Networks Haotian Chen et.al. 2404.06549 link
2024-04-09 On adversarial training and the 1 Nearest Neighbor classifier Amir Hagai et.al. 2404.06313 link
2024-04-09 Audio-Visual Generalized Zero-Shot Learning using Pre-Trained Large Multi-Modal Models David Kurzendörfer et.al. 2404.06309 link
2024-04-09 Counterfactual Reasoning for Multi-Label Image Classification via Patching-Based Training Ming-Kun Xie et.al. 2404.06287 null
2024-04-09 Quantum Circuit $C^*$ -algebra Net Yuka Hashimoto et.al. 2404.06218 null
2024-04-09 VI-OOD: A Unified Representation Learning Framework for Textual Out-of-distribution Detection Li-Ming Zhan et.al. 2404.06217 link
2024-04-09 Symmetry-guided gradient descent for quantum neural networks Kaiming Bian et.al. 2404.06108 null
2024-04-10 Using Few-Shot Learning to Classify Primary Lung Cancer and Other Malignancy with Lung Metastasis in Cytological Imaging via Endobronchial Ultrasound Procedures Ching-Kai Lin et.al. 2404.06080 null
2024-04-08 Neural Cellular Automata for Lightweight, Robust and Explainable Classification of White Blood Cell Images Michael Deutges et.al. 2404.05584 null
2024-04-08 On the Convergence of Continual Learning with Adaptive Methods Seungyub Han et.al. 2404.05555 null
2024-04-08 Multi-Task Learning for Features Extraction in Financial Annual Reports Syrielle Montariol et.al. 2404.05281 link
2024-04-08 Allowing humans to interactively guide machines where to look does not always improve a human-AI team's classification accuracy Giang Nguyen et.al. 2404.05238 null
2024-04-08 iVPT: Improving Task-relevant Information Sharing in Visual Prompt Tuning by Cross-layer Dynamic Connection Nan Zhou et.al. 2404.05207 null
2024-04-08 Semantic Stealth: Adversarial Text Attacks on NLP Using Several Methods Roopkatha Dey et.al. 2404.05159 null
2024-04-07 PairAug: What Can Augmented Image-Text Pairs Do for Radiology? Yutong Xie et.al. 2404.04960 link
2024-04-07 GvT: A Graph-based Vision Transformer with Talking-Heads Utilizing Sparsity, Trained from Scratch on Small Datasets Dongjing Shan et.al. 2404.04924 null
2024-04-06 Focused Active Learning for Histopathological Image Classification Arne Schmidt et.al. 2404.04663 null
2024-04-06 Trustless Audits without Revealing Data or Models Suppakit Waiwitlikhit et.al. 2404.04500 null
2024-04-05 Evaluating Adversarial Robustness: A Comparison Of FGSM, Carlini-Wagner Attacks, And The Role of Distillation as Defense Mechanism Trilokesh Ranjan Sarkar et.al. 2404.04245 null
2024-04-05 Noisy Label Processing for Classification: A Survey Mengting Li et.al. 2404.04159 null
2024-04-05 Learning Correlation Structures for Vision Transformers Manjin Kim et.al. 2404.03924 null
2024-04-05 LiDAR-Guided Cross-Attention Fusion for Hyperspectral Band Selection and Image Classification Judy X Yang et.al. 2404.03883 null
2024-04-04 Dendrites endow artificial neural networks with accurate, robust and parameter-efficient learning Spyridon Chavlis et.al. 2404.03708 null
2024-04-05 A Methodology to Study the Impact of Spiking Neural Network Parameters considering Event-Based Automotive Data Iqra Bano et.al. 2404.03493 null
2024-04-04 Meta Invariance Defense Towards Generalizable Robustness to Unknown Adversarial Attacks Lei Zhang et.al. 2404.03340 null
2024-04-04 Sparse Concept Bottleneck Models: Gumbel Tricks in Contrastive Learning Andrei Semenov et.al. 2404.03323 link
2024-04-04 FACTUAL: A Novel Framework for Contrastive Learning Based Robust SAR Image Classification Xu Wang et.al. 2404.03225 null
2024-04-03 Exploring the Trade-off Between Model Performance and Explanation Plausibility of Text Classifiers Using Human Rationales Lucas E. Resck et.al. 2404.03098 link
2024-04-03 Guarantees of confidentiality via Hammersley-Chapman-Robbins bounds Kamalika Chaudhuri et.al. 2404.02866 link
2024-04-03 FPT: Feature Prompt Tuning for Few-shot Readability Assessment Ziyang Wang et.al. 2404.02772 link
2024-04-03 Adversarial Attacks and Dimensionality in Text Classifiers Nandish Chattopadhyay et.al. 2404.02660 null
2024-04-04 Non-negative Subspace Feature Representation for Few-shot Learning in Medical Imaging Keqiang Fan et.al. 2404.02656 null
2024-04-03 Adaptive Cross-lingual Text Classification through In-Context One-Shot Demonstrations Emilio Villa-Cueva et.al. 2404.02452 link
2024-04-03 A Novel Approach to Breast Cancer Histopathological Image Classification Using Cross-Colour Space Feature Fusion and Quantum-Classical Stack Ensemble Method Sambit Mallick et.al. 2404.02447 null
2024-04-03 Enhancing Low-Resource LLMs Classification with PEFT and Synthetic Data Parth Patwa et.al. 2404.02422 null
2024-04-02 Smooth Deep Saliency Rudolf Herdt et.al. 2404.02282 null
2024-04-02 Visual Concept Connectome (VCC): Open World Concept Discovery and their Interlayer Connections in Deep Models Matthew Kowal et.al. 2404.02233 null
2024-04-02 ImageNot: A contrast with ImageNet preserves model rankings Olawale Salaudeen et.al. 2404.02112 null
2024-04-02 Explainability in JupyterLab and Beyond: Interactive XAI Systems for Integrated and Collaborative Workflows Grace Guo et.al. 2404.02081 null
2024-04-02 Ukrainian Texts Classification: Exploration of Cross-lingual Knowledge Transfer Approaches Daryna Dementieva et.al. 2404.02043 null
2024-04-02 CAM-Based Methods Can See through Walls Magamed Taimeskhanov et.al. 2404.01964 link
2024-04-02 Beyond Image Super-Resolution for Image Recognition with Task-Driven Perceptual Loss Jaeha Kim et.al. 2404.01692 null
2024-04-02 A Universal Knowledge Embedded Contrastive Learning Framework for Hyperspectral Image Classification Quanwei Liu et.al. 2404.01673 null
2024-04-01 Can Biases in ImageNet Models Explain Generalization? Paul Gavrikov et.al. 2404.01509 link
2024-04-01 Parallel Proportional Fusion of Spiking Quantum Neural Network for Optimizing Image Classification Zuyu Xu et.al. 2404.01359 null
2024-04-01 Bridging Remote Sensors with Multisensor Geospatial Foundation Models Boran Han et.al. 2404.01260 link
2024-04-01 Diagnosis of Skin Cancer Using VGG16 and VGG19 Based Transfer Learning Models Amir Faghihi et.al. 2404.01160 null
2024-03-29 Learn "No" to Say "Yes" Better: Improving Vision-Language Models via Negations Jaisidh Singh et.al. 2403.20312 link
2024-03-29 MCNet: A crowd denstity estimation network based on integrating multiscale attention module Qiang Guo et.al. 2403.20173 null
2024-03-29 Segmentation, Classification and Interpretation of Breast Cancer Medical Images using Human-in-the-Loop Machine Learning David Vázquez-Lema et.al. 2403.20112 null
2024-03-29 Adverb Is the Key: Simple Text Data Augmentation with Adverb Deletion Juhwan Choi et.al. 2403.20015 null
2024-03-29 Diverse Feature Learning by Self-distillation and Reset Sejik Park et.al. 2403.19941 null
2024-03-29 Heterogeneous Network Based Contrastive Learning Method for PolSAR Land Cover Classification Jianfeng Cai et.al. 2403.19902 link
2024-03-28 X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization Anna Kukleva et.al. 2403.19811 link
2024-03-28 RSMamba: Remote Sensing Image Classification with State Space Model Keyan Chen et.al. 2403.19654 link
2024-03-28 Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model Zhicai Wang et.al. 2403.19600 link
2024-03-28 The Bad Batches: Enhancing Self-Supervised Learning in Image Classification Through Representative Batch Curation Ozgu Goksu et.al. 2403.19579 null
2024-03-28 Low-Rank Rescaled Vision Transformer Fine-Tuning: A Residual Design Approach Wei Dong et.al. 2403.19067 link
2024-03-27 Evaluating Large Language Models for Health-Related Text Classification Tasks with Public Social Media Data Yuting Guo et.al. 2403.19031 null
2024-03-27 Robustness and Visual Explanation for Black Box Image, Video, and ECG Signal Classification with Reinforcement Learning Soumyendu Sarkar et.al. 2403.18985 null
2024-03-27 The Impact of Uniform Inputs on Activation Sparsity and Energy-Latency Attacks in Computer Vision Andreas Müller et.al. 2403.18587 link
2024-03-27 Uncertainty-Aware SAR ATR: Defending Against Adversarial Attacks via Bayesian Neural Networks Tian Ye et.al. 2403.18318 null
2024-03-27 Multi-scale Unified Network for Image Classification Wenzhuo Liu et.al. 2403.18294 null
2024-03-26 The Need for Speed: Pruning Transformers with One Recipe Samir Khaki et.al. 2403.17921 link
2024-03-26 Compressed Multi-task embeddings for Data-Efficient Downstream training and inference in Earth Observation Carlos Gomes et.al. 2403.17886 null
2024-03-26 PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition Chenhongyi Yang et.al. 2403.17695 link
2024-03-26 Language Models for Text Classification: Is In-Context Learning Enough? Aleksandra Edwards et.al. 2403.17661 null
2024-03-26 Boosting Few-Shot Learning with Disentangled Self-Supervised Learning and Meta-Learning for Medical Image Classification Eva Pachetti et.al. 2403.17530 null
2024-03-26 HILL: Hierarchy-aware Information Lossless Contrastive Learning for Hierarchical Text Classification He Zhu et.al. 2403.17307 link
2024-03-25 Histogram Layers for Neural Engineered Features Joshua Peeples et.al. 2403.17176 link
2024-03-25 Task2Box: Box Embeddings for Modeling Asymmetric Task Relationships Rangel Daroya et.al. 2403.17173 link
2024-03-25 CipherFormer: Efficient Transformer Private Inference with Low Round Complexity Weize Wang et.al. 2403.16860 null
2024-03-25 Assessing the Performance of Deep Learning for Automated Gleason Grading in Prostate Cancer Dominik Müller et.al. 2403.16695 null
2024-03-25 DeepGleason: a System for Automated Gleason Grading of Prostate Cancer using Deep Neural Networks Dominik Müller et.al. 2403.16678 link
2024-03-25 LARA: Linguistic-Adaptive Retrieval-Augmented LLMs for Multi-Turn Intent Classification Liu Junhua et.al. 2403.16504 null
2024-03-24 On machine learning analysis of atomic force microscopy images for image classification, sample surface recognition Igor Sokolov et.al. 2403.16230 null
2024-03-24 Leveraging Deep Learning and Xception Architecture for High-Accuracy MRI Classification in Alzheimer Diagnosis Shaojie Li et.al. 2403.16212 null
2024-03-24 Multi-Task Learning with Multi-Task Optimization Lu Bai et.al. 2403.16162 null
2024-03-24 CBGT-Net: A Neuromimetic Architecture for Robust Classification of Streaming Data Shreya Sharma et.al. 2403.15974 link
2024-03-23 A Deep Learning Architectures for Kidney Disease Classification Muhammad Shoaib Farooq et.al. 2403.15895 null
2024-03-23 VLUE: A New Benchmark and Multi-task Knowledge Transfer Learning for Vietnamese Natural Language Understanding Phong Nguyen-Thuan Do et.al. 2403.15882 null
2024-03-23 VLM-CPL: Consensus Pseudo Labels from Vision-Language Models for Human Annotation-Free Pathological Image Classification Lanfeng Zhong et.al. 2403.15836 null
2024-03-22 Your Image is My Video: Reshaping the Receptive Field via Image-To-Video Differentiable AutoAugmentation and Fusion Sofia Casarin et.al. 2403.15194 null
2024-03-22 Image Classification with Rotation-Invariant Variational Quantum Circuits Paul San Sebastian et.al. 2403.15031 null
2024-03-22 Extracting Human Attention through Crowdsourced Patch Labeling Minsuk Chang et.al. 2403.15013 null
2024-03-22 Clean-image Backdoor Attacks Dazhong Rong et.al. 2403.15010 null
2024-03-22 ParFormer: Vision Transformer Baseline with Parallel Local Global Token Mixer and Convolution Attention Patch Embedding Novendra Setyawan et.al. 2403.15004 null
2024-03-22 MasonTigers at SemEval-2024 Task 8: Performance Analysis of Transformer-based Models on Machine-Generated Text Detection Sadiya Sayara Chowdhury Puspo et.al. 2403.14989 null
2024-03-21 Learning with SASQuaTCh: a Novel Variational Quantum Transformer Architecture with Kernel-Based Self-Attention Ethan N. Evans et.al. 2403.14753 null
2024-03-21 Estimating Physical Information Consistency of Channel Data Augmentation for Remote Sensing Images Tom Burgert et.al. 2403.14547 null
2024-03-21 Multi-Level Explanations for Generative Language Models Lucas Monteiro Paes et.al. 2403.14459 null
2024-03-21 Tensor network compressibility of convolutional models Sukhbinder Singh et.al. 2403.14379 null
2024-03-21 LayoutLLM: Large Language Model Instruction Tuning for Visually Rich Document Understanding Masato Fujitake et.al. 2403.14252 null
2024-03-21 Safeguarding Medical Image Segmentation Datasets against Unauthorized Training via Contour- and Texture-Aware Perturbations Xun Lin et.al. 2403.14250 null
2024-03-21 Improving Image Classification Accuracy through Complementary Intra-Class and Inter-Class Mixup Ye Xu et.al. 2403.14137 link
2024-03-20 Bridge the Modality and Capacity Gaps in Vision-Language Model Selection Chao Yi et.al. 2403.13797 null
2024-03-20 Leveraging feature communication in federated learning for remote sensing image classification Anh-Kiet Duong et.al. 2403.13575 null
2024-03-20 MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining Di Wang et.al. 2403.13430 link
2024-03-20 Building Optimal Neural Architectures using Interpretable Knowledge Keith G. Mills et.al. 2403.13293 link
2024-03-19 LUWA Dataset: Learning Lithic Use-Wear Analysis on Microscopic Images Jing Zhang et.al. 2403.13171 null
2024-03-19 Improved EATFormer: A Vision Transformer for Medical Image Classification Yulong Shisu et.al. 2403.13167 null
2024-03-19 SIFT-DBT: Self-supervised Initialization and Fine-Tuning for Imbalanced Digital Breast Tomosynthesis Image Classification Yuexi Du et.al. 2403.13148 link
2024-03-19 Using evolutionary computation to optimize task performance of unclocked, recurrent Boolean circuits in FPGAs Raphael Norman-Tenazas et.al. 2403.13105 null
2024-03-19 Investigating Text Shortening Strategy in BERT: Truncation vs Summarization Mirza Alim Mutasodirin et.al. 2403.12799 link
2024-03-18 Posterior Uncertainty Quantification in Neural Networks using Data Augmentation Luhuan Wu et.al. 2403.12729 null
2024-03-19 SEVEN: Pruning Transformer Model by Reserving Sentinels Jinying Xiao et.al. 2403.12688 link
2024-03-19 Simple Hack for Transformers against Heavy Long-Text Classification on a Time- and Memory-Limited GPU Service Mirza Alim Mutasodirin et.al. 2403.12563 null
2024-03-19 Prompt-Guided Adaptive Model Transformation for Whole Slide Image Classification Yi Lin et.al. 2403.12537 null
2024-03-19 CrossTune: Black-Box Few-Shot Classification with Label Enhancement Danqing Luo et.al. 2403.12468 null
2024-03-18 Generalizing deep learning models for medical image classification Matta Sarah et.al. 2403.12167 null
2024-03-19 Leveraging Spatial and Semantic Feature Extraction for Skin Cancer Diagnosis with Capsule Networks and Graph Neural Networks K. P. Santoso et.al. 2403.12009 null
2024-03-18 High-energy physics image classification: A Survey of Jet Applications Hamza Kheddar et.al. 2403.11934 null
2024-03-18 Better (pseudo-)labels for semi-supervised instance segmentation François Porcher et.al. 2403.11675 null
2024-03-18 Continual Forgetting for Pre-trained Vision Models Hongbo Zhao et.al. 2403.11530 link
2024-03-18 Uncertainty-Calibrated Test-Time Model Adaptation without Forgetting Mingkui Tan et.al. 2403.11491 null
2024-03-17 Potential of Domain Adaptation in Machine Learning in Ecology and Hydrology to Improve Model Extrapolability Haiyang Shi et.al. 2403.11331 null
2024-03-17 A Modified Word Saliency-Based Adversarial Attack on Text Classification Models Hetvi Waghela et.al. 2403.11297 null
2024-03-17 Forging the Forger: An Attempt to Improve Authorship Verification via Data Augmentation Silvia Corbara et.al. 2403.11265 null
2024-03-17 Multiple Teachers-Meticulous Student: A Domain Adaptive Meta-Knowledge Distillation Model for Medical Image Classification Shahabedin Nabavi et.al. 2403.11226 null
2024-03-16 Forward Learning of Graph Neural Networks Namyong Park et.al. 2403.11004 null
2024-03-16 Understanding Robustness of Visual State Space Models for Image Classification Chengbin Du et.al. 2403.10935 null
2024-03-16 Automatic location detection based on deep learning Anjali Karangiya et.al. 2403.10912 null
2024-03-14 Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models Akhil Kedia et.al. 2403.09635 link
2024-03-14 XCoOp: Explainable Prompt Learning for Computer-Aided Diagnosis via Concept-guided Context Optimization Yequan Bie et.al. 2403.09410 null
2024-03-14 ConDiSR: Contrastive Disentanglement and Style Regularization for Single Domain Generalization Aleksandr Matsun et.al. 2403.09400 null
2024-03-14 A Hierarchical Fused Quantum Fuzzy Neural Network for Image Classification Sheng-Yao Wu et.al. 2403.09318 null
2024-03-14 CLIP-EBC: CLIP Can Count Accurately through Enhanced Blockwise Classification Yiming Ma et.al. 2403.09281 null
2024-03-14 Are Vision Language Models Texture or Shape Biased and Can We Steer Them? Paul Gavrikov et.al. 2403.09193 null
2024-03-14 Randomized Principal Component Analysis for Hyperspectral Image Classification Mustafa Ustuner et.al. 2403.09117 null
2024-03-14 CardioCaps: Attention-based Capsule Network for Class-Imbalanced Echocardiogram Classification Hyunkyung Han et.al. 2403.09108 link
2024-03-14 The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models? Qinyu Zhao et.al. 2403.09037 link
2024-03-13 PathM3: A Multimodal Multi-Task Multiple Instance Learning Framework for Whole Slide Image Classification and Captioning Qifeng Zhou et.al. 2403.08967 null
2024-03-13 DAM: Dynamic Adapter Merging for Continual Video QA Learning Feng Cheng et.al. 2403.08755 link
2024-03-13 Leveraging Compressed Frame Sizes For Ultra-Fast Video Classification Yuxing Han et.al. 2403.08580 null
2024-03-13 HOLMES: HOLonym-MEronym based Semantic inspection for Convolutional Image Classifiers Francesco Dibitonto et.al. 2403.08536 link
2024-03-13 Pig aggression classification using CNN, Transformers and Recurrent Networks Junior Silva Souza et.al. 2403.08528 null
2024-03-13 Reduced Jeffries-Matusita distance: A Novel Loss Function to Improve Generalization Performance of Deep Classification Models Mohammad Lashkari et.al. 2403.08408 null
2024-03-13 Iterative Online Image Synthesis via Diffusion Model for Imbalanced Classification Shuhan Li et.al. 2403.08407 null
2024-03-13 Advancing Security in AI Systems: A Novel Approach to Detecting Backdoors in Deep Neural Networks Khondoker Murad Hossain et.al. 2403.08208 null
2024-03-13 Multiscale Low-Frequency Memory Network for Improved Feature Extraction in Convolutional Neural Networks Fuzhi Wu et.al. 2403.08157 link
2024-03-12 Harnessing Artificial Intelligence to Combat Online Hate: Exploring the Challenges and Opportunities of Large Language Models in Hate Speech Detection Tharindu Kumarage et.al. 2403.08035 null
2024-03-13 Visual Decoding and Reconstruction via EEG Embeddings with Guided Diffusion Dongyang Li et.al. 2403.07721 link
2024-03-12 FPT: Fine-grained Prompt Tuning for Parameter and Memory Efficient Fine Tuning in High-resolution Medical Image Classification Yijin Huang et.al. 2403.07576 null
2024-03-12 Backdoor Attack with Mode Mixture Latent Modification Hongwei Zhang et.al. 2403.07463 null
2024-03-12 In-context learning enables multimodal large language models to classify cancer pathology images Dyke Ferber et.al. 2403.07407 null
2024-03-12 Premonition: Using Generative Models to Preempt Future Data Changes in Continual Learning Mark D. McDonnell et.al. 2403.07356 null
2024-03-12 How does promoting the minority fraction affect generalization? A theoretical study of the one-hidden-layer neural network on group imbalance Hongkang Li et.al. 2403.07310 null
2024-03-12 A Bayesian Approach to OOD Robustness in Image Classification Prakhar Kaushik et.al. 2403.07277 null
2024-03-11 LeOCLR: Leveraging Original Images for Contrastive Learning of Visual Representations Mohammad Alkhalefi et.al. 2403.06813 null
2024-03-11 Dynamic Perturbation-Adaptive Adversarial Training on Medical Image Classification Shuai Li et.al. 2403.06798 null
2024-03-11 Leveraging Internal Representations of Model for Magnetic Image Classification Adarsh N L et.al. 2403.06797 null
2024-03-11 Shortcut Learning in Medical Image Segmentation Manxi Lin et.al. 2403.06748 null
2024-03-11 Active Generation for Image Classification Tao Huang et.al. 2403.06517 null
2024-03-11 Evolving Knowledge Distillation with Large Language Models and Active Learning Chengyuan Liu et.al. 2403.06414 null
2024-03-11 'One size doesn't fit all': Learning how many Examples to use for In-Context Learning for Improved Text Classification Manish Chandra et.al. 2403.06402 null
2024-03-10 Probing Image Compression For Class-Incremental Learning Justin Yang et.al. 2403.06288 null
2024-03-10 Bayesian Random Semantic Data Augmentation for Medical Image Classification Yaoyao Zhu et.al. 2403.06138 link
2024-03-10 Universal Debiased Editing for Fair Medical Image Classification Ruinan Jin et.al. 2403.06104 null
2024-03-08 Tune without Validation: Searching for Learning Rate and Weight Decay on Training Sets Lorenzo Brigato et.al. 2403.05532 null
2024-03-08 Generalized Correspondence Matching via Flexible Hierarchical Refinement and Patch Descriptor Distillation Yu Han et.al. 2403.05388 null
2024-03-08 The Impact of Quantization on the Robustness of Transformer-based Text Classifiers Seyed Parsa Neshaei et.al. 2403.05365 null
2024-03-08 Multiple Instance Learning with random sampling for Whole Slide Image Classification H. Keshvarikhojasteh et.al. 2403.05351 null
2024-03-08 Learning Expressive And Generalizable Motion Features For Face Forgery Detection Jingyi Zhang et.al. 2403.05172 null
2024-03-08 Defending Against Unforeseen Failure Modes with Latent Adversarial Training Stephen Casper et.al. 2403.05030 link
2024-03-07 Fooling Neural Networks for Motion Forecasting via Adversarial Attacks Edgar Medina et.al. 2403.04954 null
2024-03-07 T-TAME: Trainable Attention Mechanism for Explaining Convolutional Networks and Vision Transformers Mariano V. Ntrougkas et.al. 2403.04523 null
2024-03-07 Source Matters: Source Dataset Impact on Model Robustness in Medical Imaging Dovile Juodelyte et.al. 2403.04484 link
2024-03-07 Advancing Biomedical Text Mining with Community Challenges Hui Zong et.al. 2403.04261 null
2024-03-07 Scalable On-Chip Optical Linear Processing Unit Using a Single Thin-Film Lithium Niobate Ring Modulator Zhaoang Deng et.al. 2403.04216 null
2024-03-07 Scalable and Robust Transformer Decoders for Interpretable Image Classification with Foundation Models Evelyn Mannix et.al. 2403.04125 null
2024-03-07 Privacy-preserving Fine-tuning of Large Language Models through Flatness Tiejin Chen et.al. 2403.04124 null
2024-03-06 MedMamba: Vision Mamba for Medical Image Classification Yubiao Yue et.al. 2403.03849 link
2024-03-06 On the Effectiveness of Distillation in Mitigating Backdoors in Pre-trained Encoder Tingxu Han et.al. 2403.03846 link
2024-03-06 RADIA -- Radio Advertisement Detection with Intelligent Analytics Jorge Álvarez et.al. 2403.03538 null
2024-03-06 Inverse-Free Fast Natural Gradient Descent Method for Deep Learning Xinwei Ou et.al. 2403.03473 null
2024-03-06 Sparse Spiking Neural Network: Exploiting Heterogeneity in Timescales for Pruning Recurrent SNN Biswadeep Chakraborty et.al. 2403.03409 null
2024-03-05 RulePrompt: Weakly Supervised Text Classification with Prompting PLMs and Self-Iterative Logical Rules Miaomiao Li et.al. 2403.02932 link
2024-03-05 Demonstrating Mutual Reinforcement Effect through Information Flow Chengguang Gan et.al. 2403.02902 null
2024-03-05 Quantum Mixed-State Self-Attention Network Fu Chen et.al. 2403.02871 null
2024-03-05 SOFIM: Stochastic Optimization Using Regularized Fisher Information Matrix Gayathri C et.al. 2403.02833 null
2024-03-05 SGD with Partial Hessian for Deep Neural Networks Optimization Ying Sun et.al. 2403.02681 link
2024-03-05 G-EvoNAS: Evolutionary Neural Architecture Search Based on Network Growth Juan Zou et.al. 2403.02667 null
2024-03-05 Remove that Square Root: A New Efficient Scale-Invariant Version of AdaGrad Sayantan Choudhury et.al. 2403.02648 link
2024-03-05 Modeling Collaborator: Enabling Subjective Vision Classification With Minimal Human Effort via LLM Tool-Use Imad Eddine Toubal et.al. 2403.02626 null
2024-03-04 When do Convolutional Neural Networks Stop Learning? Sahan Ahmad et.al. 2403.02473 link
2024-03-04 NiNformer: A Network in Network Transformer with Token Mixing Generated Gating Function Abdullah Nazhat Abdullah et.al. 2403.02411 link
2024-03-02 Can a Confident Prior Replace a Cold Posterior? Martin Marek et.al. 2403.01272 link
2024-03-02 Leveraging Self-Supervised Learning for Scene Recognition in Child Sexual Abuse Imagery Pedro H. V. Valois et.al. 2403.01183 null
2024-03-02 Auxiliary Tasks Enhanced Dual-affinity Learning for Weakly Supervised Semantic Segmentation Lian Xu et.al. 2403.01156 null
2024-03-02 ELA: Efficient Local Attention for Deep Convolutional Neural Networks Wei Xu et.al. 2403.01123 null
2024-03-01 Margin Discrepancy-based Adversarial Training for Multi-Domain Text Classification Yuan Wu et.al. 2403.00888 null
2024-03-01 Text classification of column headers with a controlled vocabulary: leveraging LLMs for metadata enrichment Margherita Martorana et.al. 2403.00884 null
2024-03-01 SURE: SUrvey REcipes for building reliable and robust deep networks Yuting Li et.al. 2403.00543 link
2024-03-01 Invariant Test-Time Adaptation for Vision-Language Model Generalization Huan Ma et.al. 2403.00376 null
2024-02-29 TELEClass: Taxonomy Enrichment and LLM-Enhanced Hierarchical Text Classification with Minimal Supervision Yunyi Zhang et.al. 2403.00165 null
2024-02-29 Assessing Visually-Continuous Corruption Robustness of Neural Networks Relative to Human Performance Huakun Shen et.al. 2402.19401 null
2024-02-29 Stitching Gaps: Fusing Situated Perceptual Knowledge with Vision Transformers for High-Level Image Classification Delfina Sol Martinez Pandiani et.al. 2402.19339 null
2024-02-29 Generalizable Whole Slide Image Classification with Fine-Grained Visual-Semantic Interaction Hao Li et.al. 2402.19326 null
2024-02-29 Decompose-and-Compose: A Compositional Approach to Mitigating Spurious Correlation Fahimeh Hosseini Noohdani et.al. 2402.18919 null
2024-02-29 Utilizing Local Hierarchy with Adversarial Training for Hierarchical Text Classification Zihan Wang et.al. 2402.18825 link
2024-02-28 Comparing Importance Sampling Based Methods for Mitigating the Effect of Class Imbalance Indu Panigrahi et.al. 2402.18742 link
2024-02-28 Deep Neural Network Models Trained With A Fixed Random Classifier Transfer Better Across Domains Hafiz Tiomoko Ali et.al. 2402.18614 null
2024-02-28 Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling Mahdi Karami et.al. 2402.18508 null
2024-02-28 Prompt-Driven Dynamic Object-Centric Learning for Single Domain Generalization Deng Li et.al. 2402.18447 null
2024-02-29 A Modular System for Enhanced Robustness of Multimedia Understanding Networks via Deep Parametric Estimation Francesco Barbato et.al. 2402.18402 null
2024-02-28 A Multimodal Handover Failure Detection Dataset and Baselines Santosh Thoduka et.al. 2402.18319 null
2024-02-28 Classes Are Not Equal: An Empirical Study on Image Recognition Fairness Jiequan Cui et.al. 2402.18133 null
2024-02-27 Understanding Neural Network Binarization with Forward and Backward Proximal Quantizers Yiwei Lu et.al. 2402.17710 null
2024-02-27 SDF2Net: Shallow to Deep Feature Fusion Network for PolSAR Image Classification Mohammed Q. Alkhatib et.al. 2402.17672 link
2024-02-27 Predict the Next Word: Evgenia Ilia et.al. 2402.17527 null
2024-02-27 Scaling Supervised Local Learning with Augmented Auxiliary Networks Chenxiang Ma et.al. 2402.17318 link
2024-02-26 Offline Writer Identification Using Convolutional Neural Network Activation Features Vincent Christlein et.al. 2402.17029 null

(back to top)

Object Detection

Publish Date Title Authors PDF Code
2024-11-22 A Real-Time DETR Approach to Bangladesh Road Object Detection for Autonomous Vehicles Irfan Nafiz Shahan et.al. 2411.15110 null
2024-11-22 MSSF: A 4D Radar and Camera Fusion Framework With Multi-Stage Sampling for 3D Object Detection in Autonomous Driving Hongsi Liu et.al. 2411.15016 null
2024-11-22 VisionPAD: A Vision-Centric Pre-training Paradigm for Autonomous Driving Haiming Zhang et.al. 2411.14716 null
2024-11-21 Unveiling the Hidden: A Comprehensive Evaluation of Underwater Image Enhancement and Its Impact on Object Detection Ali Awad et.al. 2411.14626 null
2024-11-21 DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding Tianhe Ren et.al. 2411.14347 link
2024-11-21 AnywhereDoor: Multi-Target Backdoor Attacks on Object Detection Jialin Lu et.al. 2411.14243 null
2024-11-21 Transforming Static Images Using Generative Models for Video Salient Object Detection Suhwan Cho et.al. 2411.13975 link
2024-11-21 Multitask Learning for SAR Ship Detection with Gaussian-Mask Joint Segmentation Ming Zhao et.al. 2411.13847 null
2024-11-20 MambaDETR: Query-based Temporal Modeling using State Space Model for Multi-View 3D Object Detection Tong Ning et.al. 2411.13628 null
2024-11-20 DIS-Mine: Instance Segmentation for Disaster-Awareness in Poor-Light Condition in Underground Mines Mizanur Rahman Jewel et.al. 2411.13544 null
2024-11-20 A Resource Efficient Fusion Network for Object Detection in Bird's-Eye View using Camera and Raw Radar Data Kavin Chandrasekaran et.al. 2411.13311 link
2024-11-20 VADet: Multi-frame LiDAR 3D Object Detection using Variable Aggregation Chengjie Huang et.al. 2411.13186 null
2024-11-20 RAW-Diffusion: RGB-Guided Diffusion Models for High-Fidelity RAW Image Generation Christoph Reinders et.al. 2411.13150 link
2024-11-20 YCB-LUMA: YCB Object Dataset with Luminance Keying for Object Localization Thomas Pöllabauer et.al. 2411.13149 link
2024-11-20 Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension Yongdong Luo et.al. 2411.13093 link
2024-11-20 Bounding-box Watermarking: Defense against Model Extraction Attacks on Object Detectors Satoru Koda et.al. 2411.13047 null
2024-11-20 Collaborative Feature-Logits Contrastive Learning for Open-Set Semi-Supervised Object Detection Xinhao Zhong et.al. 2411.13001 null
2024-11-19 Maps from Motion (MfM): Generating 2D Semantic Maps from Sparse Multi-view Images Matteo Toso et.al. 2411.12620 null
2024-11-19 GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving Shaoqing Xu et.al. 2411.12452 null
2024-11-19 Physics-Guided Detector for SAR Airplanes Zhongling Huang et.al. 2411.12301 link
2024-11-18 Scaling Deep Learning Research with Kubernetes on the NRP Nautilus HyperCluster J. Alex Hurt et.al. 2411.12038 null
2024-11-18 LightFFDNets: Lightweight Convolutional Neural Networks for Rapid Facial Forgery Detection Günel Jabbarlı et.al. 2411.11826 null
2024-11-18 WoodYOLO: A Novel Object Detector for Wood Species Detection in Microscopic Images Lars Nieradzik et.al. 2411.11738 null
2024-11-18 Exploring Emerging Trends and Research Opportunities in Visual Place Recognition Antonios Gasteratos et.al. 2411.11481 null
2024-11-18 SL-YOLO: A Stronger and Lighter Drone Target Detection Model Defan Chen et.al. 2411.11477 null
2024-11-19 EVT: Efficient View Transformation for Multi-Modal 3D Object Detection Yongjin Lee et.al. 2411.10715 null
2024-11-15 Vision Eagle Attention: A New Lens for Advancing Image Classification Mahmudul Hasan et.al. 2411.10564 link
2024-11-15 Interactive Image-Based Aphid Counting in Yellow Water Traps under Stirring Actions Xumin Gao et.al. 2411.10357 null
2024-11-15 RETR: Multi-View Radar Detection Transformer for Indoor Perception Ryoma Yataka et.al. 2411.10293 null
2024-11-15 Visual-Linguistic Agent: Towards Collaborative Contextual Object Reasoning Jingru Yang et.al. 2411.10252 null
2024-11-15 Real-Time AI-Driven People Tracking and Counting Using Overhead Cameras Ishrath Ahamed et.al. 2411.10072 null
2024-11-15 Diachronic Document Dataset for Semantic Layout Analysis Thibault Clérice et.al. 2411.10068 null
2024-11-14 Adversarial Attacks Using Differentiable Rendering: A Survey Matthew Hull et.al. 2411.09749 null
2024-11-14 Local-Global Attention: An Adaptive Mechanism for Multi-Scale Feature Integration Yifan Shao et.al. 2411.09604 link
2024-11-14 Long-Tailed Object Detection Pre-training: Dynamic Rebalancing Contrastive Learning with Dual Reconstruction Chen-Long Duan et.al. 2411.09453 null
2024-11-14 Instruction-Driven Fusion of Infrared-Visible Images: Tailoring for Diverse Downstream Tasks Zengyi Yang et.al. 2411.09387 null
2024-11-14 DT-JRD: Deep Transformer based Just Recognizable Difference Prediction Model for Video Coding for Machines Junqi Liu et.al. 2411.09308 null
2024-11-14 Cross-Modal Consistency in Multimodal Large Language Models Xiang Zhang et.al. 2411.09273 null
2024-11-14 LEAP:D -- A Novel Prompt-based Approach for Domain-Generalized Aerial Object Detection Chanyeong Park et.al. 2411.09180 null
2024-11-13 Multimodal Object Detection using Depth and Image Data for Manufacturing Parts Nazanin Mahjourian et.al. 2411.09062 null
2024-11-13 DART-LLM: Dependency-Aware Multi-Robot Task Decomposition and Execution using Large Language Models Yongdong Wang et.al. 2411.09022 null
2024-11-13 UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation Chengyuan Zhang et.al. 2411.08569 null
2024-11-13 Methodology for a Statistical Analysis of Influencing Factors on 3D Object Detection Performance Anton Kuznietsov et.al. 2411.08482 null
2024-11-13 V2X-R: Cooperative LiDAR-4D Radar Fusion for 3D Object Detection with Denoising Diffusion Xun Huang et.al. 2411.08402 link
2024-11-12 Large-scale Remote Sensing Image Target Recognition and Automatic Annotation Wuzheng Dong et.al. 2411.07802 link
2024-11-12 Efficient 3D Perception on Multi-Sweep Point Cloud with Gumbel Spatial Pruning Jianhao Li et.al. 2411.07742 null
2024-11-12 Depthwise Separable Convolutions with Deep Residual Convolutions Md Arid Hasan et.al. 2411.07544 null
2024-11-11 Transformers for Charged Particle Track Reconstruction in High Energy Physics Samuel Van Stroud et.al. 2411.07149 null
2024-11-11 Multi-scale Frequency Enhancement Network for Blind Image Deblurring Yawen Xiang et.al. 2411.06893 null
2024-11-11 Fast and Efficient Transformer-based Method for Bird's Eye View Instance Prediction Miguel Antunes-García et.al. 2411.06851 link
2024-11-11 AV-PedAware: Self-Supervised Audio-Visual Fusion for Dynamic Pedestrian Awareness Yizhuo Yang et.al. 2411.06789 null
2024-11-11 United Domain Cognition Network for Salient Object Detection in Optical Remote Sensing Images Yanguang Sun et.al. 2411.06703 link
2024-11-11 Track Any Peppers: Weakly Supervised Sweet Pepper Tracking Using VLMs Jia Syuen Lim et.al. 2411.06702 null
2024-11-11 LFSamba: Marry SAM with Mamba for Light Field Salient Object Detection Zhengyi Liu et.al. 2411.06652 null
2024-11-09 Robust Detection of LLM-Generated Text: A Comparative Analysis Yongye Su et.al. 2411.06248 null
2024-11-09 LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with Instance Representation Weijie Ma et.al. 2411.06173 link
2024-11-09 AI-Compass: A Comprehensive and Effective Multi-module Testing Tool for AI Systems Zhiyu Zhu et.al. 2411.06146 null
2024-11-08 Open-set object detection: towards unified problem formulation and benchmarking Hejer Ammar et.al. 2411.05564 null
2024-11-08 ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for Autonomous Driving Tao Ma et.al. 2411.05311 null
2024-11-08 SimpleBEV: Improved LiDAR-Camera Fusion Architecture for 3D Object Detection Yun Zhao et.al. 2411.05292 null
2024-11-07 On the Inherent Robustness of One-Stage Object Detection against Out-of-Distribution Data Aitor Martinez-Seras et.al. 2411.04586 null
2024-11-07 l0-Regularized Sparse Coding-based Interpretable Network for Multi-Modal Image Fusion Gargi Panda et.al. 2411.04519 null
2024-11-07 Pose2Trajectory: Using Transformers on Body Pose to Predict Tennis Player's Trajectory Ali K. AlShami et.al. 2411.04501 null
2024-11-07 SuperQ-GRASP: Superquadrics-based Grasp Pose Estimation on Larger Objects for Mobile-Manipulation Xun Tu et.al. 2411.04386 null
2024-11-07 UEVAVD: A Dataset for Developing UAV's Eye View Active Object Detection Xinhua Jiang et.al. 2411.04348 null
2024-11-07 GazeGen: Gaze-Driven User Interaction for Visual Content Generation He-Yen Hsieh et.al. 2411.04335 null
2024-11-06 An Enhancement of Haar Cascade Algorithm Applied to Face Recognition for Gate Pass Security Clarence A. Antipona et.al. 2411.03831 null
2024-11-06 Understanding the Effects of Human-written Paraphrases in LLM-generated Text Detection Hiu Ting Lau et.al. 2411.03806 link
2024-11-06 Efficient Fourier Filtering Network with Contrastive Learning for UAV-based Unaligned Bi-modal Salient Object Detection Pengfei Lyu et.al. 2411.03728 link
2024-11-06 Estimation of Psychosocial Work Environment Exposures Through Video Object Detection. Proof of Concept Using CCTV Footage Claus D. Hansen et.al. 2411.03724 null
2024-11-06 Hybrid Attention for Robust RGB-T Pedestrian Detection in Real-World Conditions Arunkumar Rathinam et.al. 2411.03576 null
2024-11-05 An Application-Agnostic Automatic Target Recognition System Using Vision Language Models Anthony Palladino et.al. 2411.03491 null
2024-11-05 Self-supervised cross-modality learning for uncertainty-aware object detection and recognition in applications which lack pre-labelled training data Irum Mehboob et.al. 2411.03082 null
2024-11-05 CRT-Fusion: Camera, Radar, Temporal Fusion Using Motion Information for 3D Object Detection Jisong Kim et.al. 2411.03013 null
2024-11-05 Centerness-based Instance-aware Knowledge Distillation with Task-wise Mutual Lifting for Object Detection on Drone Imagery Bowei Du et.al. 2411.02861 null
2024-11-05 Correlation of Object Detection Performance with Visual Saliency and Depth Estimation Matthias Bartolo et.al. 2411.02844 link
2024-11-05 ERUP-YOLO: Enhancing Object Detection Robustness for Adverse Weather Condition by Unified Image-Adaptive Processing Yuka Ogino et.al. 2411.02799 null
2024-11-05 Real-Time Text Detection with Similar Mask in Traffic, Industrial, and Natural Scenes Xu Han et.al. 2411.02794 link
2024-11-05 Efficient Feature Aggregation and Scale-Aware Regression for Monocular 3D Object Detection Yifan Wang et.al. 2411.02747 null
2024-11-05 Analysis of Multi-epoch JWST Images of $\sim 300$ Little Red Dots: Tentative Detection of Variability in a Minority of Sources Zijian Zhang et.al. 2411.02729 null
2024-11-04 Intelligent Video Recording Optimization using Activity Detection for Surveillance Systems Youssef Elmir et.al. 2411.02632 null
2024-11-04 SIRA: Scalable Inter-frame Relation and Association for Radar Perception Ryoma Yataka et.al. 2411.02220 null
2024-11-04 Advanced computer vision for extracting georeferenced vehicle trajectories from drone imagery Robert Fonod et.al. 2411.02136 null
2024-11-04 Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation Yan Li et.al. 2411.02057 link
2024-11-04 V-CAS: A Realtime Vehicle Anti Collision System Using Vision Transformer on Multi-Camera Streams Muhammad Waqas Ashraf et.al. 2411.01963 null
2024-11-04 Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models Sharat Agarwal et.al. 2411.01925 null
2024-11-04 LiDAttack: Robust Black-box Attack on LiDAR-based Object Detection Jinyin Chen et.al. 2411.01889 link
2024-11-03 ROAD-Waymo: Action Awareness at Scale for Autonomous Driving Salman Khan et.al. 2411.01683 null
2024-11-03 OSAD: Open-Set Aircraft Detection in SAR Images Xiayang Xiao et.al. 2411.01597 null
2024-11-03 One for All: Multi-Domain Joint Training for Point Cloud Based 3D Object Detection Zhenyu Wang et.al. 2411.01584 null
2024-11-03 A Visual Question Answering Method for SAR Ship: Breaking the Requirement for Multimodal Dataset Construction and Model Fine-Tuning Fei Wang et.al. 2411.01445 null
2024-10-31 ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images Timing Yang et.al. 2410.24001 link
2024-10-31 Localization, balance and affinity: a stronger multifaceted collaborative salient object detector in remote sensing images Yakun Xie et.al. 2410.23991 null
2024-10-31 Uncertainty Estimation for 3D Object Detection via Evidential Learning Nikita Durasov et.al. 2410.23910 null
2024-10-31 From Web Data to Real Fields: Low-Cost Unsupervised Domain Adaptation for Agricultural Robots Vasileios Tzouras et.al. 2410.23906 null
2024-10-31 Open-Set 3D object detection in LiDAR data as an Out-of-Distribution problem Louis Soum-Fontez et.al. 2410.23767 null
2024-10-31 DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios Junchao Wu et.al. 2410.23746 link
2024-10-31 GigaCheck: Detecting LLM-generated Content Irina Tolstykh et.al. 2410.23728 null
2024-10-31 Context-Aware Token Selection and Packing for Enhanced Vision Transformer Tianyi Zhang et.al. 2410.23608 null
2024-10-30 EMMA: End-to-End Multimodal Model for Autonomous Driving Jyh-Jing Hwang et.al. 2410.23262 null
2024-10-30 S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving Maciej K. Wozniak et.al. 2410.23085 null
2024-10-30 First Place Solution to the ECCV 2024 ROAD++ Challenge @ ROAD++ Spatiotemporal Agent Detection 2024 Tengfei Zhang et.al. 2410.23077 null
2024-10-30 AdaptiveISP: Learning an Adaptive Image Signal Processor for Object Detection Yujin Wang et.al. 2410.22939 null
2024-10-30 YOLOv11 for Vehicle Detection: Advancements, Performance, and Applications in Intelligent Transportation Systems Mujadded Al Rabbani Alif et.al. 2410.22898 null
2024-10-29 Unified Domain Generalization and Adaptation for Multi-View 3D Object Detection Gyusam Chang et.al. 2410.22461 null
2024-10-29 Lighten CARAFE: Dynamic Lightweight Upsampling with Guided Reassemble Kernels Ruigang Fu et.al. 2410.22139 link
2024-10-29 Data Generation for Hardware-Friendly Post-Training Quantization Lior Dikstein et.al. 2410.22110 null
2024-10-29 Cognitive Semantic Augmentation LEO Satellite Networks for Earth Observation Hong-fu Chou et.al. 2410.21916 null
2024-10-29 PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplanar MRI Slices Ming Kang et.al. 2410.21822 link
2024-10-28 MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps Yating Xu et.al. 2410.21566 link
2024-10-28 TACO: Adversarial Camouflage Optimization on Trucks to Fool Object Detectors Adonisz Dimitriu et.al. 2410.21443 null
2024-10-28 Joint Audio-Visual Idling Vehicle Detection with Streamlined Input Dependencies Xiwen Li et.al. 2410.21170 null
2024-10-28 Synthetica: Large Scale Synthetic Data for Robot Perception Ritvik Singh et.al. 2410.21153 null
2024-10-28 DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning Xun Guo et.al. 2410.20964 null
2024-10-28 IndraEye: Infrared Electro-Optical UAV-based Perception Dataset for Robust Downstream Tasks Manjunath D et.al. 2410.20953 null
2024-10-28 SparseTem: Boosting the Efficiency of CNN-Based Video Encoders by Exploiting Temporal Continuity Kunyun Wang et.al. 2410.20790 null
2024-10-27 Sebica: Lightweight Spatial and Efficient Bidirectional Channel Attention Super Resolution Network Chongxiao Liu et.al. 2410.20546 null
2024-10-27 Guidance Disentanglement Network for Optics-Guided Thermal UAV Image Super-Resolution Zhicheng Zhao et.al. 2410.20466 link
2024-10-27 Open-Vocabulary Object Detection via Language Hierarchy Jiaxing Huang et.al. 2410.20371 null
2024-10-27 Historical Test-time Prompt Tuning for Vision Foundation Models Jingyi Zhang et.al. 2410.20346 null
2024-10-25 OReole-FM: successes and challenges toward billion-parameter foundation models for high-resolution satellite imagery Philipe Dias et.al. 2410.19965 null
2024-10-25 MetaTrading: An Immersion-Aware Model Trading Framework for Vehicular Metaverse Services Hongjia Wu et.al. 2410.19665 null
2024-10-25 Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models Shenghao Fu et.al. 2410.19635 null
2024-10-25 MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors Fanqi Pu et.al. 2410.19590 null
2024-10-25 DECADE: Towards Designing Efficient-yet-Accurate Distance Estimation Modules for Collision Avoidance in Mobile Advanced Driver Assistance Systems Muhammad Zaeem Shahzad et.al. 2410.19336 null
2024-10-25 In-Simulation Testing of Deep Learning Vision Models in Autonomous Robotic Manipulators Dmytro Humeniuk et.al. 2410.19277 null
2024-10-24 HUE Dataset: High-Resolution Event and Frame Sequences for Low-Light Vision Burak Ercan et.al. 2410.19164 null
2024-10-24 Optimizing Edge Offloading Decisions for Object Detection Jiaming Qiu et.al. 2410.18919 link
2024-10-24 You Only Look Around: Learning Illumination Invariant Feature for Low-light Object Detection Mingbo Hong et.al. 2410.18398 null
2024-10-24 Thermal Chameleon: Task-Adaptive Tone-mapping for Radiometric Thermal-Infrared images Dong-Guw Lee et.al. 2410.18340 link
2024-10-23 KhmerST: A Low-Resource Khmer Scene Text Detection and Recognition Benchmark Vannkinh Nom et.al. 2410.18277 null
2024-10-23 Automated Defect Detection and Grading of Piarom Dates Using Deep Learning Nasrin Azimi et.al. 2410.18208 null
2024-10-23 DREB-Net: Dual-stream Restoration Embedding Blur-feature Fusion Network for High-mobility UAV Object Detection Qingpeng Li et.al. 2410.17822 link
2024-10-23 YOLO-Vehicle-Pro: A Cloud-Edge Collaborative Framework for Object Detection in Autonomous Driving under Adverse Weather Conditions Xiguang Li et.al. 2410.17734 null
2024-10-23 YOLOv11: An Overview of the Key Architectural Enhancements Rahima Khanam et.al. 2410.17725 null
2024-10-23 PlantCamo: Plant Camouflage Detection Jinyu Yang et.al. 2410.17598 link
2024-10-23 OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object Tracking Haiji Liang et.al. 2410.17534 link
2024-10-22 EPContrast: Effective Point-level Contrastive Learning for Large-scale Point Cloud Understanding Zhiyi Pan et.al. 2410.17207 null
2024-10-22 YOLO-TS: Real-Time Traffic Sign Detection with Enhanced Accuracy Using Optimized Receptive Fields and Anchor-Free Fusion Junzhou Chen et.al. 2410.17144 null
2024-10-22 FlightAR: AR Flight Assistance Interface with Multiple Video Streams and Object Detection Aimed at Immersive Drone Control Oleg Sautenkov et.al. 2410.16943 null
2024-10-22 AttriPrompter: Auto-Prompting with Attribute Semantics for Zero-shot Nuclei Detection via Visual-Language Pre-trained Models Yongjian Wu et.al. 2410.16820 link
2024-10-22 DSORT-MCU: Detecting Small Objects in Real-Time on Microcontroller Units Liam Boyle et.al. 2410.16769 null
2024-10-22 DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model Zhixiong Nan et.al. 2410.16707 null
2024-10-22 Fire and Smoke Detection with Burning Intensity Representation Xiaoyi Han et.al. 2410.16642 link
2024-10-21 Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models Yufei Zhan et.al. 2410.16163 link
2024-10-21 Multi-Sensor Fusion for UAV Classification Based on Feature Maps of Image and Radar Data Nikos Sakellariou et.al. 2410.16089 null
2024-10-21 Few-shot target-driven instance detection based on open-vocabulary object detection models Ben Crulis et.al. 2410.16028 null
2024-10-21 How Important are Data Augmentations to Close the Domain Gap for Object Detection in Orbit? Maximilian Ulmer et.al. 2410.15766 null
2024-10-21 P-YOLOv8: Efficient and Accurate Real-Time Detection of Distracted Driving Mohamed R. Elshamy et.al. 2410.15602 null
2024-10-21 Deep Learning and Machine Learning -- Object Detection and Semantic Segmentation: From Theory to Applications Jintao Ren et.al. 2410.15584 null
2024-10-21 Online Pseudo-Label Unified Object Detection for Multiple Datasets Training XiaoJun Tang et.al. 2410.15569 null
2024-10-20 TrackMe:A Simple and Effective Multiple Object Tracking Annotation Tool Thinh Phan et.al. 2410.15518 null
2024-10-20 YOLO-RD: Introducing Relevant and Compact Explicit Knowledge to YOLO by Retriever-Dictionary Hao-Tang Tsui et.al. 2410.15346 null
2024-10-20 Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability Yusuke Hosoya et.al. 2410.15315 null
2024-10-18 MultiOrg: A Multi-rater Organoid-detection Dataset Christina Bukas et.al. 2410.14612 null
2024-10-18 Beyond Binary: Towards Fine-Grained LLM-Generated Text Detection via Role Recognition and Involvement Measurement Zihao Cheng et.al. 2410.14259 null
2024-10-18 Multi-Source Spatial Knowledge Understanding for Immersive Visual Text-to-Speech Shuwei He et.al. 2410.14101 link
2024-10-18 Enhancing In-vehicle Multiple Object Tracking Systems with Embeddable Ising Machines Kosuke Tatsumura et.al. 2410.14093 null
2024-10-17 FaceSaliencyAug: Mitigating Geographic, Gender and Stereotypical Biases via Saliency-Based Data Augmentation Teerath Kumar et.al. 2410.14070 null
2024-10-17 Spatiotemporal Object Detection for Improved Aerial Vehicle Detection in Traffic Monitoring Kristina Telegraph et.al. 2410.13616 null
2024-10-17 RemoteDet-Mamba: A Hybrid Mamba-CNN Network for Multi-modal Object Detection in Remote Sensing Images Kejun Ren et.al. 2410.13532 null
2024-10-16 Syn2Real Domain Generalization for Underwater Mine-like Object Detection Using Side-Scan Sonar Aayush Agrawal et.al. 2410.12953 null
2024-10-16 MambaBEV: An efficient 3D detection model with Mamba2 Zihan You et.al. 2410.12673 null
2024-10-16 On the Risk of Evidence Pollution for Malicious Social Text Detection in the Era of LLMs Herun Wan et.al. 2410.12600 null
2024-10-16 Cocoon: Robust Multi-Modal Perception with Uncertainty-Aware Sensor Fusion Minkyoung Cho et.al. 2410.12592 null
2024-10-16 Feature Augmentation for Self-supervised Contrastive Learning: A Closer Look Yong Zhang et.al. 2410.12396 null
2024-10-16 Real-time Stereo-based 3D Object Detection for Streaming Perception Changcai Li et.al. 2410.12394 link
2024-10-16 Context-Infused Visual Grounding for Art Selina Khan et.al. 2410.12369 link
2024-10-16 Fusion from Decomposition: A Self-Supervised Approach for Image Fusion and Beyond Pengwei Liang et.al. 2410.12274 null
2024-10-16 Optimizing YOLOv5s Object Detection through Knowledge Distillation algorithm Guanming Huang et.al. 2410.12259 null
2024-10-16 SAM-Guided Masked Token Prediction for 3D Scene Understanding Zhimin Chen et.al. 2410.12158 null
2024-10-16 Unveiling the Limits of Alignment: Multi-modal Dynamic Local Fusion Network and A Benchmark for Unaligned RGBT Video Object Detection Qishun Wang et.al. 2410.12143 null
2024-10-15 Fractal Calibration for long-tailed object detection Konstantinos Panagiotis Alexandridis et.al. 2410.11774 null
2024-10-15 POLO -- Point-based, multi-class animal detection Giacomo May et.al. 2410.11741 null
2024-10-15 YOLO-ELA: Efficient Local Attention Modeling for High-Performance Real-Time Insulator Defect Detection Olalekan Akindele et.al. 2410.11727 null
2024-10-15 SeaDATE: Remedy Dual-Attention Transformer with Semantic Alignment via Contrast Learning for Multimodal Object Detection Shuhan Dong et.al. 2410.11358 null
2024-10-15 Open World Object Detection: A Survey Yiming Li et.al. 2410.11301 null
2024-10-15 Representation Similarity: A Better Guidance of DNN Layer Sharing for Edge Computing without Training Bryan Bo Cao et.al. 2410.11233 null
2024-10-15 TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement Zhiwei Lin et.al. 2410.11228 null
2024-10-15 CVCP-Fusion: On Implicit Depth Estimation for 3D Bounding Box Prediction Pranav Gupta et.al. 2410.11211 link
2024-10-15 Multiview Scene Graph Juexiao Zhang et.al. 2410.11187 null
2024-10-14 UAV3D: A Large-scale 3D Perception Benchmark for Unmanned Aerial Vehicles Hui Ye et.al. 2410.11125 null
2024-10-14 ROSAR: An Adversarial Re-Training Framework for Robust Side-Scan Sonar Object Detection Martin Aubard et.al. 2410.10554 link
2024-10-14 Learning to Ground VLMs without Forgetting Aritra Bhowmik et.al. 2410.10491 null
2024-10-14 SMART-TRACK: A Novel Kalman Filter-Guided Sensor Fusion For Robust UAV Object Tracking in Dynamic Environments Khaled Gabr et.al. 2410.10409 null
2024-10-14 V2M: Visual 2-Dimensional Mamba for Image Representation Learning Chengkun Wang et.al. 2410.10382 link
2024-10-14 GlobalMamba: Global Image Serialization for Vision Mamba Chengkun Wang et.al. 2410.10316 link
2024-10-14 ROA-BEV: 2D Region-Oriented Attention for BEV-based 3D Object Jiwei Chen et.al. 2410.10298 null
2024-10-14 Out-of-Bounding-Box Triggers: A Stealthy Approach to Cheat Object Detectors Tao Lin et.al. 2410.10091 link
2024-10-15 Optimizing Waste Management with Advanced Object Detection for Garbage Classification Everest Z. Kuang et.al. 2410.09975 null
2024-10-13 EITNet: An IoT-Enhanced Framework for Real-Time Basketball Action Recognition Jingyu Liu et.al. 2410.09954 null
2024-10-13 LoLI-Street: Benchmarking Low-Light Image Enhancement and Beyond Md Tanvir Islam et.al. 2410.09831 link
2024-10-11 DA-Ada: Learning Domain-Aware Adapter for Domain Adaptive Object Detection Haochen Li et.al. 2410.09004 null
2024-10-11 LIME-Eval: Rethinking Low-light Image Enhancement Evaluation via Object Detection Mingjia Li et.al. 2410.08810 null
2024-10-11 Hespi: A pipeline for automatically detecting information from hebarium specimen sheets Robert Turnbull et.al. 2410.08740 null
2024-10-11 MMLF: Multi-modal Multi-class Late Fusion for Object Detection with Uncertainty Estimation Qihang Yang et.al. 2410.08739 null
2024-10-11 Boosting Open-Vocabulary Object Detection by Handling Background Samples Ruizhe Zeng et.al. 2410.08645 null
2024-10-11 DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention Nguyen Huu Bao Long et.al. 2410.08582 link
2024-10-11 VOVTrack: Exploring the Potentiality in Videos for Open-Vocabulary Object Tracking Zekun Qian et.al. 2410.08529 null
2024-10-10 Are We Ready for Real-Time LiDAR Semantic Segmentation in Autonomous Driving? Samir Abou Haidar et.al. 2410.08365 null
2024-10-10 PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection Botao Ren et.al. 2410.08210 null
2024-10-10 Robust AI-Generated Text Detection by Restricted Embeddings Kristian Kuznetsov et.al. 2410.08113 null
2024-10-10 Dynamic Object Catching with Quadruped Robot Front Legs André Schakkal et.al. 2410.08065 null
2024-10-10 HeightFormer: A Semantic Alignment Monocular 3D Object Detection Method from Roadside Perspective Pei Liu et.al. 2410.07758 null
2024-10-10 O1O: Grouping of Known Classes to Identify Unknown Objects as Odd-One-Out Mısra Yavuz et.al. 2410.07514 null
2024-10-09 Progressive Multi-Modal Fusion for Robust 3D Object Detection Rohit Mohan et.al. 2410.07475 null
2024-10-09 Self-Supervised Learning for Real-World Object Detection: a Survey Alina Ciocarlan et.al. 2410.07442 null
2024-10-09 Robust infrared small target detection using self-supervised and a contrario paradigms Alina Ciocarlan et.al. 2410.07437 null
2024-10-09 SurANet: Surrounding-Aware Network for Concealed Object Detection via Highly-Efficient Interactive Contrastive Learning Strategy Yuhan Kang et.al. 2410.06842 link
2024-10-09 Rethinking the Evaluation of Visible and Infrared Image Fusion Dayan Guan et.al. 2410.06811 link
2024-10-09 QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model Fei Xie et.al. 2410.06806 null
2024-10-09 QuadBEV: An Efficient Quadruple-Task Perception Framework via Bird's-Eye-View Representation Yuxin Li et.al. 2410.06516 null
2024-10-08 Adver-City: Open-Source Multi-Modal Dataset for Collaborative Perception Under Adverse Weather Conditions Mateus Karvat et.al. 2410.06380 null
2024-10-08 Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach Sha Guo et.al. 2410.06149 null
2024-10-08 Training-free LLM-generated Text Detection by Mining Token Probability Sequences Yihuai Xu et.al. 2410.06072 null
2024-10-08 Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts Zhiwei Lin et.al. 2410.05963 null
2024-10-08 Learning Gaussian Data Augmentation in Feature Space for One-shot Object Detection in Manga Takara Taniguchi et.al. 2410.05935 null
2024-10-08 Unobserved Object Detection using Generative Models Subhransu S. Bhattacharjee et.al. 2410.05869 null
2024-10-07 Real-Time Truly-Coupled Lidar-Inertial Motion Correction and Spatiotemporal Dynamic Object Detection Cedric Le Gentil et.al. 2410.05152 null
2024-10-07 Human-in-the-loop Reasoning For Traffic Sign Detection: Collaborative Approach Yolo With Video-llava Mehdi Azarafza et.al. 2410.05096 null
2024-10-07 Improving Object Detection via Local-global Contrastive Learning Danai Triantafyllidou et.al. 2410.05058 null
2024-10-07 Windshield Integration of Thermal and Color Fusion for Automatic Emergency Braking in Low Visibility Conditions Gabriel Jobert et.al. 2410.04928 null
2024-10-07 Improved detection of discarded fish species through BoxAL active learning Maria Sokolova et.al. 2410.04880 link
2024-10-06 Learning De-Biased Representations for Remote-Sensing Imagery Zichen Tian et.al. 2410.04546 link
2024-10-05 AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text Ximing Lu et.al. 2410.04265 null
2024-10-05 ETHcavation: A Dataset and Pipeline for Panoptic Scene Understanding and Object Tracking in Dynamic Construction Environments Lorenzo Terenzi et.al. 2410.04250 null
2024-10-05 Fast Object Detection with a Machine Learning Edge Device Richard C. Rodriguez et.al. 2410.04173 null
2024-10-05 Robust Task-Oriented Communication Framework for Real-Time Collaborative Vision Perception Zhengru Fang et.al. 2410.04168 null
2024-10-04 DRAFTS: A Deep Learning-Based Radio Fast Transient Search Pipeline Yong-Kun Zhang et.al. 2410.03200 null
2024-10-03 Is Your Paper Being Reviewed by an LLM? Investigating AI Text Detectability in Peer Review Sungduk Yu et.al. 2410.03019 null
2024-10-04 Learning 3D Perception from Others' Predictions Jinsu Yoo et.al. 2410.02646 null
2024-10-02 Enhancing Screen Time Identification in Children with a Multi-View Vision Language Model and Screen Time Tracker Xinlong Hou et.al. 2410.01966 null
2024-10-02 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection Yang Cao et.al. 2410.01647 link
2024-10-02 Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection Hongru Yan et.al. 2410.01404 null
2024-10-02 Finetuning Pre-trained Model with Limited Data for LiDAR-based 3D Object Detection by Bridging Domain Gaps Jiyun Jang et.al. 2410.01319 null
2024-10-02 Panopticus: Omnidirectional 3D Object Detection on Resource-constrained Edge Devices Jeho Lee et.al. 2410.01270 null
2024-10-02 High and Low Resolution Tradeoffs in Roadside Multimodal Sensing Shaozu Ding et.al. 2410.01250 null
2024-10-02 Perceptual Piercing: Human Visual Cue-based Object Detection in Low Visibility Conditions Ashutosh Kumar et.al. 2410.01225 link
2024-10-02 A versatile machine learning workflow for high-throughput analysis of supported metal catalyst particles Arda Genc et.al. 2410.01213 link
2024-10-01 Synthetic imagery for fuzzy object detection: A comparative study Siavash H. Khajavi et.al. 2410.01124 null
2024-10-01 Generating Seamless Virtual Immunohistochemical Whole Slide Images with Content and Color Consistency Sitong Liu et.al. 2410.01072 null
2024-10-01 ARPOV: Expanding Visualization of Object Detection in AR with Panoramic Mosaic Stitching Erin McGowan et.al. 2410.01055 null
2024-09-30 Accelerating Non-Maximum Suppression: A Graph Theory Perspective King-Siong Si et.al. 2409.20520 link
2024-09-30 NUTRIVISION: A System for Automatic Diet Management in Smart Healthcare Madhumita Veeramreddy et.al. 2409.20508 null
2024-09-30 Navigating Threats: A Survey of Physical Adversarial Attacks on LiDAR Perception Systems in Autonomous Vehicles Amira Guesmi et.al. 2409.20426 null
2024-09-30 Training a Computer Vision Model for Commercial Bakeries with Primarily Synthetic Images Thomas H. Schmitt et.al. 2409.20122 null
2024-09-30 GearTrack: Automating 6D Pose Estimation Yu Deng et.al. 2409.19986 null
2024-09-30 TSdetector: Temporal-Spatial Self-correction Collaborative Learning for Colonoscopy Video Detection Kaini Wang et.al. 2409.19983 null
2024-09-30 DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction Zhen Yang et.al. 2409.19972 link
2024-09-30 HazyDet: Open-source Benchmark for Drone-view Object Detection with Depth-cues in Hazy Scenes Changfeng Feng et.al. 2409.19833 link
2024-09-29 Applying the Lower-Biased Teacher Model in Semi-Suepervised Object Detection Shuang Wang et.al. 2409.19703 null
2024-09-29 OrientedFormer: An End-to-End Transformer-Based Oriented Object Detector in Remote Sensing Images Jiaqi Zhao et.al. 2409.19648 link
2024-09-27 Spectral Wavelet Dropout: Regularization in the Wavelet Domain Rinor Cakaj et.al. 2409.18951 null
2024-09-27 MCUBench: A Benchmark of Tiny Object Detectors on MCUs Sudhakar Sah et.al. 2409.18866 link
2024-09-27 A Novel Unified Architecture for Low-Shot Counting by Detection and Segmentation Jer Pelhan et.al. 2409.18686 null
2024-09-27 Query matching for spatio-temporal action detection with query-based object detector Shimon Hori et.al. 2409.18408 null
2024-09-26 Efficient Microscopic Image Instance Segmentation for Food Crystal Quality Control Xiaoyu Ji et.al. 2409.18291 null
2024-09-26 Advancing Object Detection in Transportation with Multimodal Large Language Models (MLLMs): A Comprehensive Review and Empirical Testing Huthaifa I. Ashqar et.al. 2409.18286 null
2024-09-26 GSON: A Group-based Social Navigation Framework with Large Multimodal Model Shangyi Luo et.al. 2409.18084 null
2024-09-27 A New Dataset for Monocular Depth Estimation Under Viewpoint Shifts Aurel Pjetri et.al. 2409.17851 null
2024-09-26 Scene Understanding in Pick-and-Place Tasks: Analyzing Transformations Between Initial and Final Scenes Seraj Ghasemi et.al. 2409.17720 null
2024-09-26 SLO-Aware Task Offloading within Collaborative Vehicle Platoons Boris Sedlak et.al. 2409.17667 null
2024-09-26 CAMOT: Camera Angle-aware Multi-Object Tracking Felix Limanta et.al. 2409.17533 null
2024-09-25 Transient Adversarial 3D Projection Attacks on Object Detection in Autonomous Driving Ce Zhou et.al. 2409.17403 null
2024-09-25 AgRegNet: A Deep Regression Network for Flower and Fruit Density Estimation, Localization, and Counting in Orchards Uddhav Bhattarai et.al. 2409.17400 null
2024-09-25 Energy-Efficient & Real-Time Computer Vision with Intelligent Skipping via Reconfigurable CMOS Image Sensors Md Abdullah-Al Kaiser et.al. 2409.17341 null
2024-09-25 BitQ: Tailoring Block Floating Point Precision for Improved DNN Efficiency on Resource-Constrained Devices Yongqi Xu et.al. 2409.17093 link
2024-09-25 EventHDR: from Event to High-Speed HDR Videos and Beyond Yunhao Zou et.al. 2409.17029 null
2024-09-25 Focus Entirety and Perceive Environment for Arbitrary-Shaped Text Detection Xu Han et.al. 2409.16827 null
2024-09-25 XAI-guided Insulator Anomaly Detection for Imbalanced Datasets Maximilian Andreas Hoefler et.al. 2409.16821 null
2024-09-25 Spotlight Text Detector: Spotlight on Candidate Regions Like a Camera Xu Han et.al. 2409.16820 null
2024-09-25 Benchmarking Deep Learning Models for Object Detection on Edge Computing Devices Daghash K. Alqahtani et.al. 2409.16808 null
2024-09-25 Pix2Next: Leveraging Vision Foundation Models for RGB to NIR Image Translation Youngwan Jin et.al. 2409.16706 null
2024-09-25 TSBP: Improving Object Detection in Histology Images via Test-time Self-guided Bounding-box Propagation Tingting Yang et.al. 2409.16678 link
2024-09-25 Source-Free Domain Adaptation for YOLO Object Detection Simon Varailhon et.al. 2409.16538 null
2024-09-24 Real-Time Detection of Electronic Components in Waste Printed Circuit Boards: A Transformer-Based Approach Muhammad Mohsin et.al. 2409.16496 null
2024-09-24 Tiny Robotics Dataset and Benchmark for Continual Object Detection Francesco Pasti et.al. 2409.16215 link
2024-09-24 Seeing Faces in Things: A Model and Dataset for Pareidolia Mark Hamilton et.al. 2409.16143 null
2024-09-24 HA-FGOVD: Highlighting Fine-grained Attributes via Explicit Linear Composition for Open-Vocabulary Object Detection Yuqi Ma et.al. 2409.16136 null
2024-09-24 Neuromorphic Drone Detection: an Event-RGB Multimodal Approach Gabriele Magrini et.al. 2409.16099 null
2024-09-24 Open-World Object Detection with Instance Representation Learning Sunoh Lee et.al. 2409.16073 null
2024-09-24 Towards Robust Object Detection: Identifying and Removing Backdoors via Module Inconsistency Analysis Xianda Zhang et.al. 2409.16057 null
2024-09-24 Zero-Shot Detection of AI-Generated Images Davide Cozzolino et.al. 2409.15875 null
2024-09-24 Automated Assessment of Multimodal Answer Sheets in the STEM domain Rajlaxmi Patil et.al. 2409.15749 null
2024-09-24 Real-Time Pedestrian Detection on IoT Edge Devices: A Lightweight Deep Learning Approach Muhammad Dany Alfikri et.al. 2409.15740 null
2024-09-24 PDT: Uav Target Detection Dataset for Pests and Diseases Tree Mingle Zhou et.al. 2409.15679 link
2024-09-18 Applications of Knowledge Distillation in Remote Sensing: A Survey Yassine Himeur et.al. 2409.12111 null
2024-09-18 Agglomerative Token Clustering Joakim Bruslund Haurum et.al. 2409.11923 null
2024-09-18 RockTrack: A 3D Robust Multi-Camera-Ken Multi-Object Tracking Framework Xiaoyu Li et.al. 2409.11749 null
2024-09-17 Open-Set Semantic Uncertainty Aware Metric-Semantic Graph Matching Kurran Singh et.al. 2409.11555 null
2024-09-17 VALO: A Versatile Anytime Framework for LiDAR-based Object Detection Deep Neural Networks Ahmet Soyyigit et.al. 2409.11542 link
2024-09-17 STCMOT: Spatio-Temporal Cohesion Learning for UAV-Based Multiple Object Tracking Jianbo Ma et.al. 2409.11234 link
2024-09-19 Vision foundation models: can they be applied to astrophysics data? E. Lastufka et.al. 2409.11175 null
2024-09-17 UltimateDO: An Efficient Framework to Marry Occupancy Prediction with 3D Object Detection via Channel2height Zichen Yu et.al. 2409.11160 null
2024-09-17 Unleashing the Potential of Mamba: Boosting a LiDAR 3D Sparse Detector by Using Cross-Model Knowledge Distillation Rui Yu et.al. 2409.11018 null
2024-09-17 TrajSSL: Trajectory-Enhanced Semi-Supervised 3D Object Detection Philip Jacobson et.al. 2409.10901 null
2024-09-18 Context-Dependent Interactable Graphical User Interface Element Detection for Spatial Computing Applications Shuqing Li et.al. 2409.10811 null
2024-09-16 Online Learning via Memory: Retrieval-Augmented Detector Adaptation Yanan Jian et.al. 2409.10716 null
2024-09-16 CoMamba: Real-time Cooperative Perception Unlocked with State Space Models Jinlong Li et.al. 2409.10699 null
2024-09-16 Point2Graph: An End-to-end Point Cloud-based 3D Open-Vocabulary Scene Graph for Robot Navigation Yifan Xu et.al. 2409.10350 null
2024-09-16 Performance of Human Annotators in Object Detection and Segmentation of Remotely Sensed Data Roni Blushtein-Livnon et.al. 2409.10272 null
2024-09-16 Self-Updating Vehicle Monitoring Framework Employing Distributed Acoustic Sensing towards Real-World Settings Xi Wang et.al. 2409.10259 null
2024-09-16 DAE-Fuse: An Adaptive Discriminative Autoencoder for Multi-Modality Image Fusion Yuchen Guo et.al. 2409.10080 null
2024-09-16 Towards Physically-Realizable Adversarial Attacks in Embodied Vision Navigation Meng Chen et.al. 2409.10071 link
2024-09-16 LithoHoD: A Litho Simulator-Powered Framework for IC Layout Hotspot Detection Hao-Chiang Shao et.al. 2409.10021 null
2024-09-16 Comprehensive Study on Sentiment Analysis: From Rule-based to modern LLM based system Shailja Gupta et.al. 2409.09989 null
2024-09-15 Tracking Virtual Meetings in the Wild: Re-identification in Multi-Participant Virtual Meetings Oriel Perl et.al. 2409.09841 null
2024-09-15 Template-based Multi-Domain Face Recognition Anirudh Nanduri et.al. 2409.09832 null
2024-09-15 PersonaMark: Personalized LLM watermarking for model protection and user attribution Yuehan Zhang et.al. 2409.09739 null
2024-09-13 Interactive Masked Image Modeling for Multimodal Object Detection in Remote Sensing Minh-Duc Vu et.al. 2409.08885 null
2024-09-13 Direct-CP: Directed Collaborative Perception for Connected and Autonomous Vehicles via Proactive Attention Yihang Tao et.al. 2409.08840 null
2024-09-13 RT-DETRv3: Real-time End-to-End Object Detection with Hierarchical Dense Positive Supervision Shuo Wang et.al. 2409.08475 null
2024-09-12 X-ray Fluoroscopy Guided Localization and Steering of Medical Microrobots through Virtual Enhancement Husnu Halid Alabay et.al. 2409.08337 null
2024-09-12 What is YOLOv9: An In-Depth Exploration of the Internal Features of the Next-Generation Object Detector Muhammad Yaseen et.al. 2409.07813 null
2024-09-11 Object Depth and Size Estimation using Stereo-vision and Integration with SLAM Layth Hamad et.al. 2409.07623 null
2024-09-11 Zero-Shot Machine-Generated Text Detection Using Mixture of Large Language Models Matthieu Dubois et.al. 2409.07615 null
2024-09-11 ENACT: Entropy-based Clustering of Attention Input for Improving the Computational Performance of Object Detection Transformers Giorgos Savathrakis et.al. 2409.07541 link
2024-09-11 Watchlist Challenge: 3rd Open-set Face Detection and Identification Furkan Kasım et.al. 2409.07220 null
2024-09-11 SCLNet: A Scale-Robust Complementary Learning Network for Object Detection in UAV Images Xuexue Li et.al. 2409.07024 null
2024-09-11 ODYSSEE: Oyster Detection Yielded by Sensor Systems on Edge Electronics Xiaomin Lin et.al. 2409.07003 null
2024-09-11 Brain-Inspired Stepwise Patch Merging for Vision Transformers Yonghao Yu et.al. 2409.06963 null
2024-09-10 Cross-Modal Self-Supervised Learning with Effective Contrastive Units for LiDAR Point Clouds Mu Cai et.al. 2409.06827 link
2024-09-10 Technical Report of Mobile Manipulator Robot for Industrial Environments Erfan Amoozad Khalili et.al. 2409.06693 null
2024-09-10 A comprehensive study on Blood Cancer detection and classification using Convolutional Neural Network Md Taimur Ahad et.al. 2409.06689 null
2024-09-10 When to Extract ReID Features: A Selective Approach for Improved Multiple Object Tracking Emirhan Bayar et.al. 2409.06617 link
2024-09-10 Transtreaming: Adaptive Delay-aware Transformer for Real-time Streaming Perception Xiang Zhang et.al. 2409.06584 null
2024-09-10 Semi-Supervised 3D Object Detection with Chanel Augmentation using Transformation Equivariance Minju Kang et.al. 2409.06583 null
2024-09-10 Knowledge Distillation via Query Selection for Detection Transformer Yi Liu et.al. 2409.06443 null
2024-09-10 An Attribute-Enriched Dataset and Auto-Annotated Pipeline for Open Detection Pengfei Qi et.al. 2409.06300 null
2024-09-09 Replay Consolidation with Label Propagation for Continual Object Detection Riccardo De Monte et.al. 2409.05650 null
2024-09-09 Renormalized Connection for Scale-preferred Object Detection in Satellite Imagery Fan Zhang et.al. 2409.05624 null
2024-09-09 LEROjD: Lidar Extended Radar-Only Object Detection Patrick Palmer et.al. 2409.05564 link
2024-09-09 Proto-OOD: Enhancing OOD Object Detection with Prototype Feature Similarity Junkun Chen et.al. 2409.05466 null
2024-09-09 Distribution Discrepancy and Feature Heterogeneity for Active 3D Object Detection Huang-Yu Chen et.al. 2409.05425 null
2024-09-08 A Low-Computational Video Synopsis Framework with a Standard Dataset Ramtin Malekpour et.al. 2409.05230 link
2024-09-08 Can OOD Object Detectors Learn from Foundation Models? Jiahui Liu et.al. 2409.05162 link
2024-09-08 WaterSeeker: Efficient Detection of Watermarked Segments in Large Documents Leyi Pan et.al. 2409.05112 null
2024-09-08 Visual Grounding with Multi-modal Conditional Adaptation Ruilin Yao et.al. 2409.04999 link
2024-09-08 Multi-V2X: A Large Scale Multi-modal Multi-penetration-rate Dataset for Cooperative Perception Rongsong Li et.al. 2409.04980 null
2024-09-06 Future Does Matter: Boosting 3D Object Detection with Temporal Motion Estimation in Point Cloud Sequences Rui Yu et.al. 2409.04390 null
2024-09-06 UniDet3D: Multi-dataset Indoor 3D Object Detection Maksim Kolodiazhnyi et.al. 2409.04234 link
2024-09-06 Feature Compression for Cloud-Edge Multimodal 3D Object Detection Chongzhen Tian et.al. 2409.04123 null
2024-09-06 D4: Text-guided diffusion model-based domain adaptive data augmentation for vineyard shoot detection Kentaro Hirahara et.al. 2409.04060 null
2024-09-06 BFA-YOLO: Balanced multiscale object detection network for multi-view building facade attachments detection Yangguang Chen et.al. 2409.04025 null
2024-09-05 LowFormer: Hardware Efficient Design for Convolutional Transformer Backbones Moritz Nottebaum et.al. 2409.03460 link
2024-09-05 Training-free Conversion of Pretrained ANNs to SNNs for Low-Power and High-Performance Applications Tong Bu et.al. 2409.03368 null
2024-09-05 YOLO-PPA based Efficient Traffic Sign Detection for Cruise Control in Autonomous Driving Jingyu Zhang et.al. 2409.03320 null
2024-09-05 Gr-IoU: Ground-Intersection over Union for Robust Multi-Object Tracking with 3D Geometric Constraints Keisuke Toida et.al. 2409.03252 null
2024-09-04 Boundless: Generating Photorealistic Synthetic Data for Object Detection in Urban Streetscapes Mehmet Kerem Turkcan et.al. 2409.03022 link
2024-09-04 Real-Time Dynamic Scale-Aware Fusion Detection Network: Take Road Damage Detection as an example Weichao Pan et.al. 2409.02546 null
2024-09-04 TP-GMOT: Tracking Generic Multiple Object by Textual Prompt with Motion-Appearance Cost (MAC) SORT Duy Le Dinh Anh et.al. 2409.02490 link
2024-09-04 Rapid Automatic Multiple Moving Objects Detection Method Based on Feature Extraction from Images with Non-sidereal Tracking Lei Wang et.al. 2409.02405 null
2024-09-04 Pluralistic Salient Object Detection Xuelu Feng et.al. 2409.02368 null
2024-09-03 Site Selection for the Second Flyeye Telescope: A Simulation Study for Optimizing Near-Earth Object Discovery D. Föhring et.al. 2409.02329 null
2024-09-03 K-Origins: Better Colour Quantification for Neural Networks Lewis Mason et.al. 2409.02281 null
2024-09-03 Evaluation and Comparison of Visual Language Models for Transportation Engineering Problems Sanjita Prajapati et.al. 2409.02278 null
2024-09-03 A Modern Take on Visual Relationship Reasoning for Grasp Planning Paolo Rabino et.al. 2409.02035 null
2024-09-03 Latent Distillation for Continual Object Detection at the Edge Francesco Pasti et.al. 2409.01872 link
2024-09-03 Real-Time Indoor Object Detection based on hybrid CNN-Transformer Approach Salah Eddine Laidoudi et.al. 2409.01871 null
2024-08-30 Structuring a Training Strategy to Robustify Perception Models with Realistic Image Augmentations Ahmed Hammam et.al. 2408.17311 null
2024-08-30 Hybrid Classification-Regression Adaptive Loss for Dense Object Detection Yanquan Huang et.al. 2408.17182 null
2024-08-30 UTrack: Multi-Object Tracking with Uncertain Detections Edgardo Solano-Carrillo et.al. 2408.17098 link
2024-08-30 PIB: Prioritized Information Bottleneck Framework for Collaborative Edge Video Analytics Zhengru Fang et.al. 2408.17047 null
2024-08-30 CP-VoteNet: Contrastive Prototypical VoteNet for Few-Shot Point Cloud Object Detection Xuejing Li et.al. 2408.17036 null
2024-08-30 MakeWay: Object-Aware Costmaps for Proactive Indoor Navigation Using LiDAR Binbin Xu et.al. 2408.17034 null
2024-08-29 Analyzing Errors in Controlled Turret System Given Target Location Input from Artificial Intelligence Methods in Automatic Target Recognition Matthew Karlson et.al. 2408.16923 null
2024-08-29 Space3D-Bench: Spatial 3D Question Answering Benchmark Emilia Szymanska et.al. 2408.16662 null
2024-08-29 SODAWideNet++: Combining Attention and Convolutions for Salient Object Detection Rohit Venkata Sai Dulam et.al. 2408.16645 null
2024-08-29 UAV-Based Human Body Detector Selection and Fusion for Geolocated Saliency Map Generation Piotr Rudol et.al. 2408.16501 null
2024-08-29 Weakly Supervised Object Detection for Automatic Tooth-marked Tongue Recognition Yongcun Zhang et.al. 2408.16451 link
2024-08-29 Enhancing Sound Source Localization via False Negative Elimination Zengjie Song et.al. 2408.16448 link
2024-08-29 High-yield large-scale suspended graphene membranes over closed cavities for sensor applications Sebastian Lukas et.al. 2408.16408 null
2024-08-29 FA-YOLO: Research On Efficient Feature Selection YOLO Improved Algorithm Based On FMDS and AGMF Modules Yukang Huo et.al. 2408.16313 null
2024-08-29 Anno-incomplete Multi-dataset Detection Yiran Xu et.al. 2408.16247 null
2024-08-29 PolarBEVDet: Exploring Polar Representation for Multi-View 3D Object Detection in Bird's-Eye-View Zichen Yu et.al. 2408.16200 null
2024-08-28 ChartEye: A Deep Learning Framework for Chart Information Extraction Osama Mustafa et.al. 2408.16123 null
2024-08-28 microYOLO: Towards Single-Shot Object Detection on Microcontrollers Mark Deutel et.al. 2408.15865 null
2024-08-28 What is YOLOv8: An In-Depth Exploration of the Internal Features of the Next-Generation Object Detector Muhammad Yaseen et.al. 2408.15857 null
2024-08-28 Network transferability of adversarial patches in real-time object detection Jens Bayer et.al. 2408.15833 link
2024-08-28 Object Detection for Vehicle Dashcams using Transformers Osama Mustafa et.al. 2408.15809 null
2024-08-29 RIDE: Boosting 3D Object Detection for LiDAR Point Clouds via Rotation-Invariant Analysis Zhaoxuan Wang et.al. 2408.15643 null
2024-08-28 MMDRFuse: Distilled Mini-Model with Dynamic Refresh for Multi-Modality Image Fusion Yanglin Deng et.al. 2408.15641 link
2024-08-28 Semantic and goal-oriented edge computing for satellite Earth Observation Beatriz Soret et.al. 2408.15639 null
2024-08-28 Transfer Learning from Simulated to Real Scenes for Monocular 3D Object Detection Sondos Mohamed et.al. 2408.15637 null
2024-08-28 Can Visual Language Models Replace OCR-Based Visual Question Answering Pipelines in Production? A Case Study in Retail Bianca Lamm et.al. 2408.15626 null
2024-08-28 RoboSense: Large-scale Dataset and Benchmark for Multi-sensor Low-speed Autonomous Driving Haisheng Su et.al. 2408.15503 null
2024-08-27 A Review of Transformer-Based Models for Computer Vision Tasks: Capturing Global Context and Spatial Relationships Gracile Astlin Pereira et.al. 2408.15178 null
2024-08-27 Adapting Segment Anything Model to Multi-modal Salient Object Detection with Semantic Feature Fusion Guidance Kunpeng Wang et.al. 2408.15063 null
2024-08-27 Hierarchical Graph Interaction Transformer with Dynamic Token Clustering for Camouflaged Object Detection Siyuan Yao et.al. 2408.15020 link
2024-08-27 Knowledge Discovery in Optical Music Recognition: Enhancing Information Retrieval with Instance Segmentation Elona Shatri et.al. 2408.15002 null
2024-08-27 BOX3D: Lightweight Camera-LiDAR Fusion for 3D Object Detection and Localization Mario A. V. Saucedo et.al. 2408.14941 null
2024-08-26 PVAFN: Point-Voxel Attention Fusion Network with Multi-Pooling Enhancing for 3D Object Detection Yidi Li et.al. 2408.14600 null
2024-08-26 A Survey of Camouflaged Object Detection and Beyond Fengyang Xiao et.al. 2408.14562 null
2024-08-26 Beyond Few-shot Object Detection: A Detailed Survey Vishal Chudasama et.al. 2408.14249 null
2024-08-26 TC-PDM: Temporally Consistent Patch Diffusion Models for Infrared-to-Visible Video Translation Anh-Dzung Doan et.al. 2408.14227 null
2024-08-26 EMDFNet: Efficient Multi-scale and Diverse Feature Network for Traffic Sign Detection Pengyu Li et.al. 2408.14189 null
2024-08-26 More Pictures Say More: Visual Intersection Network for Open Set Object Detection Bingcheng Dong et.al. 2408.14032 null
2024-08-25 Bridging the Gap between Real-world and Synthetic Images for Testing Autonomous Driving Systems Mohammad Hossein Amini et.al. 2408.13950 null
2024-08-25 OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation Muhammad Rameez ur Rahman et.al. 2408.13936 link
2024-08-25 Infrared Domain Adaptation with Zero-Shot Quantization Burak Sevsay et.al. 2408.13925 null
2024-08-25 TraIL-Det: Transformation-Invariant Local Feature Networks for 3D LiDAR Object Detection with Unsupervised Pre-Training Li Li et.al. 2408.13902 null
2024-08-25 Selectively Dilated Convolution for Accuracy-Preserving Sparse Pillar-based Embedded 3D Object Detection Seongmin Park et.al. 2408.13798 null
2024-08-24 Mean Height Aided Post-Processing for Pedestrian Detection Jing Yuan et.al. 2408.13646 null
2024-08-23 MCTR: Multi Camera Tracking Transformer Alexandru Niculescu-Mizil et.al. 2408.13243 null
2024-08-23 DeTPP: Leveraging Object Detection for Robust Long-Horizon Event Prediction Ivan Karpukhin et.al. 2408.13131 null
2024-08-23 VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models Wentao Wu et.al. 2408.13031 link
2024-08-23 Can AI Assistance Aid in the Grading of Handwritten Answer Sheets? Pritam Sil et.al. 2408.12870 null
2024-08-23 Symmetric masking strategy enhances the performance of Masked Image Modeling Khanh-Binh Nguyen et.al. 2408.12772 null
2024-08-22 CatFree3D: Category-agnostic 3D Object Detection with Diffusion Wenjing Bian et.al. 2408.12747 null
2024-08-22 Revisiting Cross-Domain Problem for LiDAR-based 3D Object Detection Ruixiao Zhang et.al. 2408.12708 null
2024-08-22 xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations Can Qin et.al. 2408.12590 null
2024-08-22 Enhanced Parking Perception by Multi-Task Fisheye Cross-view Transformers Antonyo Musabini et.al. 2408.12575 null
2024-08-22 Comparing YOLOv5 Variants for Vehicle Detection: A Performance Analysis Athulya Sundaresan Geetha et.al. 2408.12550 null
2024-08-22 UMAD: University of Macau Anomaly Detection Benchmark Dataset Dong Li et.al. 2408.12527 link
2024-08-22 Class-balanced Open-set Semi-supervised Object Detection for Medical Images Zhanyun Lu et.al. 2408.12355 null
2024-08-22 OVA-DETR: Open Vocabulary Aerial Object Detection Using Image-Text Alignment and Fusion Guoting Wei et.al. 2408.12246 null
2024-08-22 On the Credibility of Backdoor Attacks Against Object Detectors in the Physical World Bao Gia Doan et.al. 2408.12122 null
2024-08-21 CARLA Drone: Monocular 3D Object Detection from a Different Perspective Johannes Meier et.al. 2408.11958 null
2024-08-21 SBDet: A Symmetry-Breaking Object Detector via Relaxed Rotation-Equivariance Zhiqiang Wu et.al. 2408.11760 null
2024-08-21 Video-to-Text Pedestrian Monitoring (VTPM): Leveraging Computer Vision and Large Language Models for Privacy-Preserve Pedestrian Activity Monitoring at Intersections Ahmed S. Abdelrahman et.al. 2408.11649 null
2024-08-21 Domain-invariant Progressive Knowledge Distillation for UAV-based Object Detection Liang Yao et.al. 2408.11407 null
2024-08-20 On the Potential of Open-Vocabulary Models for Object Detection in Unusual Street Scenes Sadia Ilyas et.al. 2408.11221 null
2024-08-20 Quantum Inverse Contextual Vision Transformers (Q-ICVT): A New Frontier in 3D Object Detection for AVs Sanjay Bhargav Dharavath et.al. 2408.11207 link
2024-08-20 A Closer Look at Data Augmentation Strategies for Finetuning-Based Low/Few-Shot Object Detection Vladislav Li et.al. 2408.10940 null
2024-08-20 Aligning Object Detector Bounding Boxes with Human Preference Ombretta Strafforello et.al. 2408.10844 null
2024-08-20 LightMDETR: A Lightweight Approach for Low-Cost Open-Vocabulary Object Detection Training Binta Sow et.al. 2408.10787 null
2024-08-20 Just a Hint: Point-Supervised Camouflaged Object Detection Huafeng Chen et.al. 2408.10777 null
2024-08-21 Generative AI in Industrial Machine Vision -- A Review Hans Aoyang Zhou et.al. 2408.10775 null
2024-08-20 Detection of Intracranial Hemorrhage for Trauma Patients Antoine P. Sanner et.al. 2408.10768 null
2024-08-20 SAM-COD: SAM-guided Unified Framework for Weakly-Supervised Camouflaged Object Detection Huafeng Chen et.al. 2408.10760 null
2024-08-20 Leveraging Temporal Contexts to Enhance Vehicle-Infrastructure Cooperative Perception Jiaru Zhong et.al. 2408.10531 null
2024-08-19 Leveraging Superfluous Information in Contrastive Representation Learning Xuechu Yu et.al. 2408.10292 null
2024-08-19 SHARP: Segmentation of Hands and Arms by Range using Pseudo-Depth for Enhanced Egocentric 3D Hand Pose Estimation and Action Recognition Wiktor Mucha et.al. 2408.10037 null
2024-08-19 Segment-Anything Models Achieve Zero-shot Robustness in Autonomous Driving Jun Yan et.al. 2408.09839 link
2024-08-19 Latent Diffusion for Guided Document Table Generation Syed Jawwad Haider Hamdani et.al. 2408.09800 null
2024-08-18 Adversarial Attacked Teacher for Unsupervised Domain Adaptive Object Detection Kaiwen Wang et.al. 2408.09431 null
2024-08-18 Boundary-Recovering Network for Temporal Action Detection Jihwan Kim et.al. 2408.09354 null
2024-08-18 YOLOv1 to YOLOv10: The fastest and most accurate real-time object detection systems Chien-Yao Wang et.al. 2408.09332 null
2024-08-17 GSLAMOT: A Tracklet and Query Graph-based Simultaneous Locating, Mapping, and Multiple Object Tracking System Shuo Wang et.al. 2408.09191 null
2024-08-17 PADetBench: Towards Benchmarking Physical Attacks against Object Detection Jiawei Lian et.al. 2408.09181 link
2024-08-17 MaskBEV: Towards A Unified Framework for BEV Detection and Map Segmentation Xiao Zhao et.al. 2408.09122 null
2024-08-17 Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community Jiancheng Pan et.al. 2408.09110 null
2024-08-16 SAM2-UNet: Segment Anything 2 Makes Strong Encoder for Natural and Medical Image Segmentation Xinyu Xiong et.al. 2408.08870 link
2024-08-16 Multimodal Relational Triple Extraction with Query-based Entity Object Transformer Lei Hei et.al. 2408.08709 null
2024-08-16 Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs Jinming Liu et.al. 2408.08575 null
2024-08-15 5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks Dongshuo Yin et.al. 2408.08345 link
2024-08-15 Learned Multimodal Compression for Autonomous Driving Hadi Hadizadeh et.al. 2408.08211 null
2024-08-16 OC3D: Weakly Supervised Outdoor 3D Object Detection with Only Coarse Click Annotation Qiming Xia et.al. 2408.08092 null
2024-08-15 CamoTeacher: Dual-Rotation Consistency Learning for Semi-Supervised Camouflaged Object Detection Xunfa Lai et.al. 2408.08050 null
2024-08-15 Co-Fix3D: Enhancing 3D Object Detection with Collaborative Refinement Wenxuan Li et.al. 2408.07999 null
2024-08-15 GOReloc: Graph-based Object-Level Relocalization for Visual SLAM Yutong Wang et.al. 2408.07917 link
2024-08-14 See It All: Contextualized Late Aggregation for 3D Dense Captioning Minjung Kim et.al. 2408.07648 null
2024-08-14 Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving Yuqing Wen et.al. 2408.07605 null
2024-08-14 Infra-YOLO: Efficient Neural Network Structure with Model Compression for Real-Time Infrared Small Object Detection Zhonglin Chen et.al. 2408.07455 null
2024-08-14 Sign language recognition based on deep learning and low-cost handcrafted descriptors Alvaro Leandro Cavalcante Carneiro et.al. 2408.07244 link
2024-08-13 Vision Language Model for Interpretable and Fine-grained Detection of Safety Compliance in Diverse Workplaces Zhiling Chen et.al. 2408.07146 null
2024-08-13 Divide and Conquer: Improving Multi-Camera 3D Perception with 2D Semantic-Depth Priors and Input-Dependent Queries Qi Song et.al. 2408.06901 null
2024-08-13 Integrating Saliency Ranking and Reinforcement Learning for Enhanced Object Detection Matthias Bartolo et.al. 2408.06803 link
2024-08-13 Exploring Domain Shift on Radar-Based 3D Object Detection Amidst Diverse Environmental Conditions Miao Zhang et.al. 2408.06772 null
2024-08-13 Unified-IoU: For High-Quality Object Detection Xiangjie Luo et.al. 2408.06636 link
2024-08-13 A lightweight YOLOv5-FFM model for occlusion pedestrian detection Xiangjie Luo et.al. 2408.06633 null
2024-08-13 MV-DETR: Multi-modality indoor object detection by Multi-View DEtecton TRansformers Zichao Dong et.al. 2408.06604 null
2024-08-12 Latent Disentanglement for Low Light Image Enhancement Zhihao Zheng et.al. 2408.06245 null
2024-08-12 MR3D-Net: Dynamic Multi-Resolution 3D Sparse Voxel Grid Fusion for LiDAR-Based Collective Perception Sven Teufel et.al. 2408.06137 link
2024-08-12 DPDETR: Decoupled Position Detection Transformer for Infrared-Visible Object Detection Junjie Guo et.al. 2408.06123 null
2024-08-12 Optimizing Vision Transformers with Data-Free Knowledge Transfer Gousia Habib et.al. 2408.05952 null
2024-08-12 MV2DFusion: Leveraging Modality-Specific Object Semantics for Multi-Modal 3D Detection Zitian Wang et.al. 2408.05945 null
2024-08-12 Multi-scale Contrastive Adaptor Learning for Segmenting Anything in Underperformed Scenes Ke Zhou et.al. 2408.05936 null
2024-08-12 Weakly Supervised Video Anomaly Detection and Localization with Spatio-Temporal Prompts Peng Wu et.al. 2408.05905 null
2024-08-12 Toward Pedestrian Head Tracking: A Benchmark Dataset and an Information Fusion Network Kailai Sun et.al. 2408.05877 null
2024-08-11 U-DECN: End-to-End Underwater Object Detection ConvNet with Improved DeNoising Training Zhuoyan Liu et.al. 2408.05780 link
2024-08-11 FADE: A Dataset for Detecting Falling Objects around Buildings in Video Zhigang Tu et.al. 2408.05750 null
2024-08-09 DeepInteraction++: Multi-Modality Interaction for Autonomous Driving Zeyu Yang et.al. 2408.05075 link
2024-08-09 RadarPillars: Efficient Object Detection from 4D Radar Point Clouds Alexander Musiat et.al. 2408.05020 null
2024-08-09 Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation Yifan Feng et.al. 2408.04804 link
2024-08-08 SOD-YOLOv8 -- Enhancing YOLOv8 for Small Object Detection in Traffic Scenes Boshra Khalili et.al. 2408.04786 null
2024-08-08 Data-Driven Pixel Control: Challenges and Prospects Saurabh Farkya et.al. 2408.04767 null
2024-08-10 SAM2-Adapter: Evaluating & Adapting Segment Anything 2 in Downstream Tasks: Camouflage, Shadow, Medical Image Segmentation, and More Tianrun Chen et.al. 2408.04579 null
2024-08-07 Impact Analysis of Data Drift Towards The Development of Safety-Critical Automotive System Md Shahi Amran Hossain et.al. 2408.04476 null
2024-08-08 Detecting Car Speed using Object Detection and Depth Estimation: A Deep Learning Framework Subhasis Dasgupta et.al. 2408.04360 null
2024-08-08 Multi-Scale and Detail-Enhanced Segment Anything Model for Salient Object Detection Shixuan Gao et.al. 2408.04326 null
2024-08-08 LLM-DetectAIve: a Tool for Fine-Grained Machine-Generated Text Detection Mervat Abassy et.al. 2408.04284 null
2024-08-08 Learning to Rewrite: Generalized LLM-Generated Text Detection Wei Hao et.al. 2408.04237 null
2024-08-07 PaveCap: The First Multimodal Framework for Comprehensive Pavement Condition Assessment with Dense Captioning and PCI Estimation Blessing Agyei Kyem et.al. 2408.04110 link
2024-08-07 Vision-Language Guidance for LiDAR-based Unsupervised 3D Object Detection Christian Fruhwirth-Reisinger et.al. 2408.03790 null
2024-08-07 Data Generation Scheme for Thermal Modality with Edge-Guided Adversarial Conditional Diffusion Model Guoqing Zhu et.al. 2408.03748 link
2024-08-07 CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications Tianfang Zhang et.al. 2408.03703 link
2024-08-07 L4DR: LiDAR-4DRadar Fusion for Weather-Robust 3D Object Detection Xun Huang et.al. 2408.03677 null
2024-08-07 Designing Extremely Memory-Efficient CNNs for On-device Vision Tasks Jaewook Lee et.al. 2408.03663 null
2024-08-07 Leveraging LLMs for Enhanced Open-Vocabulary 3D Scene Understanding in Autonomous Driving Amirhosein Chahe et.al. 2408.03516 null
2024-08-07 GUI Element Detection Using SOTA YOLO Deep Learning Models Seyed Shayan Daneshvar et.al. 2408.03507 null
2024-08-06 AI Foundation Models in Remote Sensing: A Survey Siqi Lu et.al. 2408.03464 null
2024-08-06 Biomedical Image Segmentation: A Systematic Literature Review of Deep Learning Based Object Detection Methods Fazli Wahid et.al. 2408.03393 null
2024-08-06 Nighttime Pedestrian Detection Based on Fore-Background Contrast Learning He Yao et.al. 2408.03030 null
2024-08-06 Diverse Generation while Maintaining Semantic Coordination: A Diffusion-Based Data Augmentation Method for Object Detection Sen Nie et.al. 2408.02891 null
2024-08-05 HQOD: Harmonious Quantization for Object Detection Long Huang et.al. 2408.02561 null
2024-08-05 Tensorial template matching for fast cross-correlation with rotations and its application for tomography Antonio Martinez-Sanchez et.al. 2408.02398 null
2024-08-05 Mixture-of-Noises Enhanced Forgery-Aware Predictor for Multi-Face Manipulation Detection and Localization Changtao Miao et.al. 2408.02306 null
2024-08-05 AssemAI: Interpretable Image-Based Anomaly Detection for Manufacturing Pipelines Renjith Prasad et.al. 2408.02181 null
2024-08-04 KAN-RCBEVDepth: A multi-modal fusion algorithm in object detection for autonomous driving Zhihao Lai et.al. 2408.02088 null
2024-08-06 A Survey and Evaluation of Adversarial Attacks for Object Detection Khoi Nguyen Tiet Nguyen et.al. 2408.01934 null
2024-08-04 CAF-YOLO: A Robust Framework for Multi-Scale Lesion Detection in Biomedical Imagery Zilin Chen et.al. 2408.01897 null
2024-08-03 Supervised Image Translation from Visible to Infrared Domain for Object Detection Prahlad Anand et.al. 2408.01843 null
2024-08-03 Domain penalisation for improved Out-of-Distribution Generalisation Shuvam Jena et.al. 2408.01746 null
2024-08-03 LAM3D: Leveraging Attention for Monocular 3D Object Detection Diana-Alexandra Sas et.al. 2408.01739 null
2024-08-02 A Robotics-Inspired Scanpath Model Reveals the Importance of Uncertainty and Semantic Object Cues for Gaze Guidance in Dynamic Scenes Vito Mengers et.al. 2408.01322 null
2024-08-02 Underwater Object Detection Enhancement via Channel Stabilization Muhammad Ali et.al. 2408.01293 null
2024-08-02 PGNeXt: High-Resolution Salient Object Detection via Pyramid Grafting Network Changqun Xia et.al. 2408.01137 null
2024-08-02 Effect of Fog Particle Size Distribution on 3D Object Detection Under Adverse Weather Conditions Ajinkya Shinde et.al. 2408.01085 null
2024-08-02 Boosting Gaze Object Prediction via Pixel-level Supervision from Vision Foundation Model Yang Jin et.al. 2408.01044 null
2024-08-02 MambaST: A Plug-and-Play Cross-Spectral Spatial-Temporal Fuser for Efficient Pedestrian Detection Xiangbo Gao et.al. 2408.01037 null
2024-08-02 Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach Yabin Zhu et.al. 2408.00969 null
2024-08-01 Joint Neural Networks for One-shot Object Recognition and Detection Camilo J. Vargas et.al. 2408.00701 null
2024-08-01 Harnessing Uncertainty-aware Bounding Boxes for Unsupervised 3D Object Detection Ruiyang Zhang et.al. 2408.00619 null
2024-08-01 U2UData: A Large-scale Cooperative Perception Dataset for Swarm UAVs Autonomous Flight Tongtong Feng et.al. 2408.00606 null
2024-08-01 MUFASA: Multi-View Fusion and Adaptation Network with Spatial Awareness for Radar Object Detection Xiangyuan Peng et.al. 2408.00565 null
2024-08-01 Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text Retrieval Gangyan Zeng et.al. 2408.00441 null
2024-08-01 MonoMM: A Multi-scale Mamba-Enhanced Network for Real-time Monocular 3D Object Detection Youjia Fu et.al. 2408.00438 null
2024-08-01 DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training Yu Xie et.al. 2408.00355 null
2024-08-01 A Simple Background Augmentation Method for Object Detection with Diffusion Model Yuhang Li et.al. 2408.00350 null
2024-08-01 Diff3DETR:Agent-based Diffusion Model for Semi-supervised 3D Object Detection Jiacheng Deng et.al. 2408.00286 null
2024-08-01 RoCo:Robust Collaborative Perception By Iterative Object Matching and Pose Adjustment Zhe Huang et.al. 2408.00257 null
2024-07-31 Dynamic Object Queries for Transformer-based Incremental Object Detection Jichuan Zhang et.al. 2407.21687 null
2024-07-31 Spatial Transformer Network YOLO Model for Agricultural Object Detection Yash Zambre et.al. 2407.21652 null
2024-07-31 Evaluating SAM2's Role in Camouflaged Object Detection: From SAM to SAM2 Lv Tang et.al. 2407.21596 null
2024-07-31 InScope: A New Real-world 3D Infrastructure-side Collaborative Perception Dataset for Open Traffic Scenarios Xiaofei Zhang et.al. 2407.21581 null
2024-07-31 Voxel Scene Graph for Intracranial Hemorrhage Antoine P. Sanner et.al. 2407.21580 null
2024-07-31 MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection Kuo Wang et.al. 2407.21465 null
2024-07-31 Generalized Tampered Scene Text Detection in the era of Generative AI Chenfan Qu et.al. 2407.21422 null
2024-07-30 Candidate Distant Trans-Neptunian Objects Detected by the New Horizons Subaru TNO Survey Wesley C. Fraser et.al. 2407.21142 null
2024-07-30 What is YOLOv5: A deep look into the internal features of the popular object detector Rahima Khanam et.al. 2407.20892 null
2024-07-30 WARM-3D: A Weakly-Supervised Sim2Real Domain Adaptation Framework for Roadside Monocular 3D Object Detection Xingcheng Zhou et.al. 2407.20818 null
2024-07-31 Integer-Valued Training and Spike-Driven Inference Spiking Neural Network for High-performance and Energy-efficient Object Detection Xinhao Luo et.al. 2407.20708 link
2024-07-29 Uncertainty-Rectified YOLO-SAM for Weakly Supervised ICH Segmentation Pascal Spiegler et.al. 2407.20461 null
2024-07-29 MEVDT: Multi-Modal Event-Based Vehicle Detection and Tracking Dataset Zaid A. El Shair et.al. 2407.20446 null
2024-07-30 AxiomVision: Accuracy-Guaranteed Adaptive Visual Model Selection for Perspective-Aware Video Analytics Xiangxiang Dai et.al. 2407.20124 link
2024-07-29 Octave-YOLO: Cross frequency detection network with octave convolution Sangjune Shin et.al. 2407.19746 null
2024-07-29 Cross-Layer Feature Pyramid Transformer for Small Object Detection in Aerial Images Zewen Du et.al. 2407.19696 null
2024-07-29 Practical Video Object Detection via Feature Selection and Aggregation Yuheng Shi et.al. 2407.19650 link
2024-07-28 Solving Short-Term Relocalization Problems In Monocular Keyframe Visual SLAM Using Spatial And Semantic Data Azmyin Md. Kamal et.al. 2407.19518 link
2024-07-28 Depth-Wise Convolutions in Vision Transformers for Efficient Training on Small Datasets Tianxiao Zhang et.al. 2407.19394 link
2024-07-27 Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network Gang Pan et.al. 2407.19271 null
2024-07-27 Enhancing Tree Type Detection in Forest Fire Risk Assessment: Multi-Stage Approach and Color Encoding with Forest Fire Risk Evaluation Framework for UAV Imagery Jinda Zhang et.al. 2407.19184 null
2024-07-27 Reducing Spurious Correlation for Federated Domain Generalization Shuran Ma et.al. 2407.19174 null
2024-07-27 Robust Multimodal 3D Object Detection via Modality-Agnostic Decoding and Proximity-based Modality Ensemble Juhan Cha et.al. 2407.19156 link
2024-07-26 Local Binary Pattern(LBP) Optimization for Feature Extraction Zeinab Sedaghatjoo et.al. 2407.18665 null
2024-07-25 LION: Linear Group RNN for 3D Object Detection in Point Clouds Zhe Liu et.al. 2407.18232 link
2024-07-25 XS-VID: An Extremely Small Video Object Detection Dataset Jiahao Guo et.al. 2407.18137 null
2024-07-25 SaccadeDet: A Novel Dual-Stage Architecture for Rapid and Accurate Detection in Gigapixel Images Wenxi Li et.al. 2407.17956 null
2024-07-25 A Novel Perception Entropy Metric for Optimizing Vehicle Perception with LiDAR Deployment Yongjiang He et.al. 2407.17942 null
2024-07-25 Hierarchical Object Detection and Recognition Framework for Practical Plant Disease Diagnosis Kohei Iwano et.al. 2407.17906 null
2024-07-25 Advancing 3D Point Cloud Understanding through Deep Transfer Learning: A Comprehensive Survey Shahab Saquib Sohail et.al. 2407.17877 null
2024-07-25 Enhancing Fine-grained Object Detection in Aerial Images via Orthogonal Mapping Haoran Zhu et.al. 2407.17738 link
2024-07-26 Unsqueeze [CLS] Bottleneck to Learn Rich Representations Qing Su et.al. 2407.17671 link
2024-07-24 SDLNet: Statistical Deep Learning Network for Co-Occurring Object Detection and Identification Binay Kumar Singh et.al. 2407.17664 null
2024-07-24 PEEKABOO: Hiding parts of an image for unsupervised object localization Hasib Zunair et.al. 2407.17628 link
2024-07-24 ALPI: Auto-Labeller with Proxy Injection for 3D Object Detection using 2D Labels Only Saad Lahlali et.al. 2407.17197 null
2024-07-24 DVPE: Divided View Position Embedding for Multi-View 3D Object Detection Jiasen Wang et.al. 2407.16955 link
2024-07-23 What Matters in Range View 3D Object Detection Benjamin Wilson et.al. 2407.16789 link
2024-07-23 A Framework for Pupil Tracking with Event Cameras Khadija Iddrisu et.al. 2407.16665 null
2024-07-24 Velocity Driven Vision: Asynchronous Sensor Fusion Birds Eye View Models for Autonomous Vehicles Seamie Hayes et.al. 2407.16636 null
2024-07-23 COALA: A Practical and Vision-Centric Federated Learning Platform Weiming Zhuang et.al. 2407.16560 link
2024-07-23 Dynamic Retraining-Updating Mean Teacher for Source-Free Object Detection Trinh Le Ba Khanh et.al. 2407.16497 link
2024-07-23 MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection Youngmin Oh et.al. 2407.16448 link
2024-07-23 ESOD: Efficient Small Object Detection on High-Resolution Images Kai Liu et.al. 2407.16424 null
2024-07-23 Understanding Impacts of Electromagnetic Signal Injection Attacks on Object Detection Youqian Zhang et.al. 2407.16327 null
2024-07-23 DeepClean: Integrated Distortion Identification and Algorithm Selection for Rectifying Image Corruptions Aditya Kapoor et.al. 2407.16302 null
2024-07-23 FoRA: Low-Rank Adaptation Model beyond Multimodal Siamese Network Weiying Xie et.al. 2407.16129 link
2024-07-22 PLayerTV: Advanced Player Tracking and Identification for Automatic Soccer Highlight Clips Håkon Maric Solberg et.al. 2407.16076 null
2024-07-22 Disentangling spatio-temporal knowledge for weakly supervised object detection and segmentation in surgical video Guiqiu Liao et.al. 2407.15794 null
2024-07-22 Towards Open-World Object-based Anomaly Detection via Self-Supervised Outlier Synthesis Brian K. S. Isaac-Medina et.al. 2407.15763 null
2024-07-22 Counter Turing Test ( $CT^2$): Investigating AI-Generated Text Detection for Hindi -- Ranking LLMs based on Hindi AI Detectability Index ($ADI_{hi}$ ) Ishan Kavathekar et.al. 2407.15694 null
2024-07-22 YOLOv10 for Automated Fracture Detection in Pediatric Wrist Trauma X-rays Ammar Ahmed et.al. 2407.15689 link
2024-07-22 SS-SFR: Synthetic Scenes Spatial Frequency Response on Virtual KITTI and Degraded Automotive Simulations for Object Detection Daniel Jakab et.al. 2407.15646 null
2024-07-22 YOLO-pdd: A Novel Multi-scale PCB Defect Detection Method Using Deep Representations with Sequential Images Bowen Liu et.al. 2407.15427 null
2024-07-22 Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object Detection Zhili Chen et.al. 2407.15354 null
2024-07-22 Explore the LiDAR-Camera Dynamic Adjustment Fusion for 3D Object Detection Yiran Yang et.al. 2407.15334 null
2024-07-21 Weak-to-Strong Compositional Learning from Generative Models for Language-based Object Detection Kwanyong Park et.al. 2407.15296 null
2024-07-21 Multiple Object Detection and Tracking in Panoramic Videos for Cycling Safety Analysis Jingwei Guo et.al. 2407.15199 null
2024-07-19 Enhancing Layout Hotspot Detection Efficiency with YOLOv8 and PCA-Guided Augmentation Dongyang Wu et.al. 2407.14498 null
2024-07-19 MLMT-CNN for Object Detection and Segmentation in Multi-layer and Multi-spectral Images Majedaldein Almahasneh et.al. 2407.14473 null
2024-07-19 EmoCAM: Toward Understanding What Drives CNN-based Emotion Recognition Youssef Doulfoukar et.al. 2407.14314 null
2024-07-19 Bucketed Ranking-based Losses for Efficient Training of Object Detectors Feyza Yavuz et.al. 2407.14204 link
2024-07-19 Visual Text Generation in the Wild Yuanzhi Zhu et.al. 2407.14138 link
2024-07-18 GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model Abdelrahman Shaker et.al. 2407.13772 link
2024-07-18 General Geometry-aware Weakly Supervised 3D Object Detection Guowen Zhang et.al. 2407.13748 link
2024-07-18 Enhancing Source-Free Domain Adaptive Object Detection with Low-confidence Pseudo Label Distillation Ilhoon Yoon et.al. 2407.13524 link
2024-07-18 The use of the symmetric finite difference in the local binary pattern (symmetric LBP) Zeinab Sedaghatjoo et.al. 2407.13178 null
2024-07-18 Learning Camouflaged Object Detection from Noisy Pseudo Label Jin Zhang et.al. 2407.13157 null
2024-07-18 DFMSD: Dual Feature Masking Stage-wise Knowledge Distillation for Object Detection Zhourui Zhang et.al. 2407.13147 null
2024-07-18 FocusDiffuser: Perceiving Local Disparities for Camouflaged Object Detection Jianwei Zhao et.al. 2407.13133 null
2024-07-17 AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer Zhuguanyu Wu et.al. 2407.12951 link
2024-07-17 Toward INT4 Fixed-Point Training via Exploring Quantization Error for Gradients Dohyung Kim et.al. 2407.12637 null
2024-07-17 CerberusDet: Unified Multi-Task Object Detection Irina Tolstykh et.al. 2407.12632 link
2024-07-17 Weighting Pseudo-Labels via High-Activation Feature Index Similarity and Object Detection for Semi-Supervised Segmentation Prantik Howlader et.al. 2407.12630 link
2024-07-17 Enhancing Wrist Abnormality Detection with YOLO: Analysis of State-of-the-art Single-stage Detection Models Ammar Ahmed et.al. 2407.12597 link
2024-07-17 Embracing Events and Frames with Hierarchical Feature Refinement Network for Object Detection Hu Cao et.al. 2407.12582 null
2024-07-17 Close the Sim2real Gap via Physically-based Structured Light Synthetic Data Simulation Kaixin Bai et.al. 2407.12449 null
2024-07-17 GLARE: Low Light Image Enhancement via Generative Latent Feature based Codebook Retrieval Han Zhou et.al. 2407.12431 link
2024-07-17 Exploring Deeper! Segment Anything Model with Depth Perception for Camouflaged Object Detection Zhenni Yu et.al. 2407.12339 null
2024-07-16 AFIDAF: Alternating Fourier and Image Domain Adaptive Filters as an Efficient Alternative to Attention in ViTs Yunling Zheng et.al. 2407.12217 null
2024-07-16 The object detection method aids in image reconstruction evaluation and clinical interpretation of meniscal abnormalities Natalia Konovalova et.al. 2407.12184 null
2024-07-16 A Case for Application-Aware Space Radiation Tolerance in Orbital Computing Meiqi Wang et.al. 2407.11853 null
2024-07-16 Improving Unsupervised Video Object Segmentation via Fake Flow Generation Suhwan Cho et.al. 2407.11714 link
2024-07-16 Relation DETR: Exploring Explicit Position Relation Prior for Object Detection Xiuquan Hou et.al. 2407.11699 link
2024-07-16 Bridge Past and Future: Overcoming Information Asymmetry in Incremental Object Detection Qijie Mo et.al. 2407.11499 null
2024-07-16 Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes Zhi Cai et.al. 2407.11464 link
2024-07-16 Generative AI Driven Task-Oriented Adaptive Semantic Communications Yuzhou Fu et.al. 2407.11354 null
2024-07-16 LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction Penghui Du et.al. 2407.11335 null
2024-07-16 TCFormer: Visual Recognition via Token Clustering Transformer Wang Zeng et.al. 2407.11321 link
2024-07-16 PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer Pierre-David Letourneau et.al. 2407.11306 null
2024-07-15 OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models Zijian Zhou et.al. 2407.11213 null
2024-07-15 Interpreting Hand gestures using Object Detection and Digits Classification Sangeetha K et.al. 2407.10902 null
2024-07-15 RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception Chunliang Li et.al. 2407.10876 link
2024-07-15 OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection Jinghua Hou et.al. 2407.10753 null
2024-07-15 Anticipating Future Object Compositions without Forgetting Youssef Zahran et.al. 2407.10723 null
2024-07-15 OVLW-DETR: Open-Vocabulary Light-Weighted Detection Transformer Yu Wang et.al. 2407.10655 link
2024-07-15 Backdoor Attacks against Image-to-Image Networks Wenbo Jiang et.al. 2407.10445 null
2024-07-14 Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data Tuo Feng et.al. 2407.10200 link
2024-07-14 LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection Sanmin Kim et.al. 2407.10164 link
2024-07-14 FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection Zheng Jiang et.al. 2407.10135 null
2024-07-14 When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset Yi Zhang et.al. 2407.10125 null
2024-07-12 DART: An Automated End-to-End Object Detection Pipeline with Data Diversification, Open-Vocabulary Bounding Box Annotation, Pseudo-Label Review, and Model Training Chen Xin et.al. 2407.09174 link
2024-07-12 Open Vocabulary Multi-Label Video Classification Rohit Gupta et.al. 2407.09073 null
2024-07-12 DroneMOT: Drone-based Multi-Object Tracking Considering Detection Difficulties and Simultaneous Moving of Drones and Objects Peng Wang et.al. 2407.09051 null
2024-07-12 Task-driven single-image super-resolution reconstruction of document scans Maciej Zyrek et.al. 2407.08993 null
2024-07-11 OmniNOCS: A unified NOCS dataset and model for 3D lifting of 2D objects Akshay Krishnan et.al. 2407.08711 null
2024-07-11 Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene Ruiyang Zhang et.al. 2407.08569 link
2024-07-11 Projecting Points to Axes: Oriented Object Detection via Point-Axis Representation Zeyang Zhao et.al. 2407.08489 null
2024-07-11 Semi-Supervised Object Detection: A Survey on Progress from CNN to Transformer Tahira Shehzadi et.al. 2407.08460 null
2024-07-11 PowerYOLO: Mixed Precision Model for Hardware Efficient Object Detection with Event Data Dominika Przewlocka-Rus et.al. 2407.08272 null
2024-07-11 Knowledge distillation to effectively attain both region-of-interest and global semantics from an image where multiple objects appear Seonwhee Jin et.al. 2407.08257 link
2024-07-11 Enrich the content of the image Using Context-Aware Copy Paste Qiushi Guo et.al. 2407.08151 null
2024-07-11 DMM: Disparity-guided Multispectral Mamba for Oriented Object Detection in Remote Sensing Minghang Zhou et.al. 2407.08132 null
2024-07-10 MambaVision: A Hybrid Mamba-Transformer Vision Backbone Ali Hatamizadeh et.al. 2407.08083 link
2024-07-10 Bayesian Detector Combination for Object Detection with Crowdsourced Annotations Zhi Qin Tan et.al. 2407.07958 link
2024-07-10 Cross Domain Object Detection via Multi-Granularity Confidence Alignment based Mean Teacher Jiangming Chen et.al. 2407.07780 null
2024-07-10 LSM: A Comprehensive Metric for Assessing the Safety of Lane Detection Systems in Autonomous Driving Jörg Gamerdinger et.al. 2407.07740 null
2024-07-10 Few-Shot Domain Adaptive Object Detection for Microscopic Images Sumayya Inayat et.al. 2407.07633 null
2024-07-10 Simplifying Source-Free Domain Adaptation for Object Detection: Effective Self-Training Strategies and Performance Insights Yan Hao et.al. 2407.07586 link
2024-07-09 Exploring Camera Encoder Designs for Autonomous Driving Perception Barath Lakshmanan et.al. 2407.07276 null
2024-07-09 ConvNLP: Image-based AI Text Detection Suriya Prakash Jambunathan et.al. 2407.07225 null
2024-07-09 Category-level Object Detection, Pose Estimation and Reconstruction from Stereo Images Chuanrui Zhang et.al. 2407.06984 null
2024-07-09 Cue Point Estimation using Object Detection Giulia Argüello et.al. 2407.06823 link
2024-07-09 CoLA: Conditional Dropout and Language-driven Robust Dual-modal Salient Object Detection Shuang Hao et.al. 2407.06780 link
2024-07-09 Graph-Based Captioning: Enhancing Visual Descriptions by Interconnecting Region Captions Yu-Guan Hsieh et.al. 2407.06723 null
2024-07-08 Stochastic Traveling Salesperson Problem with Neighborhoods for Object Detection Cheng Peng et.al. 2407.06366 null
2024-07-08 GeoWATCH for Detecting Heavy Construction in Heterogeneous Time Series of Satellite Images Jon Crall et.al. 2407.06337 null
2024-07-08 Multi-clue Consistency Learning to Bridge Gaps Between General and Oriented Object in Semi-supervised Detection Chenxu Wang et.al. 2407.05909 link
2024-07-08 Boosting 3D Object Detection with Semantic-Aware Multi-Branch Framework Hao Jing et.al. 2407.05769 null
2024-07-08 Short-term Object Interaction Anticipation with Disentangled Object Detection @ Ego4D Short Term Object Interaction Anticipation Challenge Hyunjin Cho et.al. 2407.05713 link
2024-07-08 Weakly Supervised Test-Time Domain Adaptation for Object Detection Anh-Dzung Doan et.al. 2407.05607 null
2024-07-08 Towards Reflected Object Detection: A Benchmark Zhongtian Wang et.al. 2407.05575 null
2024-07-08 GMC: A General Framework of Multi-stage Context Learning and Utilization for Visual Detection Tasks Xuan Wang et.al. 2407.05566 null
2024-07-07 CLAMP-ViT: Contrastive Data-Free Learning for Adaptive Post-Training Quantization of ViTs Akshat Ramachandran et.al. 2407.05266 link
2024-07-07 Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image Pengkun Jiao et.al. 2407.05256 null
2024-07-06 SCSA: Exploring the Synergistic Effects Between Spatial and Channel Attention Yunzhong Si et.al. 2407.05128 null
2024-07-06 Quantizing YOLOv7: A Comprehensive Study Mohammadamin Baghbanbashi et.al. 2407.04943 null
2024-07-05 SH17: A Dataset for Human Safety and Personal Protective Equipment Detection in Manufacturing Industry Hafiz Mughees Ahmad et.al. 2407.04590 link
2024-07-05 Optimizing the image correction pipeline for pedestrian detection in the thermal-infrared domain Christophe Karam et.al. 2407.04484 null
2024-07-05 Multi-Branch Auxiliary Fusion YOLO with Re-parameterization Heterogeneous Convolutional for accurate object detection Zhiqiang Yang et.al. 2407.04381 link
2024-07-05 Towards Stable 3D Object Detection Jiabao Wang et.al. 2407.04305 null
2024-07-05 Research, Applications and Prospects of Event-Based Pedestrian Detection: A Survey Han Wang et.al. 2407.04277 null
2024-07-04 LiDAR-based Real-Time Object Detection and Tracking in Dynamic Environments Wenqiang Du et.al. 2407.04115 null
2024-07-04 FIPGNet:Pyramid grafting network with feature interaction strategies Ziyi Ding et.al. 2407.04085 null
2024-07-04 Detect Closer Surfaces that can be Seen: New Modeling and Evaluation in Cross-domain 3D Object Detection Ruixiao Zhang et.al. 2407.04061 null
2024-07-04 The Solution for the GAIIC2024 RGB-TIR object detection Challenge Xiangyu Wu et.al. 2407.03872 null
2024-07-04 StreamLTS: Query-based Temporal-Spatial LiDAR Fusion for Cooperative Object Detection Yunshuang Yuan et.al. 2407.03825 null
2024-07-03 Visual Grounding with Attention-Driven Constraint Balancing Weitai Kang et.al. 2407.03243 null
2024-07-03 Category-Aware Dynamic Label Assignment with High-Quality Oriented Proposal Mingkui Feng et.al. 2407.03205 null
2024-07-03 SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding Weitai Kang et.al. 2407.03200 link
2024-07-03 Global Context Modeling in YOLOv8 for Pediatric Wrist Fracture Detection Rui-Yang Ju et.al. 2407.03163 link
2024-07-03 YOLOv5, YOLOv8 and YOLOv10: The Go-To Detectors for Real-time Vision Muhammad Hussain et.al. 2407.02988 null
2024-07-03 Mast Kalandar at SemEval-2024 Task 8: On the Trail of Textual Origins: RoBERTa-BiLSTM Approach to Detect AI-Generated Text Jainit Sushil Bafna et.al. 2407.02978 null
2024-07-03 A Pairwise DomMix Attentive Adversarial Network for Unsupervised Domain Adaptive Object Detection Jie Shao et.al. 2407.02835 null
2024-07-03 ADFQ-ViT: Activation-Distribution-Friendly Post-Training Quantization for Vision Transformers Yanfeng Jiang et.al. 2407.02763 null
2024-07-02 SMILe: Leveraging Submodular Mutual Information For Robust Few-Shot Object Detection Anay Majee et.al. 2407.02665 null
2024-07-02 Robust ADAS: Enhancing Robustness of Machine Learning-based Advanced Driver Assistance Systems for Adverse Weather Muhammad Zaeem Shahzad et.al. 2407.02581 null
2024-07-02 Similarity Distance-Based Label Assignment for Tiny Object Detection Shuohao Shi et.al. 2407.02394 link
2024-07-02 OpenSlot: Mixed Open-set Recognition with Object-centric Learning Xu Yin et.al. 2407.02386 null
2024-07-02 DM3D: Distortion-Minimized Weight Pruning for Lossless 3D Object Detection Kaixin Xu et.al. 2407.02098 null
2024-07-02 Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning Chengchao Shen et.al. 2407.02014 link
2024-07-02 Adaptive Modality Balanced Online Knowledge Distillation for Brain-Eye-Computer based Dim Object Detection Zixing Li et.al. 2407.01894 link
2024-07-01 Scarecrow monitoring system:employing mobilenet ssd for enhanced animal supervision Balaji VS et.al. 2407.01435 null
2024-07-01 Formal Verification of Object Detection Avraham Raviv et.al. 2407.01295 null
2024-07-01 Cross-Architecture Auxiliary Feature Space Translation for Efficient Few-Shot Personalized Object Detection Francesco Barbato et.al. 2407.01193 null
2024-07-01 Eliminating Position Bias of Language Models: A Mechanistic Approach Ziqi Wang et.al. 2407.01100 null
2024-07-01 No More Potentially Dynamic Objects: Static Point Cloud Map Generation based on 3D Object Detection and Ground Projection Soojin Woo et.al. 2407.01073 null
2024-06-28 Detecting Subtle Differences between Human and Model Languages Using Spectrum of Relative Likelihood Yang Xu et.al. 2406.19874 link
2024-07-01 Mobile Robot Oriented Large-Scale Indoor Dataset for Dynamic Scene Understanding Yifan Tang et.al. 2406.19791 null
2024-06-28 Basketball-SORT: An Association Method for Complex Multi-object Occlusion Problems in Basketball Multi-object Tracking Qingrui Hu et.al. 2406.19655 null
2024-06-27 Robustness Testing of Black-Box Models Against CT Degradation Through Test-Time Augmentation Jack Highton et.al. 2406.19557 null
2024-06-27 BOrg: A Brain Organoid-Based Mitosis Dataset for Automatic Analysis of Brain Diseases Muhammad Awais et.al. 2406.19556 link
2024-06-27 Weighted Circle Fusion: Ensembling Circle Representation from Different Object Detection Results Jialin Yue et.al. 2406.19540 null
2024-06-27 Stereo Vision Based Robot for Remote Monitoring with VR Support Mohamed Fazil M. S. et.al. 2406.19498 null
2024-06-27 HUWSOD: Holistic Self-training for Unified Weakly Supervised Object Detection Liujuan Cao et.al. 2406.19394 link
2024-06-27 STAL3D: Unsupervised Domain Adaptation for 3D Object Detection via Collaborating Self-Training and Adversarial Learning Yanan Zhang et.al. 2406.19362 null
2024-06-27 Towards Reducing Data Acquisition and Labeling for Defect Detection using Simulated Data Lukas Malte Kemeter et.al. 2406.19175 null
2024-06-27 FDLite: A Single Stage Lightweight Face Detector Network Yogesh Aggarwal et.al. 2406.19107 null
2024-06-27 Segment Anything Model for automated image data annotation: empirical studies using text prompts from Grounding DINO Fuseini Mumuni et.al. 2406.19057 null
2024-06-27 BiCo-Fusion: Bidirectional Complementary LiDAR-Camera Fusion for Semantic- and Spatial-Aware 3D Object Detection Yang Song et.al. 2406.19048 null
2024-06-27 A Universal Railway Obstacle Detection System based on Semi-supervised Segmentation And Optical Flow Qiushi Guo et.al. 2406.18908 null
2024-06-26 SpY: A Context-Based Approach to Spacecraft Component Detection Trupti Mahendrakar et.al. 2406.18709 null
2024-06-26 Unveiling the Unknown: Conditional Evidence Decoupling for Unknown Rejection Zhaowei Wu et.al. 2406.18443 link
2024-06-26 Detecting Machine-Generated Texts: Not Just "AI vs Humans" and Explainability is Complicated Jiazhou Ji et.al. 2406.18259 null
2024-06-26 CTS: Sim-to-Real Unsupervised Domain Adaptation on 3D Detection Meiying Zhang et.al. 2406.18129 null
2024-06-26 The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval Meinardus Boris et.al. 2406.18113 link
2024-06-25 Unmasking the Imposters: In-Domain Detection of Human vs. Machine-Generated Tweets Bryan E. Tuck et.al. 2406.17967 null
2024-06-25 ET tu, CLIP? Addressing Common Object Errors for Unseen Environments Ye Won Byun et.al. 2406.17876 null
2024-06-25 MDHA: Multi-Scale Deformable Transformer with Hybrid Anchors for Multi-View 3D Object Detection Michelle Adeline et.al. 2406.17654 link
2024-06-25 Embedded event based object detection with spiking neural network Jonathan Courtois et.al. 2406.17617 null
2024-06-27 Towards Open-set Camera 3D Object Detection Zhuolin He et.al. 2406.17297 null
2024-06-25 Exploring Test-Time Adaptation for Object Detection in Continually Changing Environments Shilei Cao et.al. 2406.16439 null
2024-06-24 Artistic-style text detector and a new Movie-Poster dataset Aoxiang Ning et.al. 2406.16307 null
2024-06-24 Investigating the Influence of Prompt-Specific Shortcuts in AI Generated Text Detection Choonghyun Park et.al. 2406.16275 null
2024-06-23 Review of Zero-Shot and Few-Shot AI Algorithms in The Medical Domain Maged Badawi et.al. 2406.16143 null
2024-06-22 Understanding Student and Academic Staff Perceptions of AI Use in Assessment and Feedback Jasper Roe et.al. 2406.15808 null
2024-06-22 Smart Feature is What You Need Zhaoxin Hu et.al. 2406.15805 link
2024-06-22 MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception Guanqun Wang et.al. 2406.15768 null
2024-06-21 Towards Robust Training Datasets for Machine Learning with Ontologies: A Case Study for Emergency Road Vehicle Detection Lynn Vonderhaar et.al. 2406.15268 null
2024-06-21 DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object Detection Jia Syuen Lim et.al. 2406.14924 null
2024-06-21 MOS: Model Synergy for Test-Time Adaptation on LiDAR-Based 3D Object Detection Zhuoxiao Chen et.al. 2406.14878 null
2024-06-20 Visible-Thermal Tiny Object Detection: A Benchmark Dataset and Baselines Xinyi Ying et.al. 2406.14482 link
2024-06-20 Enhanced Bank Check Security: Introducing a Novel Dataset and Transformer-Based Approach for Detection and Verification Muhammad Saif Ullah Khan et.al. 2406.14370 link
2024-06-20 HoTPP Benchmark: Are We Good at the Long Horizon Events Forecasting? Ivan Karpukhin et.al. 2406.14341 link
2024-06-20 LeYOLO, New Scalable and Efficient CNN Architecture for Object Detection Lilian Hollard et.al. 2406.14239 link
2024-06-20 SSAD: Self-supervised Auxiliary Detection Framework for Panoramic X-ray based Dental Disease Diagnosis Zijian Cai et.al. 2406.13963 link
2024-06-20 Towards the in-situ Trunk Identification and Length Measurement of Sea Cucumbers via Bézier Curve Modelling Shuaixin Liu et.al. 2406.13951 link
2024-06-19 DPO: Dual-Perturbation Optimization for Test-time Adaptation in 3D Object Detection Zhuoxiao Chen et.al. 2406.13891 link
2024-06-19 Semantic Enhanced Few-shot Object Detection Zheng Wang et.al. 2406.13498 null
2024-06-19 Snowy Scenes,Clear Detections: A Robust Model for Traffic Light Detection in Adverse Weather Conditions Shivank Garg et.al. 2406.13473 link
2024-06-19 Strengthening Layer Interaction via Dynamic Layer Attention Kaishen Wang et.al. 2406.13392 link
2024-06-18 Privacy Preserving Federated Learning in Medical Imaging with Uncertainty Estimation Nikolas Koutsoubis et.al. 2406.12815 link
2024-06-18 Online Anchor-based Training for Image Classification Tasks Maria Tzelepi et.al. 2406.12662 null
2024-06-18 Applying Ensemble Methods to Model-Agnostic Machine-Generated Text Detection Ivan Ong et.al. 2406.12570 null
2024-06-18 MultiSocial: Multilingual Benchmark of Machine-Generated Text Detection of Social-Media Texts Dominik Macko et.al. 2406.12549 null
2024-06-18 ViDSOD-100: A New Dataset and a Baseline Model for RGB-D Video Salient Object Detection Junhao Lin et.al. 2406.12536 link
2024-06-18 SDNIA-YOLO: A Robust Object Detection Model for Extreme Weather Conditions Yuexiong Ding et.al. 2406.12395 null
2024-06-18 Competitive Learning for Achieving Content-specific Filters in Video Coding for Machines Honglei Zhang et.al. 2406.12367 null
2024-06-18 Certified ML Object Detection for Surveillance Missions Mohammed Belcaid et.al. 2406.12362 null
2024-06-18 DASSF: Dynamic-Attention Scale-Sequence Fusion for Aerial Object Detection Haodong Li et.al. 2406.12285 null
2024-06-18 The Solution for CVPR2024 Foundational Few-Shot Object Detection Challenge Hongpeng Pan et.al. 2406.12225 null
2024-06-17 V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results Jiaqi Wang et.al. 2406.11739 null
2024-06-17 YOLO-FEDER FusionNet: A Novel Deep Learning Architecture for Drone Detection Tamara R. Lenhard et.al. 2406.11641 null
2024-06-17 Low-power Ship Detection in Satellite Images Using Neuromorphic Hardware Gregor Lenz et.al. 2406.11319 null
2024-06-17 Semi-Supervised Domain Adaptation Using Target-Oriented Domain Augmentation for 3D Object Detection Yecheol Kim et.al. 2406.11313 link
2024-06-17 Syn-to-Real Unsupervised Domain Adaptation for Indoor 3D Object Detection Yunsong Wang et.al. 2406.11311 null
2024-06-17 Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding Yunsong Wang et.al. 2406.11283 null
2024-06-17 YOLO9tr: A Lightweight Model for Pavement Damage Detection Utilizing a Generalized Efficient Layer Aggregation Network and Attention Mechanism Sompote Youwai et.al. 2406.11254 link
2024-06-16 GANmut: Generating and Modifying Facial Expressions Maria Surani et.al. 2406.11079 null
2024-06-16 Exploring the Limitations of Detecting Machine-Generated Text Jad Doughman et.al. 2406.11073 null
2024-06-16 Open-Vocabulary X-ray Prohibited Item Detection via Fine-tuning CLIP Shuyang Lin et.al. 2406.10961 null
2024-06-14 EFM3D: A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models Julian Straub et.al. 2406.10224 null
2024-06-14 YOLOv1 to YOLOv10: A comprehensive review of YOLO variants and their application in the agricultural domain Mujadded Al Rabbani Alif et.al. 2406.10139 null
2024-06-14 Shelf-Supervised Multi-Modal Pre-Training for 3D Object Detection Mehar Khurana et.al. 2406.10115 null
2024-06-14 Automated GIS-Based Framework for Detecting Crosswalk Changes from Bi-Temporal High-Resolution Aerial Images Richard Boadu Antwi et.al. 2406.09731 null
2024-06-14 An alternate approach for estimating grain-growth kinetics Manoj Prabakar et.al. 2406.09653 null
2024-06-13 Scene Graph Generation in Large-Size VHR Satellite Imagery: A Large-Scale Dataset and A Context-Aware Approach Yansheng Li et.al. 2406.09410 link
2024-06-13 Towards Evaluating the Robustness of Visual State Space Models Hashmat Shadab Malik et.al. 2406.09407 link
2024-06-13 Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models Yushi Hu et.al. 2406.09403 null
2024-06-13 Enhanced Object Detection: A Study on Vast Vocabulary Object Detection Track for V3Det Challenge 2024 Peixi Wu et.al. 2406.09201 null
2024-06-13 Navigating the Shadows: Unveiling Effective Disturbances for Modern AI Content Detectors Ying Zhou et.al. 2406.08922 link
2024-06-13 Computer vision-based model for detecting turning lane features on Florida's public roadways Richard Boadu Antwi et.al. 2406.08822 null
2024-06-13 BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object Detection Wenjie Wang et.al. 2406.08785 null
2024-06-12 UnO: Unsupervised Occupancy Fields for Perception and Forecasting Ben Agro et.al. 2406.08691 null
2024-06-12 Transformation-Dependent Adversarial Attacks Yaoteng Tan et.al. 2406.08443 null
2024-06-12 Dataset Enhancement with Instance-Level Augmentations Orest Kupyn et.al. 2406.08249 link
2024-06-12 Chemistry3D: Robotic Interaction Benchmark for Chemistry Experiments Shoujie Li et.al. 2406.08160 null
2024-06-12 CT3D++: Improving 3D Object Detection with Keypoint-induced Channel-wise Transformer Hualian Sheng et.al. 2406.08152 null
2024-06-12 MWIRSTD: A MWIR Small Target Detection Dataset Nikhil Kumar et.al. 2406.08063 link
2024-06-12 Sense Less, Generate More: Pre-training LiDAR Perception with Masked Autoencoders for Ultra-Efficient 3D Sensing Sina Tayebati et.al. 2406.07833 null
2024-06-11 A Deep Learning Approach to Detect Complete Safety Equipment For Construction Workers Based On YOLOv7 Md. Shariful Islam et.al. 2406.07707 null
2024-06-11 Transforming a rare event search into a not-so-rare event search in real-time with deep learning-based object detection J. Schueler et.al. 2406.07538 null
2024-06-11 Understanding Visual Concepts Across Models Brandon Trabucco et.al. 2406.07506 link
2024-06-11 Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach Challapalli Phanindra Revanth et.al. 2406.07332 null
2024-06-11 Unsupervised Object Detection with Theoretical Guarantees Marian Longa et.al. 2406.07284 null
2024-06-11 Advancing Grounded Multimodal Named Entity Recognition via LLM-Based Reformulation and Box-Based Segmentation Jinyuan Li et.al. 2406.07268 null
2024-06-11 EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy Network Yining Shi et.al. 2406.07042 link
2024-06-11 RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks Zhechao Wang et.al. 2406.07032 null
2024-06-12 LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection Jiahua Xu et.al. 2406.07023 null
2024-06-11 Teaching with Uncertainty: Unleashing the Potential of Knowledge Distillation in Object Detection Junfei Yi et.al. 2406.06999 null
2024-06-10 UnSupDLA: Towards Unsupervised Document Layout Analysis Talha Uddin Sheikh et.al. 2406.06236 null
2024-06-10 UEMM-Air: A Synthetic Multi-modal Dataset for Unmanned Aerial Vehicle Object Detection Fan Liu et.al. 2406.06230 link
2024-06-10 ReCon1M:A Large-scale Benchmark Dataset for Relation Comprehension in Remote Sensing Imagery Xian Sun et.al. 2406.06028 null
2024-06-10 Solution for SMART-101 Challenge of CVPR Multi-modal Algorithmic Reasoning Task 2024 Jinwoo Ahn et.al. 2406.05963 null
2024-06-10 Open-Vocabulary Part-Based Grasping Tjeard van Oort et.al. 2406.05951 null
2024-06-09 Stealthy Targeted Backdoor Attacks against Image Captioning Wenshu Fan et.al. 2406.05874 null
2024-06-09 Scaling Graph Convolutions for Mobile Vision William Avery et.al. 2406.05850 link
2024-06-09 Mamba YOLO: SSMs-Based YOLO For Object Detection Zeyu Wang et.al. 2406.05835 link
2024-06-09 ControlLoc: Physical-World Hijacking Attack on Visual Perception in Autonomous Driving Chen Ma et.al. 2406.05810 null
2024-06-09 SAM-PM: Enhancing Video Camouflaged Object Detection using Spatio-Temporal Attention Muhammad Nawfal Meeran et.al. 2406.05802 link
2024-06-07 Nacala-Roof-Material: Drone Imagery for Roof Detection, Classification, and Segmentation to Support Mosquito-borne Disease Risk Assessment Venkanna Babu Guthula et.al. 2406.04949 null
2024-06-07 EGOR: Efficient Generated Objects Replay for incremental object detection Zijia An et.al. 2406.04829 null
2024-06-07 UCDNet: Multi-UAV Collaborative 3D Object Detection Network by Reliable Feature Mapping Pengju Tian et.al. 2406.04648 null
2024-06-07 UVCPNet: A UAV-Vehicle Collaborative Perception Network for 3D Object Detection Yuchao Wang et.al. 2406.04647 null
2024-06-06 CORU: Comprehensive Post-OCR Parsing and Receipt Understanding Dataset Abdelrahman Abdallah et.al. 2406.04493 link
2024-06-06 DeTra: A Unified Model for Object Detection and Trajectory Forecasting Sergio Casas et.al. 2406.04426 null
2024-06-06 Parameter-Inverted Image Pyramid Networks Xizhou Zhu et.al. 2406.04330 link
2024-06-06 LenslessFace: An End-to-End Optimized Lensless System for Privacy-Preserving Face Verification Xin Cai et.al. 2406.04129 null
2024-06-06 Semmeldetector: Application of Machine Learning in Commercial Bakeries Thomas H. Schmitt et.al. 2406.04050 null
2024-06-06 Frequency-based Matcher for Long-tailed Semantic Segmentation Shan Li et.al. 2406.03917 link
2024-06-06 Instance Segmentation and Teeth Classification in Panoramic X-rays Devichand Budagam et.al. 2406.03747 link
2024-06-05 FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles Cyprien Quéméneur et.al. 2406.03611 link
2024-06-05 LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection Qiang Chen et.al. 2406.03459 link
2024-06-05 Global Clipper: Enhancing Safety and Reliability of Transformer-based Object Detection Models Qutub Syed Sha et.al. 2406.03229 null
2024-06-05 Situation Monitor: Diversity-Driven Zero-Shot Out-of-Distribution Detection using Budding Ensemble Architecture for Object Detection Qutub Syed et.al. 2406.03188 null
2024-06-05 Enhanced Automotive Object Detection via RGB-D Fusion in a DiffusionDet Framework Eliraz Orfaig et.al. 2406.03129 null
2024-06-04 Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation Mohamed El Amine Boudjoghra et.al. 2406.02548 link
2024-06-04 SatSplatYOLO: 3D Gaussian Splatting-based Virtual Object Detection Ensembles for Satellite Feature Recognition Van Minh Nguyen et.al. 2406.02533 null
2024-06-04 GrootVL: Tree Topology is All You Need in State Space Model Yicheng Xiao et.al. 2406.02395 link
2024-06-04 Low-Rank Adaption on Transformer-based Oriented Object Detector for Satellite Onboard Processing of Remote Sensing Images Xinyang Pu et.al. 2406.02385 link
2024-06-04 Radar Spectra-Language Model for Automotive Scene Parsing Mariia Pushkareva et.al. 2406.02158 null
2024-06-04 Detecting Endangered Marine Species in Autonomous Underwater Vehicle Imagery Using Point Annotations and Few-Shot Learning Heather Doig et.al. 2406.01932 null
2024-06-04 GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer Ding Jia et.al. 2406.01210 link
2024-06-03 Learning Adaptive Fusion Bank for Multi-modal Salient Object Detection Kunpeng Wang et.al. 2406.01127 link
2024-06-03 Visual Car Brand Classification by Implementing a Synthetic Image Dataset Creation Pipeline Jan Lippemeier et.al. 2406.01071 null
2024-06-03 Multi-Object Tracking based on Imaging Radar 3D Object Detection Patrick Palmer et.al. 2406.01011 null
2024-05-31 Power of Cooperative Supervision: Multiple Teachers Framework for Enhanced 3D Semi-Supervised Object Detection Jin-Hee Lee et.al. 2405.20720 link
2024-05-30 On Calibration of Object Detectors: Pitfalls, Evaluation and Baselines Selim Kuzucu et.al. 2405.20459 null
2024-05-30 RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection Fangyi Chen et.al. 2405.19854 null
2024-05-30 Improving Object Detector Training on Synthetic Data by Starting With a Strong Baseline Methodology Frank A. Ruis et.al. 2405.19822 null
2024-05-30 Towards Unified Multi-granularity Text Detection with Interactive Attention Xingyu Wan et.al. 2405.19765 null
2024-05-30 Fully Test-Time Adaptation for Monocular 3D Object Detection Hongbin Lin et.al. 2405.19682 null
2024-05-30 YotoR-You Only Transform One Representation José Ignacio Díaz Villa et.al. 2405.19629 null
2024-05-29 Enabling Visual Recognition at Radio Frequency Haowen Lai et.al. 2405.19516 null
2024-05-29 Model Agnostic Defense against Adversarial Patch Attacks on Object Detection in Unmanned Aerial Vehicles Saurabh Pathak et.al. 2405.19179 null
2024-05-29 RGB-T Object Detection via Group Shuffled Multi-receptive Attention and Multi-modal Supervision Jinzhong Wang et.al. 2405.18955 null
2024-05-29 SSGA-Net: Stepwise Spatial Global-local Aggregation Networks for for Autonomous Driving Yiming Cui et.al. 2405.18857 null
2024-05-29 PillarHist: A Quantization-aware Pillar Feature Encoder based on Height-aware Histogram Sifan Zhou et.al. 2405.18734 null
2024-05-28 A Review and Implementation of Object Detection Models and Optimizations for Real-time Medical Mask Detection during the COVID-19 Pandemic Ioanna Gogou et.al. 2405.18387 link
2024-05-28 Is a 3D-Tokenized LLM the Key to Reliable Autonomous Driving? Yifan Bai et.al. 2405.18361 null
2024-05-28 Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention Weitai Kang et.al. 2405.18295 null
2024-05-28 DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture Shentong Mo et.al. 2405.17995 null
2024-05-28 Transformer and Hybrid Deep Learning Based Models for Machine-Generated Text Detection Teodor-George Marchitan et.al. 2405.17964 null
2024-05-28 Self-supervised Pre-training for Transferable Multi-modal Perception Xiaohao Xu et.al. 2405.17942 null
2024-05-28 Boosting General Trimap-free Matting in the Real-World Image Leo Shan Wenzhang Zhou Grace Zhao et.al. 2405.17916 null
2024-05-28 The Binary Quantized Neural Network for Dense Prediction via Specially Designed Upsampling and Attention Xingyu Ding et.al. 2405.17776 null
2024-05-27 Understanding differences in applying DETR to natural and medical images Yanqi Xu et.al. 2405.17677 null
2024-05-27 Hardness-Aware Scene Synthesis for Semi-Supervised 3D Object Detection Shuai Zeng et.al. 2405.17422 link
2024-05-27 Tracking Small Birds by Detection Candidate Region Filtering and Detection History-aware Association Tingwei Liu et.al. 2405.17323 null
2024-05-27 Enhanced Automotive Radar Collaborative Sensing By Exploiting Constructive Interference Lifan Xu et.al. 2405.17297 null
2024-05-27 SCaRL- A Synthetic Multi-Modal Dataset for Autonomous Driving Avinash Nittur Ramesh et.al. 2405.17030 null
2024-05-27 Collective Perception Datasets for Autonomous Driving: A Comprehensive Review Sven Teufel et.al. 2405.16973 null
2024-05-27 OED: Towards One-stage End-to-End Dynamic Scene Graph Generation Guan Wang et.al. 2405.16925 link
2024-05-27 ContrastAlign: Toward Robust BEV Feature Alignment via Contrastive Learning for Multi-Modal 3D Object Detection Ziying Song et.al. 2405.16873 null
2024-05-27 A re-calibration method for object detection with multi-modal alignment bias in autonomous driving Zhihang Song et.al. 2405.16848 null
2024-05-26 A Study on Unsupervised Anomaly Detection and Defect Localization using Generative Model in Ultrasonic Non-Destructive Testing Yusaku Ando et.al. 2405.16580 null
2024-05-26 AI-Generated Text Detection and Classification Based on BERT Deep Learning Algorithm Hao Wang et.al. 2405.16422 null
2024-05-24 UNION: Unsupervised 3D Object Detection using Object Appearance-based Pseudo-Classes Ted Lentsch et.al. 2405.15688 null
2024-05-24 Multimodal Object Detection via Probabilistic a priori Information Integration Hafsa El Hafyani et.al. 2405.15596 null
2024-05-24 Scale-Invariant Feature Disentanglement via Adversarial Learning for UAV-based Object Detection Fan Liu et.al. 2405.15465 null
2024-05-24 Leveraging knowledge distillation for partial multi-task learning from multiple remote sensing datasets Hoàng-Ân Lê et.al. 2405.15394 null
2024-05-24 Towards Global Optimal Visual In-Context Learning Prompt Selection Chengming Xu et.al. 2405.15279 null
2024-05-24 Unbiased Faster R-CNN for Single-source Domain Generalized Object Detection Yajing Liu et.al. 2405.15225 null
2024-05-24 ODGEN: Domain-specific Object Detection Data Generation with Diffusion Models Jingyuan Zhu et.al. 2405.15199 null
2024-05-24 MonoDETRNext: Next-generation Accurate and Efficient Monocular 3D Object Detection Method Pan Liao et.al. 2405.15176 null
2024-05-23 Learning to Detect and Segment Mobile Objects from Unlabeled Videos Yihong Sun et.al. 2405.14841 null
2024-05-23 Designing A Sustainable Marine Debris Clean-up Framework without Human Labels Raymond Wang et.al. 2405.14815 null
2024-05-23 Drones Help Drones: A Collaborative Framework for Multi-Drone Object Trajectory Prediction and Beyond Zhechao Wang et.al. 2405.14674 null
2024-05-23 Improving Single Domain-Generalized Object Detection: A Focus on Diversification and Alignment Muhammad Sohail Danish et.al. 2405.14497 null
2024-05-23 YOLOv10: Real-Time End-to-End Object Detection Ao Wang et.al. 2405.14458 link
2024-05-23 Harmony: A Joint Self-Supervised and Weakly-Supervised Framework for Learning General Purpose Visual Representations Mohammed Baharoon et.al. 2405.14239 null
2024-05-22 Two Heads are Better Than One: Neural Networks Quantization with 2D Hilbert Curve-based Output Representation Mykhailo Uss et.al. 2405.14024 null
2024-05-22 TS40K: a 3D Point Cloud Dataset of Rural Terrain and Electrical Transmission System Diogo Lavado et.al. 2405.13989 null
2024-05-22 Class-Conditional self-reward mechanism for improved Text-to-Image models Safouane El Ghazouali et.al. 2405.13473 link
2024-05-22 Adaptive Wireless Image Semantic Transmission and Over-The-Air Testing Jiarun Ding et.al. 2405.13403 null
2024-05-21 BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once Theodore Zhao et.al. 2405.12971 null
2024-05-21 AMFD: Distillation via Adaptive Multimodal Fusion for Multispectral Pedestrian Detection Zizhao Chen et.al. 2405.12944 link
2024-05-21 Predicting the Influence of Adverse Weather on Pedestrian Detection with Automotive Radar and Lidar Sensors Daniel Weihmayr et.al. 2405.12736 null
2024-05-21 Spotting AI's Touch: Identifying LLM-Paraphrased Spans in Text Yafu Li et.al. 2405.12689 null
2024-05-21 Automating Attendance Management in Human Resources: A Design Science Approach Using Computer Vision and Facial Recognition Bao-Thien Nguyen-Tat et.al. 2405.12633 null
2024-05-21 FFAM: Feature Factorization Activation Map for Explanation of 3D Detectors Shuai Liu et.al. 2405.12601 link
2024-05-21 Dataset and Benchmark for Urdu Natural Scenes Text Detection, Recognition and Visual Question Answering Hiba Maryam et.al. 2405.12533 null
2024-05-21 Active Object Detection with Knowledge Aggregation and Distillation from Large Models Dejie Yang et.al. 2405.12509 null
2024-05-21 Mutual Information Analysis in Multimodal Learning Systems Hadi Hadizadeh et.al. 2405.12456 null
2024-05-20 Multi-View Attentive Contextualization for Multi-View 3D Object Detection Xianpeng Liu et.al. 2405.12200 null
2024-05-20 Bangladeshi Native Vehicle Detection in Wild Bipin Saha et.al. 2405.12150 link
2024-05-20 Salience-guided Ground Factor for Robust Localization of Delivery Robots in Complex Urban Environments Jooyong Park et.al. 2405.11855 null
2024-05-20 DATR: Unsupervised Domain Adaptive Detection Transformer with Dataset-Level Adaptation and Prototypical Alignment Jianhong Han et.al. 2405.11765 link
2024-05-20 Versatile Teacher: A Class-aware Teacher-student Framework for Cross-domain Adaptation Runou Yang et.al. 2405.11754 link
2024-05-19 FADet: A Multi-sensor 3D Object Detection Network based on Local Featured Attention Ziang Guo et.al. 2405.11682 link
2024-05-19 SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization Jialong Guo et.al. 2405.11582 link
2024-05-19 The First Swahili Language Scene Text Detection and Recognition Dataset Fadila Wendigoundi Douamba et.al. 2405.11437 link
2024-05-18 InfRS: Incremental Few-Shot Object Detection in Remote Sensing Images Wuzhou Li et.al. 2405.11293 null
2024-05-18 Visible and Clear: Finding Tiny Objects in Difference Map Bing Cao et.al. 2405.11276 null
2024-05-17 A Versatile Framework for Analyzing Galaxy Image Data by Implanting Human-in-the-loop on a Large Vision Model Mingxiang Fu et.al. 2405.10890 null
2024-05-17 DeepPavlov at SemEval-2024 Task 8: Leveraging Transfer Learning for Detecting Boundaries of Machine-Generated Texts Anastasia Voznyuk et.al. 2405.10629 link
2024-05-17 DuoSpaceNet: Leveraging Both Bird's-Eye-View and Perspective View Representations for 3D Object Detection Zhe Huang et.al. 2405.10577 null
2024-05-16 Drone-type-Set: Drone types detection benchmark for drone detection and tracking Kholoud AlDosari et.al. 2405.10398 null
2024-05-16 Grounded 3D-LLM with Referent Tokens Yilun Chen et.al. 2405.10370 null
2024-05-16 Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection Tianhe Ren et.al. 2405.10300 link
2024-05-16 Towards Task-Compatible Compressible Representations Anderson de Andrade et.al. 2405.10244 link
2024-05-16 SpecDETR: A Transformer-based Hyperspectral Point Object Detection Network Zhaoxu Li et.al. 2405.10148 null
2024-05-16 SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection Mingxuan Liu et.al. 2405.10053 null
2024-05-16 FPDIoU Loss: A Loss Function for Efficient Bounding Box Regression of Rotated Object Detection Siliang Ma et.al. 2405.09942 null
2024-05-16 Infrared Adversarial Car Stickers Xiaopei Zhu et.al. 2405.09924 null
2024-05-16 PillarNeXt: Improving the 3D detector by introducing Voxel2Pillar feature encoding and extracting multi-scale features Xusheng Li et.al. 2405.09828 null
2024-05-16 Size-invariance Matters: Rethinking Metrics and Losses for Imbalanced Multi-object Salient Object Detection Feiran Li et.al. 2405.09782 link
2024-05-15 Synth-to-Real Unsupervised Domain Adaptation for Instance Segmentation Guo Yachan et.al. 2405.09682 null
2024-05-15 Dynamic Loss Decay based Robust Oriented Object Detection on Remote Sensing Images with Noisy Labels Guozhang Liu et.al. 2405.09024 null
2024-05-14 CLIP with Quality Captions: A Strong Pretraining for Vision Tasks Pavan Kumar Anasosalu Vasu et.al. 2405.08911 null
2024-05-14 Open-Vocabulary Object Detection via Neighboring Region Attention Alignment Sunyuan Qiang et.al. 2405.08593 null
2024-05-14 Semantic Contextualization of Face Forgery: A New Definition, Dataset, and Detection Method Mian Zou et.al. 2405.08487 null
2024-05-14 RDPN6D: Residual-based Dense Point-wise Network for 6Dof Object Pose Estimation Based on RGB-D Images Zong-Wei Hong et.al. 2405.08483 link
2024-05-14 Multimodal Collaboration Networks for Geospatial Vehicle Detection in Dense, Occluded, and Large-Scale Events Xin Wu et.al. 2405.08251 link
2024-05-13 RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors Liam Dugan et.al. 2405.07940 null
2024-05-13 oTTC: Object Time-to-Contact for Motion Estimation in Autonomous Driving Abdul Hannan Khan et.al. 2405.07698 null
2024-05-13 MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders Xueying Jiang et.al. 2405.07696 null
2024-05-13 Quality-aware Selective Fusion Network for V-D-T Salient Object Detection Liuxin Bao et.al. 2405.07655 link
2024-05-13 Fast Training Data Acquisition for Object Detection and Segmentation using Black Screen Luminance Keying Thomas Pöllabauer et.al. 2405.07653 null
2024-05-13 Integrity Monitoring of 3D Object Detection in Automated Driving Systems using Raw Activation Patterns and Spatial Filtering Hakan Yekta Yatbaz et.al. 2405.07600 null
2024-05-13 Environmental Matching Attack Against Unmanned Aerial Vehicles Object Detection Dehong Kong et.al. 2405.07595 null
2024-05-13 Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis Tianci Bi et.al. 2405.07481 null
2024-05-13 Enhancing 3D Object Detection by Using Neural Network with Self-adaptive Thresholding Houze Liu et.al. 2405.07479 null
2024-05-12 MAML MOT: Multiple Object Tracking based on Meta-Learning Jiayi Chen et.al. 2405.07272 null
2024-05-10 How to Augment for Atmospheric Turbulence Effects on Thermal Adapted Object Detection Models? Engin Uzun et.al. 2405.06383 null
2024-05-10 Precise Apple Detection and Localization in Orchards using YOLOv5 for Robotic Harvesting Systems Jiang Ziyue et.al. 2405.06260 null
2024-05-09 CSA-Net: Channel-wise Spatially Autocorrelated Attention Networks Nick et.al. 2405.05755 null
2024-05-09 Depth Awakens: A Depth-perceptual Attention Fusion Network for RGB-D Camouflaged Object Detection Xinran Liua et.al. 2405.05614 null
2024-05-09 The object detection model uses combined extraction with KNN and RF classification Florentina Tatrin Kurniati et.al. 2405.05551 null
2024-05-08 Reviewing Intelligent Cinematography: AI research for camera-based video production Adrian Azzarelli et.al. 2405.05039 null
2024-05-07 A Novel Wide-Area Multiobject Detection System with High-Probability Region Searching Xianlei Long et.al. 2405.04589 null
2024-05-07 DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving Chen Min et.al. 2405.04390 null
2024-05-07 A New Dataset and Comparative Study for Aphid Cluster Detection and Segmentation in Sorghum Fields Raiyan Rahman et.al. 2405.04305 null
2024-05-07 ViewFormer: Exploring Spatiotemporal Modeling for Multi-View 3D Occupancy Perception via View-Guided Transformers Jinke Li et.al. 2405.04299 null
2024-05-07 Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore Junchao Wu et.al. 2405.04286 null
2024-05-07 Deep Event-based Object Detection in Autonomous Driving: A Survey Bingquan Zhou et.al. 2405.03995 null
2024-05-06 BadFusion: 2D-Oriented Backdoor Attacks against 3D Object Detection Saket S. Chaturvedi et.al. 2405.03884 null
2024-05-06 RepVGG-GELAN: Enhanced GELAN with VGG-STYLE ConvNets for Brain Tumour Detection Thennarasi Balakrishnan et.al. 2405.03541 link
2024-05-06 Low-light Object Detection Pengpeng Li et.al. 2405.03519 null
2024-05-06 Salient Object Detection From Arbitrary Modalities Nianchang Huang et.al. 2405.03352 null
2024-05-06 Modality Prompts for Arbitrary Modality Salient Object Detection Nianchang Huang et.al. 2405.03351 null
2024-05-06 Vietnamese AI Generated Text Detection Quang-Dan Tran et.al. 2405.03206 null
2024-05-06 PTQ4SAM: Post-Training Quantization for Segment Anything Chengtao Lv et.al. 2405.03144 link
2024-05-05 Performance Evaluation of Real-Time Object Detection for Electric Scooters Dong Chen et.al. 2405.03039 link
2024-05-05 SalFAU-Net: Saliency Fusion Attention U-Net for Salient Object Detection Kassaw Abraham Mulat et.al. 2405.02906 null
2024-05-07 Adaptive Guidance Learning for Camouflaged Object Detection Zhennan Chen et.al. 2405.02824 null
2024-05-05 PVTransformer: Point-to-Voxel Transformer for Scalable 3D Object Detection Zhaoqi Leng et.al. 2405.02811 null
2024-05-02 Segmentation-Free Outcome Prediction in Head and Neck Cancer: Deep Learning-based Feature Extraction from Multi-Angle Maximum Intensity Projections (MA-MIPs) of PET Images Amirhosein Toosi et.al. 2405.01756 null
2024-05-02 PointCompress3D -- A Point Cloud Compression Framework for Roadside LiDARs in Intelligent Transportation Systems Walter Zimmer et.al. 2405.01750 null
2024-05-02 Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A Survey Guoping Xu et.al. 2405.01725 link
2024-05-02 SOAR: Advancements in Small Body Object Detection for Aerial Imagery Using State Space Models and Programmable Gradients Tushar Verma et.al. 2405.01699 null
2024-05-02 Imagine the Unseen: Occluded Pedestrian Detection via Adversarial Feature Completion Shanshan Zhang et.al. 2405.01311 null
2024-05-02 Overcoming LLM Challenges using RAG-Driven Precision in Coffee Leaf Disease Remediation Dr. Selva Kumar S et.al. 2405.01310 null
2024-05-02 Towards Consistent Object Detection via LiDAR-Camera Synergy Kai Luo et.al. 2405.01258 link
2024-05-02 Federated Learning with Heterogeneous Data Handling for Robust Vehicular Object Detection Ahmad Khalil et.al. 2405.01108 null
2024-05-01 Grains of Saliency: Optimizing Saliency-based Training of Biometric Attack Detection Models Colton R. Crum et.al. 2405.00650 null
2024-05-01 Object detection under the linear subspace model with application to cryo-EM images Amitay Eldar et.al. 2405.00364 null
2024-04-30 Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation Yunhao Ge et.al. 2404.19752 null
2024-04-30 Quantifying Nematodes through Images: Datasets, Models, and Baselines of Deep Learning Zhipeng Yuan et.al. 2404.19748 null
2024-04-30 Masked Multi-Query Slot Attention for Unsupervised Object Discovery Rishav Pramanik et.al. 2404.19654 link
2024-04-30 Physical Backdoor: Towards Temperature-based Backdoor Attacks in the Physical World Wen Yin et.al. 2404.19417 null
2024-04-30 UniFS: Universal Few-shot Instance Perception with Point Representations Sheng Jin et.al. 2404.19401 null
2024-04-30 Pseudo Label Refinery for Unsupervised Domain Adaptation on Cross-dataset 3D Object Detection Zhanwei Zhang et.al. 2404.19384 null
2024-04-30 Robust Pedestrian Detection via Constructing Versatile Pedestrian Knowledge Bank Sungjune Park et.al. 2404.19299 null
2024-04-29 MiPa: Mixed Patch Infrared-Visible Modality Agnostic Object Detection Heitor R. Medeiros et.al. 2404.18849 null
2024-04-29 Leveraging PointNet and PointNet++ for Lyft Point Cloud Classification Challenge Rajat K. Doshi et.al. 2404.18665 null
2024-04-29 CoSense3D: an Agent-based Efficient Learning Framework for Collective Perception Yunshuang Yuan et.al. 2404.18617 null
2024-04-29 Assessing Quality Metrics for Neural Reality Gap Input Mitigation in Autonomous Driving Testing Stefano Carlo Lambertenghi et.al. 2404.18577 null
2024-04-29 Efficient Meta-Learning Enabled Lightweight Multiscale Few-Shot Object Detection in Remote Sensing Images Wenbin Guan et.al. 2404.18426 null
2024-04-29 Multi-modal Perception Dataset of In-water Objects for Autonomous Surface Vehicles Mingi Jeong et.al. 2404.18411 null
2024-04-28 FAD-SAR: A Novel Fishing Activity Detection System via Synthetic Aperture Radar Images Based on Deep Learning Method Yanbing Bai et.al. 2404.18245 null
2024-04-28 RadSimReal: Bridging the Gap Between Synthetic and Real Data in Radar Object Detection With Simulation Oded Bialer et.al. 2404.18150 null
2024-04-27 Reliable Student: Addressing Noise in Semi-Supervised 3D Object Detection Farzad Nozarian et.al. 2404.17910 link
2024-04-27 A Hybrid Approach for Document Layout Analysis in Document images Tahira Shehzadi et.al. 2404.17888 null
2024-04-26 Inhomogeneous illuminated image enhancement under extremely low visibility condition Libang Chen et.al. 2404.17503 null
2024-04-26 Cost-Sensitive Uncertainty-Based Failure Recognition for Object Detection Moussa Kassem Sbeyti et.al. 2404.17427 null
2024-04-26 Enhancing mmWave Radar Point Cloud via Visual-inertial Supervision Cong Fan et.al. 2404.17229 null
2024-04-26 MorphText: Deep Morphology Regularized Arbitrary-shape Scene Text Detection Chengpei Xu et.al. 2404.17151 null
2024-04-25 Generating Minimalist Adversarial Perturbations to Test Object-Detection Models: An Adaptive Multi-Metric Evolutionary Search Approach Cristopher McIntyre-Garcia et.al. 2404.17020 link
2024-04-25 Constellation Dataset: Benchmarking High-Altitude Object Detection for an Urban Intersection Mehmet Kerem Turkcan et.al. 2404.16944 link
2024-04-25 Self-Balanced R-CNN for Instance Segmentation Leonardo Rossi et.al. 2404.16633 link
2024-04-25 Cross-Domain Spatial Matching for Camera and Radar Sensor Data Fusion in Autonomous Vehicle Perception System Daniel Dworak et.al. 2404.16548 null
2024-04-25 Commonsense Prototype for Outdoor Unsupervised 3D Object Detection Hai Wu et.al. 2404.16493 link
2024-04-25 IMWA: Iterative Model Weight Averaging Benefits Class-Imbalanced Learning Tasks Zitong Huang et.al. 2404.16331 null
2024-04-25 CFMW: Cross-modality Fusion Mamba for Multispectral Object Detection under Adverse Weather Conditions Haoyuan Li et.al. 2404.16302 link
2024-04-24 AutoGluon-Multimodal (AutoMM): Supercharging Multimodal AutoML with Foundation Models Zhiqiang Tang et.al. 2404.16233 null
2024-04-24 Observational parameters of Blue Large-Amplitude Pulsators P. Pietrukowicz et.al. 2404.16089 null
2024-04-24 A Survey on Visual Mamba Hanwei Zhang et.al. 2404.15956 null
2024-04-24 Steal Now and Attack Later: Evaluating Robustness of Object Detection against Black-box Adversarial Attacks Erh-Chung Chen et.al. 2404.15881 null
2024-04-24 Revisiting Out-of-Distribution Detection in LiDAR-based 3D Object Detection Michael Kösel et.al. 2404.15879 link
2024-04-23 CFPFormer: Feature-pyramid like Transformer Decoder for Segmentation and Detection Hongyi Cai et.al. 2404.15451 null
2024-04-23 ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning Weifeng Chen et.al. 2404.15449 null
2024-04-23 Source-free Domain Adaptation for Video Object Detection Under Adverse Image Conditions Xingguang Zhang et.al. 2404.15252 null
2024-04-23 Efficient Transformer Encoders for Mask2Former-style models Manyi Yao et.al. 2404.15244 null
2024-04-23 Gallbladder Cancer Detection in Ultrasound Images based on YOLO and Faster R-CNN Sara Dadjouy et.al. 2404.15129 null
2024-04-23 External Prompt Features Enhanced Parameter-efficient Fine-tuning for Salient Object Detection Wen Liang et.al. 2404.15008 null
2024-04-23 ContextualFusion: Context-Based Multi-Sensor Fusion for 3D Object Detection in Adverse Operating Conditions Shounak Sural et.al. 2404.14780 null
2024-04-23 Unified Unsupervised Salient Object Detection via Knowledge Transfer Yao Yuan et.al. 2404.14759 link
2024-04-22 SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection Yuxia Wang et.al. 2404.14183 null
2024-04-22 Text in the Dark: Extremely Low-Light Text Image Enhancement Che-Tsung Lin et.al. 2404.14135 null
2024-04-22 CKD: Contrastive Knowledge Distillation from A Sample-wise Perspective Wencheng Zhu et.al. 2404.14109 null
2024-04-22 Benchmarking Multi-Modal LLMs for Testing Visual Deep Learning Systems Through the Lens of Image Mutation Liwen Wang et.al. 2404.13945 null
2024-04-22 NeRF-DetS: Enhancing Multi-View 3D Object Detection with Sampling-adaptive Network of Continuous NeRF-based Representation Chi Huang et.al. 2404.13921 null
2024-04-22 TeamTrack: A Dataset for Multi-Sport Multi-Object Tracking in Full-pitch Videos Atom Scott et.al. 2404.13868 null
2024-04-22 Toward Robust LiDAR based 3D Object Detection via Density-Aware Adaptive Thresholding Eunho Lee et.al. 2404.13852 null
2024-04-21 A Nasal Cytology Dataset for Object Detection and Deep Learning Mauro Camporeale et.al. 2404.13745 null
2024-04-23 Clio: Real-time Task-Driven Open-Set 3D Scene Graphs Dominic Maggio et.al. 2404.13696 null
2024-04-20 FisheyeDetNet: Object Detection on Fisheye Surround View Camera Systems for Automated Driving Ganesh Sistu et.al. 2404.13443 null
2024-04-19 A comparison between single-stage and two-stage 3D tracking algorithms for greenhouse robotics David Rapado-Rincon et.al. 2404.12963 null
2024-04-19 Language-Driven Active Learning for Diverse Open-Set 3D Object Detection Ross Greer et.al. 2404.12856 null
2024-04-19 ECOR: Explainable CLIP for Object Recognition Ali Rasekh et.al. 2404.12839 null
2024-04-19 A Point-Based Approach to Efficient LiDAR Multi-Task Perception Christopher Lang et.al. 2404.12798 null
2024-04-19 ELEV-VISION-SAM: Integrated Vision Language and Foundation Model for Automated Estimation of Building Lowest Floor Elevation Yu-Hsuan Ho et.al. 2404.12606 null
2024-04-18 The devil is in the object boundary: towards annotation-free instance segmentation using Foundation Models Cheng Shi et.al. 2404.11957 link
2024-04-18 Simultaneous Detection and Interaction Reasoning for Object-Centric Action Recognition Xunsong Li et.al. 2404.11903 null
2024-04-17 TempBEV: Improving Learned BEV Encoders with Combined Image and BEV Space Temporal Aggregation Thomas Monninger et.al. 2404.11803 null
2024-04-17 Multimodal 3D Object Detection on Unseen Domains Deepti Hegde et.al. 2404.11764 null
2024-04-17 Equivariant Spatio-Temporal Self-Supervision for LiDAR Object Detection Deepti Hegde et.al. 2404.11737 null
2024-04-17 Multi-resolution Rescored ByteTrack for Video Object Detection on Ultra-low-power Embedded Systems Luca Bompani et.al. 2404.11488 link
2024-04-17 EcoMLS: A Self-Adaptation Approach for Architecting Green ML-Enabled Systems Meghana Tedla et.al. 2404.11411 null
2024-04-17 Detector Collapse: Backdooring Object Detection to Catastrophic Overload or Blindness Hangtao Zhang et.al. 2404.11357 null
2024-04-17 Simple In-place Data Augmentation for Surveillance Object Detection Munkh-Erdene Otgonbold et.al. 2404.11226 null
2024-04-17 Feature Corrective Transfer Learning: End-to-End Solutions to Object Detection in Non-Ideal Visual Conditions Chuheng Wei et.al. 2404.11214 null
2024-04-17 GhostNetV3: Exploring the Training Strategies for Compact Models Zhenhua Liu et.al. 2404.11202 null
2024-04-17 How to deal with glare for improved perception of Autonomous Vehicles Muhammad Z. Alam et.al. 2404.10992 null
2024-04-17 Leveraging 3D LiDAR Sensors to Enable Enhanced Urban Safety and Public Health: Pedestrian Monitoring and Abnormal Activity Detection Nawfal Guefrachi et.al. 2404.10978 null
2024-04-16 OSR-ViT: A Simple and Modular Framework for Open-Set Object Detection and Discovery Matthew Inkawhich et.al. 2404.10865 null
2024-04-16 Learning Feature Inversion for Multi-class Anomaly Detection under General-purpose COCO-AD Benchmark Jiangning Zhang et.al. 2404.10760 null
2024-04-16 Watch Your Step: Optimal Retrieval for Continual Learning at Scale Truman Hickok et.al. 2404.10758 null
2024-04-16 Efficient optimal dispersed Haar-like filters for face detection Zeinab Sedaghatjoo et.al. 2404.10476 null
2024-04-16 Camera clustering for scalable stream-based active distillation Dani Manjah et.al. 2404.10411 null
2024-04-15 Low-Light Image Enhancement Framework for Improved Object Detection in Fisheye Lens Datasets Dai Quoc Tran et.al. 2404.10078 link
2024-04-15 Explainable Light-Weight Deep Learning Pipeline for Improved Drought Stres Aswini Kumar Patra et.al. 2404.10073 null
2024-04-15 VFMM3D: Releasing the Potential of Image by Vision Foundation Model for Monocular 3D Object Detection Bonan Ding et.al. 2404.09431 null
2024-04-14 TEXT2TASTE: A Versatile Egocentric Vision System for Intelligent Reading Assistance Using Large Language Model Wiktor Mucha et.al. 2404.09254 null
2024-04-14 DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection Lewei Yao et.al. 2404.09216 null
2024-04-14 Coreset Selection for Object Detection Hojun Lee et.al. 2404.09161 null
2024-04-14 Fusion-Mamba for Cross-modality Object Detection Wenhao Dong et.al. 2404.09146 null
2024-04-13 The Snake's Beating Heart? A Millisecond Pulsar Binary in the Galactic Center Radio Filament G359.1 $-$ 0.2 Marcus E. Lower et.al. 2404.09098 null
2024-04-13 BG-YOLO: A Bidirectional-Guided Method for Underwater Object Detection Jian Zhang et.al. 2404.08979 null
2024-04-13 Shifting Spotlight for Co-supervision: A Simple yet Efficient Single-branch Network to See Through Camouflage Yang Hu et.al. 2404.08936 null
2024-04-12 Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation Yanhao Zheng et.al. 2404.08603 link
2024-04-12 FashionFail: Addressing Failure Cases in Fashion Object Detection and Segmentation Riza Velioglu et.al. 2404.08582 null
2024-04-12 Analyzing Decades-Long Environmental Changes in Namibia Using Archival Aerial Photography and Deep Learning Girmaw Abebe Tadesse et.al. 2404.08544 null
2024-04-12 MambaDFuse: A Mamba-based Dual-phase Model for Multi-modality Image Fusion Zhe Li et.al. 2404.08406 null
2024-04-12 Overcoming Scene Context Constraints for Object Detection in wild using Defilters Vamshi Krishna Kancharla et.al. 2404.08293 null
2024-04-11 ConsistencyDet: Robust Object Detector with Denoising Paradigm of Consistency Model Lifan Jiang et.al. 2404.07773 null
2024-04-11 Exploiting Object-based and Segmentation-based Semantic Features for Deep Learning-based Indoor Scene Classification Ricardo Pereira et.al. 2404.07739 null
2024-04-11 Run-time Monitoring of 3D Object Detection in Automated Driving Systems Using Early Layer Neural Activation Patterns Hakan Yekta Yatbaz et.al. 2404.07685 null
2024-04-11 Finding Dino: A plug-and-play framework for unsupervised detection of out-of-distribution objects using prototypes Poulami Sinhamahapatra et.al. 2404.07664 null
2024-04-11 Separated Attention: An Improved Cycle GAN Based Under Water Image Enhancement Method Tashmoy Ghosh et.al. 2404.07649 null
2024-04-11 GLID: Pre-training a Generalist Encoder-Decoder Vision Model Jihao Liu et.al. 2404.07603 null
2024-04-11 SFSORT: Scene Features-based Simple Online Real-Time Tracker M. M. Morsali et.al. 2404.07553 link
2024-04-11 The Sydney Radio Star Catalogue: properties of radio stars at megahertz to gigahertz frequencies Laura N. Driessen et.al. 2404.07418 null
2024-04-11 Simplifying Two-Stage Detectors for On-Device Inference in Remote Sensing Jaemin Kang et.al. 2404.07405 null
2024-04-11 A fine-tuning workflow for automatic first-break picking with deep learning Amir Mardan et.al. 2404.07400 link
2024-04-10 Identification of Fine-grained Systematic Errors via Controlled Scene Generation Valentyn Boreiko et.al. 2404.07045 null
2024-04-10 Accurate Tennis Court Line Detection on Amateur Recorded Matches Sameer Agrawal et.al. 2404.06977 null
2024-04-10 SARA: Smart AI Reading Assistant for Reading Comprehension Enkeleda Thaqi et.al. 2404.06906 null
2024-04-10 Sparse Points to Dense Clouds: Enhancing 3D Detection with Limited LiDAR Data Aakash Kumar et.al. 2404.06715 null
2024-04-10 Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting Hao Lu et.al. 2404.06700 link
2024-04-09 Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping Anas Gouda et.al. 2404.06277 null
2024-04-09 Label-Efficient 3D Object Detection For Road-Side Units Minh-Quan Dao et.al. 2404.06256 null
2024-04-09 Automatic Defect Detection in Sewer Network Using Deep Learning Based Object Detector Bach Ha et.al. 2404.06219 null
2024-04-09 YOLC: You Only Look Clusters for Tiny Object Detection in Aerial Images Chenguang Liu et.al. 2404.06180 null
2024-04-09 Enhanced Radar Perception via Multi-Task Learning: Towards Refined Data for Sensor Fusion Applications Huawei Sun et.al. 2404.06165 null
2024-04-09 Improving Facial Landmark Detection Accuracy and Efficiency with Knowledge Distillation Zong-Wei Hong et.al. 2404.06029 null
2024-04-08 Retrieval-Augmented Open-Vocabulary Object Detection Jooyeon Kim et.al. 2404.05687 link
2024-04-08 3D-COCO: extension of MS-COCO dataset for image detection and 3D reconstruction modules Maxence Bideaux et.al. 2404.05641 null
2024-04-08 PetKaz at SemEval-2024 Task 8: Can Linguistics Capture the Specifics of LLM-generated Text? Kseniia Petukhova et.al. 2404.05483 null
2024-04-08 Detecting Every Object from Events Haitian Zhang et.al. 2404.05285 link
2024-04-08 MOSE: Boosting Vision-based Roadside 3D Object Detection with Scene Cues Xiahan Chen et.al. 2404.05280 null
2024-04-08 Rendering-Enhanced Automatic Image-to-Point Cloud Registration for Roadside Scenes Yu Sheng et.al. 2404.05164 null
2024-04-08 Better Monocular 3D Detectors with LiDAR from the Past Yurong You et.al. 2404.05139 link
2024-04-07 AirShot: Efficient Few-Shot Detection for Autonomous Exploration Zihan Wang et.al. 2404.05069 link
2024-04-07 PlateSegFL: A Privacy-Preserving License Plate Detection Using Federated Segmentation Learning Md. Shahriar Rahman Anuvab et.al. 2404.05049 null
2024-04-07 PathFinder: Attention-Driven Dynamic Non-Line-of-Sight Tracking with a Mobile Robot Shenbagaraj Kannapiran et.al. 2404.05024 null
2024-04-05 SCAResNet: A ResNet Variant Optimized for Tiny Object Detection in Transmission and Distribution Towers Weile Li et.al. 2404.04179 link
2024-04-05 Designing Robots to Help Women Martin Cooney et.al. 2404.04123 null
2024-04-04 Is CLIP the main roadblock for fine-grained open-world perception? Lorenzo Bianchi et.al. 2404.03539 link
2024-04-04 DQ-DETR: DETR with Dynamic Query for Tiny Object Detection Yi-Xin Huang et.al. 2404.03507 null
2024-04-05 A Methodology to Study the Impact of Spiking Neural Network Parameters considering Event-Based Automotive Data Iqra Bano et.al. 2404.03493 null
2024-04-04 MonoCD: Monocular 3D Object Detection with Complementary Depths Longfei Yan et.al. 2404.03181 link
2024-04-03 DPFT: Dual Perspective Fusion Transformer for Camera-Radar-based Object Detection Felix Fent et.al. 2404.03015 null
2024-04-03 ALOHa: A New Measure for Hallucination in Captioning Models Suzanne Petryk et.al. 2404.02904 null
2024-04-03 FlightScope: A Deep Comprehensive Assessment of Aircraft Detection Algorithms in Satellite Imagery Safouane El Ghazouali et.al. 2404.02877 link
2024-04-03 HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras Zhongyu Xia et.al. 2404.02517 link
2024-04-04 TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate Expression Ho-Joong Kim et.al. 2404.02405 null
2024-04-04 EGTR: Extracting Graph from Transformer for Scene Graph Generation Jinbae Im et.al. 2404.02072 link
2024-04-03 Cooperative Students: Navigating Unsupervised Domain Adaptation in Nighttime Object Detection Jicheng Yuan et.al. 2404.01988 link
2024-04-02 Towards Enhanced Analysis of Lung Cancer Lesions in EBUS-TBNA -- A Semi-Supervised Video Object Detection Method Jyun-An Lin et.al. 2404.01929 null
2024-04-02 Humanizing Machine-Generated Content: Evading AI-Text Detection through Adversarial Attack Ying Zhou et.al. 2404.01907 link
2024-04-02 Scene Adaptive Sparse Transformer for Event-based Object Detection Yansong Peng et.al. 2404.01882 link
2024-04-02 Semi-Supervised Domain Adaptation for Wildfire Detection JooYoung Jang et.al. 2404.01842 null
2024-04-02 Sparse Semi-DETR: Sparse Learnable Queries for Semi-Supervised Object Detection Tahira Shehzadi et.al. 2404.01819 null
2024-04-02 Analyzing the Single Event Upset Vulnerability of Binarized Neural Networks on SRAM FPGAs Ioanna Souvatzoglou et.al. 2404.01757 null
2024-04-02 Disentangled Pre-training for Human-Object Interaction Detection Zhuolong Li et.al. 2404.01725 null
2024-04-02 Task Integration Distillation for Object Detectors Hai Su et.al. 2404.01699 null
2024-03-29 PLoc: A New Evaluation Criterion Based on Physical Location for Autonomous Driving Datasets Ruining Yang et.al. 2403.19893 null
2024-03-29 MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection Ali Behrouz et.al. 2403.19888 null
2024-03-28 DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs Donghyun Kim et.al. 2403.19588 link
2024-03-28 OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality Propagation Zhenyu Wang et.al. 2403.19580 null
2024-03-28 AIpom at SemEval-2024 Task 8: Detecting AI-produced Outputs in M4 Alexander Shirnin et.al. 2403.19354 null
2024-03-28 Sparse Generation: Making Pseudo Labels Sparse for weakly supervision with points Tian Ma et.al. 2403.19306 null
2024-03-28 CAT: Exploiting Inter-Class Dynamics for Domain Adaptive Object Detection Mikhail Kennerley et.al. 2403.19278 link
2024-03-28 Algorithmic Ways of Seeing: Using Object Detection to Facilitate Art Exploration Louie Søs Meyer et.al. 2403.19174 null
2024-03-28 CRKD: Enhanced Camera-Radar Object Detection with Cross-modality Knowledge Distillation Lingjun Zhao et.al. 2403.19104 null
2024-03-28 A Real-Time Framework for Domain-Adaptive Underwater Object Detection with Image Enhancement Junjie Wen et.al. 2403.19079 null
2024-03-27 Illicit object detection in X-ray images using Vision Transformers Jorgen Cani et.al. 2403.19043 null
2024-03-27 Benchmarking Object Detectors with COCO: A New Path Forward Shweta Singh et.al. 2403.18819 link
2024-03-27 PhysicsAssistant: An LLM-Powered Interactive Learning Robot for Physics Lab Investigations Ehsan Latif et.al. 2403.18721 null
2024-03-27 CosalPure: Learning Concept from Group Images for Robust Co-Saliency Detection Jiayi Zhu et.al. 2403.18554 null
2024-03-27 BAM: Box Abstraction Monitors for Real-time OoD Detection in Object Detection Changshun Wu et.al. 2403.18373 null
2024-03-27 Ship in Sight: Diffusion Models for Ship-Image Super Resolution Luigi Sigillo et.al. 2403.18370 link
2024-03-27 DODA: Diffusion for Object-detection Domain Adaptation in Agriculture Shuai Xiang et.al. 2403.18334 null
2024-03-27 Tracking-Assisted Object Detection with Event Cameras Ting-Kang Yen et.al. 2403.18330 null
2024-03-27 SGDM: Static-Guided Dynamic Module Make Stronger Visual Models Wenjie Xing et.al. 2403.18282 null
2024-03-27 Road Obstacle Detection based on Unknown Objectness Scores Chihiro Noguchi et.al. 2403.18207 null
2024-03-26 State of the art applications of deep learning within tracking and detecting marine debris: A survey Zoe Moorton et.al. 2403.18067 null
2024-03-26 The Solution for the CVPR 2023 1st foundation model challenge-Track2 Haonan Xu et.al. 2403.17702 null
2024-03-26 PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition Chenhongyi Yang et.al. 2403.17695 link
2024-03-26 UADA3D: Unsupervised Adversarial Domain Adaptation for 3D Object Detection with Sparse LiDAR and Large Domain Gaps Maciej K Wozniak et.al. 2403.17633 null
2024-03-26 SSF3D: Strict Semi-Supervised 3D Object Detection with Switching Filter Songbur Wong et.al. 2403.17390 null
2024-03-26 Decoupled Pseudo-labeling for Semi-Supervised Monocular 3D Object Detection Jiacheng Zhang et.al. 2403.17387 null
2024-03-26 AIDE: An Automatic Data Engine for Object Detection in Autonomous Driving Mingfu Liang et.al. 2403.17373 null
2024-03-26 Staircase Localization for Autonomous Exploration in Urban Environments Jinrae Kim et.al. 2403.17330 null
2024-03-25 Co-Occurring of Object Detection and Identification towards unlabeled object discovery Binay Kumar Singh et.al. 2403.17223 null
2024-03-25 Optimizing LiDAR Placements for Robust Driving Perception in Adverse Conditions Ye Li et.al. 2403.17009 link
2024-03-25 Isolated Diffusion: Optimizing Multi-Concept Text-to-Image Generation Training-Freely with Isolated Diffusion Guidance Jingyuan Zhu et.al. 2403.16954 null
2024-03-25 TrustAI at SemEval-2024 Task 8: A Comprehensive Analysis of Multi-domain Machine Generated Text Detection Techniques Ashok Urlana et.al. 2403.16592 null
2024-03-25 RCBEVDet: Radar-camera Fusion in Bird's Eye View for 3D Object Detection Zhiwei Lin et.al. 2403.16440 link
2024-03-25 ASDF: Assembly State Detection Utilizing Late Fusion by Integrating 6D Pose Estimation Hannah Schieber et.al. 2403.16400 null
2024-03-25 Impact of Video Compression Artifacts on Fisheye Camera Visual Perception Tasks Madhumitha Sakthi et.al. 2403.16338 null
2024-03-24 Cross-domain Multi-modal Few-shot Object Detection via Rich Text Zeyu Shangguan et.al. 2403.16188 null
2024-03-24 Semantic Is Enough: Only Semantic Information For NeRF Reconstruction Ruibo Wang et.al. 2403.16043 null
2024-03-23 Adversarial Defense Teacher for Cross-Domain Object Detection under Poor Visibility Conditions Kaiwen Wang et.al. 2403.15786 null
2024-03-23 EAGLE: A Domain Generalization Framework for AI-generated Text Detection Amrita Bhattacharjee et.al. 2403.15690 null
2024-03-25 Point-DETR3D: Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection Hongzhi Gao et.al. 2403.15317 null
2024-03-22 CR3DT: Camera-RADAR Fusion for 3D Detection and Tracking Nicolas Baumann et.al. 2403.15313 null
2024-03-22 IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection Junbo Yin et.al. 2403.15241 null
2024-03-22 MSCoTDet: Language-driven Multi-modal Fusion for Improved Multispectral Pedestrian Detection Taeheon Kim et.al. 2403.15209 null
2024-03-22 SFOD: Spiking Fusion Object Detector Yimeng Fan et.al. 2403.15192 link
2024-03-22 CRPlace: Camera-Radar Fusion with BEV Representation for Place Recognition Shaowei Fu et.al. 2403.15183 null
2024-03-22 An In-Depth Analysis of Data Reduction Methods for Sustainable Deep Learning Víctor Toscano-Durán et.al. 2403.15150 null
2024-03-22 Gradient-based Sampling for Class Imbalanced Semi-supervised Object Detection Jiaming Li et.al. 2403.15127 link
2024-03-22 VRSO: Visual-Centric Reconstruction for Static Object Annotation Chenyao Yu et.al. 2403.15026 null
2024-03-22 Vehicle Detection Performance in Nordic Region Hamam Mokayed et.al. 2403.15017 null
2024-03-21 T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy Qing Jiang et.al. 2403.14610 link
2024-03-21 UAV-Assisted Maritime Search and Rescue: A Holistic Approach Martin Messmer et.al. 2403.14281 null
2024-03-21 Scene-Graph ViT: End-to-End Open-Vocabulary Visual Relationship Detection Tim Salzmann et.al. 2403.14270 null
2024-03-21 3D Object Detection from Point Cloud via Voting Step Diffusion Haoran Hou et.al. 2403.14133 null
2024-03-20 EcoSense: Energy-Efficient Intelligent Sensing for In-Shore Ship Detection through Edge-Cloud Collaboration Wenjun Huang et.al. 2403.14027 null
2024-03-20 RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognition Ziyu Liu et.al. 2403.13805 link
2024-03-20 Bounding Box Stability against Feature Dropout Reflects Detector Generalization across Environments Yang Yang et.al. 2403.13803 link
2024-03-20 Fostc3net:A Lightweight YOLOv5 Based On the Network Structure Optimization Danqing Ma et.al. 2403.13703 null
2024-03-20 Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments Djamahl Etchegaray et.al. 2403.13556 null
2024-03-20 MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining Di Wang et.al. 2403.13430 link
2024-03-20 Few-shot Oriented Object Detection with Memorable Contrastive Learning in Remote Sensing Images Jiawei Zhou et.al. 2403.13375 null
2024-03-20 Adaptive Ensembles of Fine-Tuned Transformers for LLM-Generated Text Detection Zhixin Lai et.al. 2403.13335 null
2024-03-20 DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception Yibo Wang et.al. 2403.13304 null
2024-03-20 Facilitating Pornographic Text Detection for Open-Domain Dialogue Systems via Knowledge Distillation of Large Language Models Huachuan Qiu et.al. 2403.13250 null
2024-03-19 SceneScript: Reconstructing Scenes With An Autoregressive Structured Language Model Armen Avetisyan et.al. 2403.13064 null
2024-03-19 Wildfire danger prediction optimization with transfer learning Spiros Maggioros et.al. 2403.12871 link
2024-03-19 As Firm As Their Foundations: Can open-sourced foundation models be used to create adversarial examples for downstream tasks? Anjun Hu et.al. 2403.12693 null
2024-03-19 EAS-SNN: End-to-End Adaptive Sampling and Representation for Event-based Detection with Recurrent Spiking Neural Networks Ziming Wang et.al. 2403.12574 null
2024-03-19 DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM Yixuan Wu et.al. 2403.12488 null
2024-03-19 TransformMix: Learning Transformation and Mixing Strategies from Data Tsz-Him Cheung et.al. 2403.12429 null
2024-03-19 VisionGPT: LLM-Assisted Real-Time Anomaly Detection for Safe Visual Navigation Hao Wang et.al. 2403.12415 null
2024-03-19 Entity6K: A Large Open-Domain Evaluation Dataset for Real-World Entity Recognition Jielin Qiu et.al. 2403.12339 null
2024-03-18 EffiPerception: an Efficient Framework for Various Perception Tasks Xinhao Xiang et.al. 2403.12317 null
2024-03-18 Prototipo de un Contador Bidireccional Automático de Personas basado en sensores de visión 3D Benjamín Ojeda-Magaña et.al. 2403.12310 null
2024-03-18 Align and Distill: Unifying and Improving Domain Adaptive Object Detection Justin Kay et.al. 2403.12029 link
2024-03-18 TrajectoryNAS: A Neural Architecture Search for Trajectory Prediction Ali Asghar Sharifi et.al. 2403.11695 null
2024-03-18 Just Add $100 More: Augmenting NeRF-based Pseudo-LiDAR Point Cloud for Resolving Class-imbalance Problem Mincheol Chang et.al. 2403.11573 null
2024-03-18 R2SNet: Scalable Domain Adaptation for Object Detection in Cloud-Based Robots Ecosystems via Proposal Refinement Michele Antonazzi et.al. 2403.11567 null
2024-03-18 Continual Forgetting for Pre-trained Vision Models Hongbo Zhao et.al. 2403.11530 link
2024-03-17 V2X-DGW: Domain Generalization for Multi-agent Perception under Adverse Weather Conditions Baolu Li et.al. 2403.11371 null
2024-03-17 Advanced Knowledge Extraction of Physical Design Drawings, Translation and conversion to CAD formats using Deep Learning Jesher Joshua M et.al. 2403.11291 null
2024-03-17 ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models Siyuan Huang et.al. 2403.11289 null
2024-03-17 CPA-Enhancer: Chain-of-Thought Prompted Adaptive Enhancer for Object Detection under Unknown Degradations Yuwei Zhang et.al. 2403.11220 link
2024-03-17 GRA: Detecting Oriented Objects through Group-wise Rotating and Attention Jiangshan Wang et.al. 2403.11127 null
2024-03-17 Self-supervised co-salient object detection via feature correspondence at multiple scales Souradeep Chakraborty et.al. 2403.11107 link
2024-03-14 Open-Vocabulary Object Detection with Meta Prompt Representation and Instance Contrastive Optimization Zhao Wang et.al. 2403.09433 null
2024-03-14 D3T: Distinctive Dual-Domain Teacher Zigzagging Across RGB-Thermal Gap for Domain-Adaptive Object Detection Dinh Phat Do et.al. 2403.09359 link
2024-03-14 Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring Yufei Zhan et.al. 2403.09333 link
2024-03-14 EfficientMFD: Towards More Efficient Multimodal Synchronous Fusion Detection Jiaqing Zhang et.al. 2403.09323 link
2024-03-14 Knowledge Distillation in YOLOX-ViT for Side-Scan Sonar Object Detection Martin Aubard et.al. 2403.09313 link
2024-03-14 MOTPose: Multi-object 6D Pose Estimation for Dynamic Video Sequences using Attention-based Temporal Fusion Arul Selvam Periyasamy et.al. 2403.09309 null
2024-03-14 CLIP-EBC: CLIP Can Count Accurately through Enhanced Blockwise Classification Yiming Ma et.al. 2403.09281 null
2024-03-14 D-YOLO a robust framework for object detection in adverse weather conditions Zihan Chu et.al. 2403.09233 null
2024-03-14 Improving Distant 3D Object Detection Using 2D Box Supervision Zetong Yang et.al. 2403.09230 null
2024-03-14 PoIFusion: Multi-Modal 3D Object Detection via Fusion at Points of Interest Jiajun Deng et.al. 2403.09212 null
2024-03-13 VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis Enric Corona et.al. 2403.08764 null
2024-03-13 MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning Jialv Zou et.al. 2403.08760 link
2024-03-13 Data Augmentation in Human-Centric Vision Wentao Jiang et.al. 2403.08650 null
2024-03-13 PRAGO: Differentiable Multi-View Pose Optimization From Objectness Detections Matteo Taiana et.al. 2403.08586 null
2024-03-13 A Multimodal Fusion Network For Student Emotion Recognition Based on Transformer and Tensor Product Ao Xiang et.al. 2403.08511 null
2024-03-13 Improved YOLOv5 Based on Attention Mechanism and FasterNet for Foreign Object Detection on Railway and Airway tracks Zongqing Qi et.al. 2403.08499 null
2024-03-13 IAMCV Multi-Scenario Vehicle Interaction Dataset Novel Certad et.al. 2403.08455 null
2024-03-13 Advancing Security in AI Systems: A Novel Approach to Detecting Backdoors in Deep Neural Networks Khondoker Murad Hossain et.al. 2403.08208 null
2024-03-12 TaskCLIP: Extend Large Vision-Language Model for Task Oriented Object Detection Hanning Chen et.al. 2403.08108 null
2024-03-12 Aedes aegypti Egg Counting with Neural Networks for Object Detection Micheli Nayara de Oliveira Vicente et.al. 2403.08016 null
2024-03-12 Mondrian: On-Device High-Performance Video Analytics with Compressive Packed Inference Changmin Jeon et.al. 2403.07598 null
2024-03-12 PeLK: Parameter-efficient Large Kernel ConvNets with Peripheral Convolution Honghao Chen et.al. 2403.07589 null
2024-03-12 A Survey of Vision Transformers in Autonomous Driving: Current Trends and Future Directions Quoc-Vinh Lai-Dang et.al. 2403.07542 null
2024-03-12 JSTR: Joint Spatio-Temporal Reasoning for Event-based Moving Object Detection Hanyu Zhou et.al. 2403.07436 null
2024-03-12 Eliminating Cross-modal Conflicts in BEV Space for LiDAR-Camera 3D Object Detection Jiahui Fu et.al. 2403.07372 null
2024-03-12 GPT-generated Text Detection: Benchmark Dataset and Tensor-based Detection Method Zubair Qazi et.al. 2403.07321 link
2024-03-12 MENTOR: Multilingual tExt detectioN TOward leaRning by analogy Hsin-Ju Lin et.al. 2403.07286 null
2024-03-12 SparseLIF: High-Performance Sparse LiDAR-Camera Fusion for 3D Object Detection Hongcheng Zhang et.al. 2403.07284 null
2024-03-12 Adaptive Bounding Box Uncertainties via Two-Step Conformal Prediction Alexander Timans et.al. 2403.07263 null
2024-03-11 Class Imbalance in Object Detection: An Experimental Diagnosis and Study of Mitigation Strategies Nieves Crasto et.al. 2403.07113 link
2024-03-11 Real-time Transformer-based Open-Vocabulary Detection with Efficient Fusion Head Tiancheng Zhao et.al. 2403.06892 null
2024-03-11 LeOCLR: Leveraging Original Images for Contrastive Learning of Visual Representations Mohammad Alkhalefi et.al. 2403.06813 null
2024-03-11 Genetic Learning for Designing Sim-to-Real Data Augmentations Bram Vanherle et.al. 2403.06786 null
2024-03-11 Evaluating the Energy Efficiency of Few-Shot Learning for Object Detection in Industrial Settings Georgios Tsoumplekas et.al. 2403.06631 null
2024-03-11 Cross-domain and Cross-dimension Learning for Image-to-Graph Transformers Alexander H. Berger et.al. 2403.06601 null
2024-03-11 SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection Yuxuan Li et.al. 2403.06534 link
2024-03-11 3D Semantic Segmentation-Driven Representations for 3D Object Detection Hayeon O et.al. 2403.06501 null
2024-03-11 Fine-Grained Pillar Feature Encoding Via Spatio-Temporal Virtual Grid for 3D Object Detection Konyul Park et.al. 2403.06433 null
2024-03-10 Transformer based Multitask Learning for Image Captioning and Object Detection Debolena Basak et.al. 2403.06292 null
2024-03-10 Poly Kernel Inception Network for Remote Sensing Detection Xinhao Cai et.al. 2403.06258 link
2024-03-08 EVD4UAV: An Altitude-Sensitive Benchmark to Evade Vehicle Detection in UAV Huiming Sun et.al. 2403.05422 null
2024-03-08 SIRST-5K: Exploring Massive Negatives Synthesis with Self-supervised Learning for Robust Infrared Small Target Detection Yahao Lu et.al. 2403.05416 link
2024-03-08 Exploring Robust Features for Few-Shot Object Detection in Satellite Imagery Xavier Bou et.al. 2403.05381 null
2024-03-08 Frequency-Adaptive Dilated Convolution for Semantic Segmentation Linwei Chen et.al. 2403.05369 link
2024-03-08 VLM-PL: Advanced Pseudo Labeling approach Class Incremental Object Detection with Vision-Language Model Junsu Kim et.al. 2403.05346 null
2024-03-08 Improving the Successful Robotic Grasp Detection Using Convolutional Neural Networks Hamed Hosseini et.al. 2403.05211 null
2024-03-08 LanePtrNet: Revisiting Lane Detection as Point Voting and Grouping on Curves Jiayan Cao et.al. 2403.05155 null
2024-03-08 RadarDistill: Boosting Radar-based Object Detection Performance via Knowledge Distillation from LiDAR Features Geonho Bang et.al. 2403.05061 null
2024-03-08 ActFormer: Scalable Collaborative Perception via Active Queries Suozhi Huang et.al. 2403.04968 null
2024-03-07 FriendNet: Detection-Friendly Dehazing Network Yihua Fan et.al. 2403.04443 null
2024-03-07 Effectiveness Assessment of Recent Large Vision-Language Models Yao Jiang et.al. 2403.04306 null
2024-03-07 ACC-ViT : Atrous Convolution's Comeback in Vision Transformers Nabil Ibtehaz et.al. 2403.04200 null
2024-03-07 CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoors Object Detection from Multi-view Images Guanlin Shen et.al. 2403.04198 null
2024-03-07 Scalable and Robust Transformer Decoders for Interpretable Image Classification with Foundation Models Evelyn Mannix et.al. 2403.04125 null
2024-03-07 CMDA: Cross-Modal and Domain Adversarial Adaptation for LiDAR-Based 3D Object Detection Gyusam Chang et.al. 2403.03721 null
2024-03-06 Adversarial Infrared Geometry: Using Geometry to Perform Adversarial Attack against Infrared Pedestrian Detectors Kalibinuer Tiliwalidi et.al. 2403.03674 null
2024-03-06 Towards Detecting AI-Generated Text within Human-AI Collaborative Hybrid Texts Zijie Zeng et.al. 2403.03506 null
2024-03-06 Multi-task Learning for Real-time Autonomous Driving Leveraging Task-adaptive Attention Generator Wonhyeok Choi et.al. 2403.03468 null
2024-03-06 FLAME Diffuser: Grounded Wildfire Image Synthesis using Mask Guided Diffusion Hao Wang et.al. 2403.03463 null
2024-03-06 Performance Evaluation of Semi-supervised Learning Frameworks for Multi-Class Weed Detection Jiajia Li et.al. 2403.03390 link
2024-03-05 Detecting Concrete Visual Tokens for Multimodal Machine Translation Braeden Bowen et.al. 2403.03075 null
2024-03-05 Loss Design for Single-carrier Joint Communication and Neural Network-based Sensing Charlotte Muth et.al. 2403.02929 null
2024-03-05 Are Dense Labels Always Necessary for 3D Object Detection from Point Cloud? Chenqiang Gao et.al. 2403.02818 null
2024-03-05 Bootstrapping Rare Object Detection in High-Resolution Satellite Imagery Akram Zaytar et.al. 2403.02736 null
2024-03-05 FastOcc: Accelerating 3D Occupancy Prediction by Fusing the 2D Bird's-Eye View and Perspective View Jiawei Hou et.al. 2403.02710 null
2024-03-05 False Positive Sampling-based Data Augmentation for Enhanced 3D Object Detection Accuracy Jiyong Oh et.al. 2403.02639 null
2024-03-05 BSDP: Brain-inspired Streaming Dual-level Perturbations for Online Open World Object Detection Yu Chen et.al. 2403.02637 null
2024-03-04 NiNformer: A Network in Network Transformer with Token Mixing Generated Gating Function Abdullah Nazhat Abdullah et.al. 2403.02411 link
2024-03-04 COMMIT: Certifying Robustness of Multi-Sensor Fusion Systems against Semantic Attacks Zijian Huang et.al. 2403.02329 null
2024-03-04 Scalable Vision-Based 3D Object Detection and Monocular Depth Estimation for Autonomous Driving Yuxuan Liu et.al. 2403.02037 link
2024-03-02 TUMTraf V2X Cooperative Perception Dataset Walter Zimmer et.al. 2403.01316 null
2024-03-02 Causal Mode Multiplexer: A Novel Framework for Unbiased Multispectral Pedestrian Detection Taeheon Kim et.al. 2403.01300 null
2024-03-02 Run-time Introspection of 2D Object Detection in Automated Driving Systems Using Learning Representations Hakan Yekta Yatbaz et.al. 2403.01172 null
2024-03-02 ELA: Efficient Local Attention for Deep Convolutional Neural Networks Wei Xu et.al. 2403.01123 null
2024-03-02 Face Swap via Diffusion Model Feifei Wang et.al. 2403.01108 null
2024-03-02 Beyond Night Visibility: Adaptive Multi-Scale Fusion of Infrared and Visible Images Shufan Pei et.al. 2403.01083 null
2024-03-01 Learning Causal Features for Incremental Object Detection Zhenwei He et.al. 2403.00591 null
2024-03-01 Abductive Ego-View Accident Video Understanding for Safe Driving Perception Jianwu Fang et.al. 2403.00436 null
2024-03-04 DAMS-DETR: Dynamic Adaptive Multispectral Detection Transformer with Competitive Query Selection and Adaptive Feature Fusion Junjie Guo et.al. 2403.00326 null
2024-03-01 ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting Chen Duan et.al. 2403.00303 null
2024-02-29 SeMoLi: What Moves Together Belongs Together Jenny Seidenschwarz et.al. 2402.19463 null
2024-02-29 Genie: Smart ROS-based Caching for Connected Autonomous Robots Zexin Li et.al. 2402.19410 null
2024-02-29 ProtoP-OD: Explainable Object Detection with Prototypical Parts Pavlos Rath-Manakidis et.al. 2402.19142 null
2024-02-29 Theoretically Achieving Continuous Representation of Oriented Bounding Boxes Zikai Xiao et.al. 2402.18975 link
2024-02-29 Boosting Semi-Supervised Object Detection in Remote Sensing Images With Active Teaching Boxuan Zhang et.al. 2402.18958 null
2024-02-29 Edge Computing Enabled Real-Time Video Analysis via Adaptive Spatial-Temporal Semantic Filtering Xiang Chen et.al. 2402.18927 null
2024-02-29 A Simple yet Effective Network based on Vision Transformer for Camouflaged Object and Salient Object Detection Chao Hao et.al. 2402.18922 null
2024-02-29 Privacy-Preserving Autoencoder for Collaborative Object Detection Bardia Azizian et.al. 2402.18864 null
2024-02-29 Debiased Novel Category Discovering and Localization Juexiao Feng et.al. 2402.18821 null
2024-02-28 Spatial Coherence Loss for Salient and Camouflaged Object Detection and Beyond Ziyun Yang et.al. 2402.18698 null
2024-02-28 UniMODE: Unified Monocular 3D Object Detection Zhuoling Li et.al. 2402.18573 null
2024-02-28 Detection of Micromobility Vehicles in Urban Traffic Videos Khalil Sabri et.al. 2402.18503 link
2024-02-28 Sunshine to Rainstorm: Cross-Weather Knowledge Distillation for Robust 3D Object Detection Xun Huang et.al. 2402.18493 null
2024-02-28 Prompt-Driven Dynamic Object-Centric Learning for Single Domain Generalization Deng Li et.al. 2402.18447 null
2024-02-28 Unveiling novel insights into Kirchhoff migration for effective object detection using experimental Fresnel dataset Won-Kwang Park et.al. 2402.18322 null
2024-02-28 Zero-Shot Aerial Object Detection with Visual Description Regularization Zhengqing Zang et.al. 2402.18233 null
2024-02-28 VulMCI : Code Splicing-based Pixel-row Oversampling for More Continuous Vulnerability Image Generation Tao Peng et.al. 2402.18189 null
2024-02-27 SDDGR: Stable Diffusion-based Deep Generative Replay for Class Incremental Object Detection Junsu Kim et.al. 2402.17323 null
2024-02-27 A Vanilla Multi-Task Framework for Dense Visual Prediction Solution to 1st VCL Challenge -- Multi-Task Robustness Track Zehui Chen et.al. 2402.17319 null
2024-02-27 Probing Multimodal Large Language Models for Global and Local Semantic Representation Mingxu Tao et.al. 2402.17304 null

(back to top)

Semantic Segmentation

Publish Date Title Authors PDF Code
2024-11-22 Effective SAM Combination for Open-Vocabulary Semantic Segmentation Minhyeok Lee et.al. 2411.14723 null
2024-11-21 Revisiting the Integration of Convolution and Attention for Vision Backbone Lei Zhu et.al. 2411.14429 link
2024-11-21 CompetitorFormer: Competitor Transformer for 3D Instance Segmentation Duanchu Wang et.al. 2411.14179 null
2024-11-21 CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation Lin Sun et.al. 2411.13836 link
2024-11-21 Segment Any Class (SAC): Multi-Class Few-Shot Semantic Segmentation via Class Region Proposals Hussni Mohd Zakir et.al. 2411.13774 null
2024-11-20 FAST-Splat: Fast, Ambiguity-Free Semantics Transfer in Gaussian Splatting Ola Shorinwa et.al. 2411.13753 null
2024-11-20 DIS-Mine: Instance Segmentation for Disaster-Awareness in Poor-Light Condition in Underground Mines Mizanur Rahman Jewel et.al. 2411.13544 null
2024-11-21 Entropy Bootstrapping for Weakly Supervised Nuclei Detection James Willoughby et.al. 2411.13528 null
2024-11-20 BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation Umamaheswaran Raman Kumar et.al. 2411.13251 null
2024-11-20 XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation Ziyi Wang et.al. 2411.13243 link
2024-11-20 Automating Sonologists USG Commands with AI and Voice Interface Emad Mohamed et.al. 2411.13006 null
2024-11-19 Interactive Medical Image Segmentation: A Benchmark Dataset and Baseline Junlong Cheng et.al. 2411.12814 link
2024-11-19 A Multimodal Approach Combining Structural and Cross-domain Textual Guidance for Weakly Supervised OCT Segmentation Jiaqi Yang et.al. 2411.12615 link
2024-11-19 SAM Carries the Burden: A Semi-Supervised Approach Refining Pseudo Labels for Medical Segmentation Ron Keuth et.al. 2411.12602 link
2024-11-19 ADV2E: Bridging the Gap Between Analogue Circuit and Discrete Frames in the Video-to-Events Simulator Xiao Jiang et.al. 2411.12250 null
2024-11-18 ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements M. Arda Aydın et.al. 2411.12044 link
2024-11-18 Calibrated and Efficient Sampling-Free Confidence Estimation for LiDAR Scene Semantic Segmentation Hanieh Shojaei Miandashti et.al. 2411.11935 null
2024-11-18 MGNiceNet: Unified Monocular Geometric Scene Understanding Markus Schön et.al. 2411.11466 null
2024-11-18 MAIRA-Seg: Enhancing Radiology Report Generation with Segmentation-Aware Multimodal Large Language Models Harshita Sharma et.al. 2411.11362 null
2024-11-18 Reducing Label Dependency for Underwater Scene Understanding: A Survey of Datasets, Techniques and Applications Scarlett Raine et.al. 2411.11287 null
2024-11-18 Zero-Shot Automatic Annotation and Instance Segmentation using LLM-Generated Datasets: Eliminating Field Imaging and Manual Annotation for Deep Learning Model Development Ranjan Sapkota et.al. 2411.11285 null
2024-11-16 Attention-based U-Net Method for Autonomous Lane Detection Mohammadhamed Tangestanizadeh et.al. 2411.10902 null
2024-11-16 Automatic Discovery and Assessment of Interpretable Systematic Errors in Semantic Segmentation Jaisidh Singh et.al. 2411.10845 null
2024-11-16 Diffusion-Based Semantic Segmentation of Lumbar Spine MRI Scans of Lower Back Pain Patients Maria Monzon et.al. 2411.10755 null
2024-11-15 Repurposing Stable Diffusion Attention for Training-Free Unsupervised Interactive Segmentation Markus Karmann et.al. 2411.10411 null
2024-11-15 Y-MAP-Net: Real-time depth, normals, segmentation, multi-label captioning and 2D human pose in RGB images Ammar Qammaz et.al. 2411.10334 null
2024-11-15 RETR: Multi-View Radar Detection Transformer for Indoor Perception Ryoma Yataka et.al. 2411.10293 null
2024-11-15 CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation Dengke Zhang et.al. 2411.10086 null
2024-11-14 OneNet: A Channel-Wise 1D Convolutional U-Net Sanghyun Byun et.al. 2411.09838 link
2024-11-14 Instruction-Driven Fusion of Infrared-Visible Images: Tailoring for Diverse Downstream Tasks Zengyi Yang et.al. 2411.09387 null
2024-11-14 Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation Yuheng Shi et.al. 2411.09219 link
2024-11-14 Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing Imagery Ashim Dahal et.al. 2411.09101 link
2024-11-13 CoMiX: Cross-Modal Fusion with Deformable Convolutions for HSI-X Semantic Segmentation Xuming Zhang et.al. 2411.09023 null
2024-11-14 Masked Image Modeling Boosting Semi-Supervised Semantic Segmentation Yangyang Li et.al. 2411.08756 null
2024-11-13 Slender Object Scene Segmentation in Remote Sensing Image Based on Learnable Morphological Skeleton with Segment Anything Model Jun Xie et.al. 2411.08592 null
2024-11-13 UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation Chengyuan Zhang et.al. 2411.08569 null
2024-11-13 Detection and classification of radio sources with deep learning S. Riggi et.al. 2411.08519 null
2024-11-12 Isometric Transformations for Image Augmentation in Mueller Matrix Polarimetry Christopher Hahne et.al. 2411.07918 link
2024-11-12 INTRABENCH: Interactive Radiological Benchmark Constantin Ulrich et.al. 2411.07885 null
2024-11-12 Horticultural Temporal Fruit Monitoring via 3D Instance Segmentation and Re-Identification using Point Clouds Daniel Fusaro et.al. 2411.07799 link
2024-11-12 Semantic segmentation on multi-resolution optical and microwave data using deep learning Jai G Singla et.al. 2411.07581 null
2024-11-12 GaussianCut: Interactive segmentation via graph cut for 3D Gaussian Splatting Umangi Jain et.al. 2411.07555 null
2024-11-11 Data-Centric Learning Framework for Real-Time Detection of Aiming Beam in Fluorescence Lifetime Imaging Guided Surgery Mohamed Abul Hassan et.al. 2411.07395 null
2024-11-11 SAMPart3D: Segment Any Part in 3D Objects Yunhan Yang et.al. 2411.07184 link
2024-11-11 SIESEF-FusionNet: Spatial Inter-correlation Enhancement and Spatially-Embedded Feature Fusion Network for LiDAR Point Cloud Semantic Segmentation Jiale Chen et.al. 2411.06991 null
2024-11-11 Fast and Efficient Transformer-based Method for Bird's Eye View Instance Prediction Miguel Antunes-García et.al. 2411.06851 link
2024-11-11 Can KAN Work? Exploring the Potential of Kolmogorov-Arnold Networks in Computer Vision Yueyang Cang et.al. 2411.06727 null
2024-11-10 Few-shot Semantic Learning for Robust Multi-Biome 3D Semantic Mapping in Off-Road Environments Deegan Atha et.al. 2411.06632 null
2024-11-09 Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing Kaixuan Lu et.al. 2411.06091 null
2024-11-08 Joint-Optimized Unsupervised Adversarial Domain Adaptation in Remote Sensing Segmentation with Prompted Foundation Model Shuchang Lyu et.al. 2411.05878 link
2024-11-08 Agricultural Landscape Understanding At Country-Scale Radhika Dua et.al. 2411.05359 null
2024-11-08 Revisiting Network Perturbation for Semi-Supervised Semantic Segmentation Sien Li et.al. 2411.05307 link
2024-11-07 In the Era of Prompt Learning with Vision-Language Models Ankit Jha et.al. 2411.04892 null
2024-11-08 ZAHA: Introducing the Level of Facade Generalization and the Large-Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset Olaf Wysocki et.al. 2411.04865 link
2024-11-06 Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts Zhitong Gao et.al. 2411.03829 link
2024-11-06 SA3DIP: Segment Any 3D Instance with Potential 3D Priors Xi Yang et.al. 2411.03819 link
2024-11-06 Towards 3D Semantic Scene Completion for Autonomous Driving: A Meta-Learning Framework Empowered by Deformable Large-Kernel Attention and Mamba Model Yansong Qu et.al. 2411.03672 null
2024-11-05 Enhancing Weakly Supervised Semantic Segmentation for Fibrosis via Controllable Image Generation Zhiling Yue et.al. 2411.03551 null
2024-11-05 SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture Andrew Heschl et.al. 2411.03505 link
2024-11-05 Rethinking Decoders for Transformer-based Semantic Segmentation: Compression is All You Need Qishuai Wen et.al. 2411.03033 link
2024-11-05 Multi-modal NeRF Self-Supervision for LiDAR Semantic Segmentation Xavier Timoneda et.al. 2411.02969 null
2024-11-05 Mapping Africa Settlements: High Resolution Urban and Rural Map by Deep Learning and Satellite Imagery Mohammad Kakooei et.al. 2411.02935 null
2024-11-05 CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation Jinchao Ge et.al. 2411.02715 null
2024-11-04 Deep Learning on 3D Semantic Segmentation: A Detailed Review Thodoris Betsas et.al. 2411.02104 null
2024-11-04 Tree level change detection over Ahmedabad city using very high resolution satellite images and Deep Learning Jai G Singla et.al. 2411.02009 null
2024-11-04 Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models Sharat Agarwal et.al. 2411.01925 null
2024-11-04 DiffuMask-Editor: A Novel Paradigm of Integration Between the Segmentation Diffusion Model and Image Editing to Improve Segmentation Ability Bo Gao et.al. 2411.01819 null
2024-11-04 Toward Integrating Semantic-aware Path Planning and Reliable Localization for UAV Operations Thanh Nguyen Canh et.al. 2411.01816 null
2024-11-05 MSTA3D: Multi-scale Twin-attention for 3D Instance Segmentation Duc Dang Trung Tran et.al. 2411.01781 null
2024-11-03 PreCM: The Padding-based Rotation Equivariant Convolution Mode for Semantic Segmentation Xinyu Xu et.al. 2411.01624 null
2024-11-01 Enhancing Question Answering Precision with Optimized Vector Retrieval and Instructions Lixiao Yang et.al. 2411.01039 null
2024-11-01 Event-guided Low-light Video Semantic Segmentation Zhen Yao et.al. 2411.00639 null
2024-11-01 Automated Classification of Cell Shapes: A Comparative Evaluation of Shape Descriptors Valentina Vadori et.al. 2411.00561 null
2024-10-31 Federated Black-Box Adaptation for Semantic Segmentation Jay N. Paranjape et.al. 2410.24181 null
2024-10-31 COSNet: A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes Muhammad Ali et.al. 2410.24139 link
2024-10-31 Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion Model Hao Zhang et.al. 2410.23905 link
2024-10-30 S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving Maciej K. Wozniak et.al. 2410.23085 null
2024-10-31 CrossEarth: Geospatial Vision Foundation Model for Domain Generalizable Remote Sensing Semantic Segmentation Ziyang Gong et.al. 2410.22629 link
2024-10-29 Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation Zhaochong An et.al. 2410.22489 null
2024-10-29 Lightweight Frequency Masker for Cross-Domain Few-Shot Semantic Segmentation Jintao Tong et.al. 2410.22135 null
2024-10-29 Hyperspectral Imaging-Based Perception in Autonomous Driving Scenarios: Benchmarking Baseline Semantic Segmentation Models Imad Ali Shah et.al. 2410.22101 null
2024-10-29 Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation Ruihao Xia et.al. 2410.21708 link
2024-10-28 Domain Adaptation with a Single Vision-Language Embedding Mohammad Fahes et.al. 2410.21361 null
2024-10-28 IndraEye: Infrared Electro-Optical UAV-based Perception Dataset for Robust Downstream Tasks Manjunath D et.al. 2410.20953 null
2024-10-27 A Framework for Real-Time Volcano-Seismic Event Recognition Based on Multi-Station Seismograms and Semantic Segmentation Models Camilo Espinosa-Curilem et.al. 2410.20595 link
2024-10-27 Unlocking Comics: The AI4VA Dataset for Visual Understanding Peter Grönquist et.al. 2410.20459 link
2024-10-27 Historical Test-time Prompt Tuning for Vision Foundation Models Jingyi Zhang et.al. 2410.20346 null
2024-10-25 OReole-FM: successes and challenges toward billion-parameter foundation models for high-resolution satellite imagery Philipe Dias et.al. 2410.19965 null
2024-10-25 IPPON: Common Sense Guided Informative Path Planning for Object Goal Navigation Kaixian Qu et.al. 2410.19697 null
2024-10-25 Fusion-then-Distillation: Toward Cross-modal Positive Distillation for Domain Adaptive 3D Semantic Segmentation Yao Wu et.al. 2410.19446 link
2024-10-25 Context-Based Visual-Language Place Recognition Soojin Woo et.al. 2410.19341 link
2024-10-24 Every Component Counts: Rethinking the Measure of Success for Medical Semantic Segmentation in Multi-Instance Segmentation Tasks Alexander Jaus et.al. 2410.18684 null
2024-10-24 Unsupervised semantic segmentation of urban high-density multispectral point clouds Oona Oinonen et.al. 2410.18520 null
2024-10-26 CARLA2Real: a tool for reducing the sim2real gap in CARLA simulator Stefanos Pasios et.al. 2410.18238 null
2024-10-23 Towards Safer Planetary Exploration: A Hybrid Architecture for Terrain Traversability Analysis in Mars Rovers Achille Chiuchiarelli et.al. 2410.17738 null
2024-10-23 YOLOv11: An Overview of the Key Architectural Enhancements Rahima Khanam et.al. 2410.17725 null
2024-10-23 PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting Yu Wang et.al. 2410.17505 null
2024-10-22 EPContrast: Effective Point-level Contrastive Learning for Large-scale Point Cloud Understanding Zhiyi Pan et.al. 2410.17207 null
2024-10-22 LIMIS: Towards Language-based Interactive Medical Image Segmentation Lena Heinemann et.al. 2410.16939 null
2024-10-22 DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model Zhixiong Nan et.al. 2410.16707 null
2024-10-22 SERN: Simulation-Enhanced Realistic Navigation for Multi-Agent Robotic Systems in Contested Environments Jumman Hossain et.al. 2410.16686 null
2024-10-22 NucleiMix: Realistic Data Augmentation for Nuclei Instance Segmentation Jiamu Wang et.al. 2410.16671 null
2024-10-21 PlaneSAM: Multimodal Plane Instance Segmentation Using the Segment Anything Model Zhongchen Deng et.al. 2410.16545 null
2024-10-21 TIPS: Text-Image Pretraining with Spatial Awareness Kevis-Kokitsi Maninis et.al. 2410.16512 null
2024-10-21 GenGMM: Generalized Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation Nazanin Moradinasab et.al. 2410.16485 null
2024-10-21 Integrated Image-Text Based on Semi-supervised Learning for Small Sample Instance Segmentation Ruting Chi et.al. 2410.16063 null
2024-10-21 LiOn-XA: Unsupervised Domain Adaptation via LiDAR-Only Cross-Modal Adversarial Training Thomas Kreutz et.al. 2410.15833 link
2024-10-21 TALoS: Enhancing Semantic Scene Completion via Test-time Adaptation on the Line of Sight Hyun-Kurl Jang et.al. 2410.15674 link
2024-10-21 Deep Learning and Machine Learning -- Object Detection and Semantic Segmentation: From Theory to Applications Jintao Ren et.al. 2410.15584 null
2024-10-20 Multi-Layer Feature Fusion with Cross-Channel Attention-Based U-Net for Kidney Tumor Segmentation Fnu Neha et.al. 2410.15472 null
2024-10-20 Improving 3D Medical Image Segmentation at Boundary Regions using Local Self-attention and Global Volume Mixing Daniya Najiha Abdul Kareem et.al. 2410.15360 null
2024-10-18 On the Influence of Shape, Texture and Color for Learning Semantic Segmentation Annika Mütze et.al. 2410.14878 null
2024-10-18 Automated Road Extraction from Satellite Imagery Integrating Dense Depthwise Dilated Separable Spatial Pyramid Pooling with DeepLabV3+ Arpan Mahara et.al. 2410.14836 null
2024-10-18 Impact of imperfect annotations on CNN training and performance for instance segmentation and classification in digital pathology Laura Gálvez Jiménez et.al. 2410.14365 null
2024-10-17 ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding Guangda Ji et.al. 2410.13924 null
2024-10-17 Multi-style conversion for semantic segmentation of lesions in fundus images by adversarial attacks Clément Playout et.al. 2410.13822 link
2024-10-18 Enhanced Prompt-leveraged Weakly Supervised Cancer Segmentation based on Segment Anything Joonhyeon Song et.al. 2410.13621 link
2024-10-17 Day-Night Adaptation: An Innovative Source-free Adaptation Framework for Medical Image Segmentation Ziyang Chen et.al. 2410.13472 null
2024-10-17 SiamSeg: Self-Training with Contrastive Learning for Unsupervised Domain Adaptation in Remote Sensing Bin Wang et.al. 2410.13471 link
2024-10-17 Railway LiDAR semantic segmentation based on intelligent semi-automated data annotation Florian Wulff et.al. 2410.13383 null
2024-10-17 LESS: Label-Efficient and Single-Stage Referring 3D Segmentation Xuexun Liu et.al. 2410.13294 null
2024-10-17 Adversarial Neural Networks in Medical Imaging Advancements and Challenges in Semantic Segmentation Houze Liu et.al. 2410.13099 null
2024-10-16 Task Consistent Prototype Learning for Incremental Few-shot Semantic Segmentation Wenbo Xu et.al. 2410.13094 null
2024-10-16 Configurable Embodied Data Generation for Class-Agnostic RGB-D Video Segmentation Anthony Opipari et.al. 2410.12995 null
2024-10-16 Risk Assessment for Autonomous Landing in Urban Environments using Semantic Segmentation Jesús Alejandro Loera-Ponce et.al. 2410.12988 null
2024-10-16 VividMed: Vision Language Model with Versatile Visual Grounding for Medicine Lingxiao Luo et.al. 2410.12694 null
2024-10-16 Cascade learning in multi-task encoder-decoder networks for concurrent bone segmentation and glenohumeral joint assessment in shoulder CT scans Luca Marsilio et.al. 2410.12641 null
2024-10-16 Order-Aware Interactive Segmentation Bin Wang et.al. 2410.12214 null
2024-10-16 SAM-Guided Masked Token Prediction for 3D Scene Understanding Zhimin Chen et.al. 2410.12158 null
2024-10-15 WeatherDG: LLM-assisted Procedural Weather Generation for Domain-Generalized Semantic Segmentation Chenghao Qian et.al. 2410.12075 null
2024-10-15 Development and Testing of a Wood Panels Bark Removal Equipment Based on Deep Learning Rijun Wang et.al. 2410.11913 null
2024-10-15 Fractal Calibration for long-tailed object detection Konstantinos Panagiotis Alexandridis et.al. 2410.11774 null
2024-10-15 RClicks: Realistic Click Simulation for Benchmarking Interactive Segmentation Anton Antonov et.al. 2410.11722 link
2024-10-15 InvSeg: Test-Time Prompt Inversion for Semantic Segmentation Jiayi Lin et.al. 2410.11473 null
2024-10-15 MANet: Fine-Tuning Segment Anything Model for Multimodal Remote Sensing Semantic Segmentation Xianping Ma et.al. 2410.11160 link
2024-10-14 Locality Alignment Improves Vision-Language Models Ian Covert et.al. 2410.11087 null
2024-10-14 Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes Tim Broedermann et.al. 2410.10791 null
2024-10-14 UniMatch V2: Pushing the Limit of Semi-Supervised Semantic Segmentation Lihe Yang et.al. 2410.10777 link
2024-10-14 PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion Runsong Zhu et.al. 2410.10659 link
2024-10-14 Exploiting Local Features and Range Images for Small Data Real-Time Point Cloud Semantic Segmentation Daniel Fusaro et.al. 2410.10510 link
2024-10-14 LKASeg:Remote-Sensing Image Semantic Segmentation with Large Kernel Attention and Full-Scale Skip Connections Xuezhi Xiang et.al. 2410.10433 null
2024-10-14 V2M: Visual 2-Dimensional Mamba for Image Representation Learning Chengkun Wang et.al. 2410.10382 link
2024-10-14 GlobalMamba: Global Image Serialization for Vision Mamba Chengkun Wang et.al. 2410.10316 link
2024-10-13 UnSeg: One Universal Unlearnable Example Generator is Enough against All Image Segmentation Ye Sun et.al. 2410.09909 null
2024-10-13 AM-SAM: Automated Prompting and Mask Calibration for Segment Anything Model Yuchen Li et.al. 2410.09714 null
2024-10-12 An Expeditious Spatial Mean Radiant Temperature Mapping Framework using Visual SLAM and Semantic Segmentation Wei Liang et.al. 2410.09443 null
2024-10-11 Parallel Watershed Partitioning: GPU-Based Hierarchical Image Segmentation Varduhi Yeghiazaryan et.al. 2410.08946 null
2024-10-11 Uncertainty Estimation and Out-of-Distribution Detection for LiDAR Scene Semantic Segmentation Hanieh Shojaei et.al. 2410.08687 null
2024-10-11 DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention Nguyen Huu Bao Long et.al. 2410.08582 link
2024-10-10 Are We Ready for Real-Time LiDAR Semantic Segmentation in Autonomous Driving? Samir Abou Haidar et.al. 2410.08365 null
2024-10-10 Interactive4D: Interactive 4D LiDAR Segmentation Ilya Fradlin et.al. 2410.08206 null
2024-10-10 Distribution Guidance Network for Weakly Supervised Point Cloud Semantic Segmentation Zhiyi Pan et.al. 2410.08091 null
2024-10-10 Shift and matching queries for video semantic segmentation Tsubasa Mizuno et.al. 2410.07635 null
2024-10-10 3D Vision-Language Gaussian Splatting Qucheng Peng et.al. 2410.07577 null
2024-10-09 Segmenting objects with Bayesian fusion of active contour models and convnet priors Przemyslaw Polewski et.al. 2410.07421 null
2024-10-11 Bridge the Points: Graph-based Few-shot Segment Anything Semantically Anqi Zhang et.al. 2410.06964 null
2024-10-09 Learning from Spatio-temporal Correlation for Semi-Supervised LiDAR Semantic Segmentation Seungho Lee et.al. 2410.06893 null
2024-10-09 Rethinking the Evaluation of Visible and Infrared Image Fusion Dayan Guan et.al. 2410.06811 link
2024-10-10 QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model Fei Xie et.al. 2410.06806 link
2024-10-09 Transesophageal Echocardiography Generation using Anatomical Models Emmanuel Oladokun et.al. 2410.06781 null
2024-10-09 Evaluating the Impact of Point Cloud Colorization on Semantic Segmentation Accuracy Qinfeng Zhu et.al. 2410.06725 null
2024-10-09 Open-RGBT: Open-vocabulary RGB-T Zero-shot Semantic Segmentation in Open-world Environments Meng Yu et.al. 2410.06626 null
2024-10-09 Towards Natural Image Matting in the Wild via Real-Scenario Prior Ruihao Xia et.al. 2410.06593 link
2024-10-08 Adver-City: Open-Source Multi-Modal Dataset for Collaborative Perception Under Adverse Weather Conditions Mateus Karvat et.al. 2410.06380 null
2024-10-08 Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts Zhiwei Lin et.al. 2410.05963 null
2024-10-07 Low-Rank Continual Pyramid Vision Transformer: Incrementally Segment Whole-Body Organs in CT with Light-Weighted Adaptation Vince Zhu et.al. 2410.04689 null
2024-10-06 In-Place Panoptic Radiance Field Segmentation with Perceptual Prior for 3D Scene Understanding Shenghao Li et.al. 2410.04529 null
2024-10-05 ETHcavation: A Dataset and Pipeline for Panoptic Scene Understanding and Object Tracking in Dynamic Construction Environments Lorenzo Terenzi et.al. 2410.04250 null
2024-10-04 SpecSAR-Former: A Lightweight Transformer-based Network for Global LULC Mapping Using Integrated Sentinel-1 and Sentinel-2 Hao Yu et.al. 2410.03962 null
2024-10-04 Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features Benyuan Meng et.al. 2410.03558 link
2024-10-04 Semantic Segmentation Based Quality Control of Histopathology Whole Slide Images Abhijeet Patil et.al. 2410.03289 link
2024-10-04 HRVMamba: High-Resolution Visual State Space Model for Dense Prediction Hao Zhang et.al. 2410.03174 null
2024-10-03 HiFiSeg: High-Frequency Information Enhanced Polyp Segmentation with Global-Local Vision Transformer Jingjing Ren et.al. 2410.02528 null
2024-10-06 SynCo: Synthetic Hard Negatives in Contrastive Learning for Better Unsupervised Visual Representations Nikolaos Giakoumoglou et.al. 2410.02401 link
2024-10-04 Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation Muzhi Zhu et.al. 2410.02369 null
2024-10-03 ProtoSeg: A Prototype-Based Point Cloud Instance Segmentation Method Remco Royen et.al. 2410.02352 null
2024-10-03 RESSCAL3D++: Joint Acquisition and Semantic Segmentation of 3D Point Clouds Remco Royen et.al. 2410.02323 null
2024-10-03 Efficient Semantic Segmentation via Lightweight Multiple-Information Interaction Network Yangyang Qiu et.al. 2410.02224 null
2024-10-03 Adapting Segment Anything Model to Melanoma Segmentation in Microscopy Slide Images Qingyuan Liu et.al. 2410.02207 null
2024-10-02 SegEarth-OV: Towards Traning-Free Open-Vocabulary Segmentation for Remote Sensing Images Kaiyu Li et.al. 2410.01768 link
2024-10-02 One-Shot Robust Imitation Learning for Long-Horizon Visuomotor Tasks from Unsegmented Demonstrations Shaokang Wu et.al. 2410.01630 null
2024-10-02 Cognition Transferring and Decoupling for Text-supervised Egocentric Semantic Segmentation Zhaofeng Shi et.al. 2410.01341 null
2024-10-02 VectorGraphNET: Graph Attention Networks for Accurate Segmentation of Complex Technical Drawings Andrea Carrara et.al. 2410.01336 null
2024-10-01 RobustEMD: Domain Robust Matching for Cross-domain Few-shot Medical Image Segmentation Yazhou Zhu et.al. 2410.01110 null
2024-10-01 Semantic Segmentation of Unmanned Aerial Vehicle Remote Sensing Images using SegFormer Vlatko Spasev et.al. 2410.01092 null
2024-10-01 Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-Time Chiao-An Yang et.al. 2410.01083 link
2024-10-01 DeepAerialMapper: Deep Learning-based Semi-automatic HD Map Creation for Highly Automated Vehicles Robert Krajewski et.al. 2410.00769 null
2024-10-01 Optimizing Drug Delivery in Smart Pharmacies: A Novel Framework of Multi-Stage Grasping Network Combined with Adaptive Robotics Mechanism Rui Tang et.al. 2410.00753 null
2024-10-01 Can We Remove the Ground? Obstacle-aware Point Cloud Compression for Remote Object Detection Pengxi Zeng et.al. 2410.00582 null
2024-09-30 AUCSeg: AUC-oriented Pixel-level Long-tail Semantic Segmentation Boyu Han et.al. 2409.20398 null
2024-09-30 Leveraging CAM Algorithms for Explaining Medical Semantic Segmentation Tillmann Rheude et.al. 2409.20287 link
2024-09-30 Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model Fulong Ma et.al. 2409.20164 null
2024-09-30 Segmenting Wood Rot using Computer Vision Models Roland Kammerbauer et.al. 2409.20137 null
2024-09-30 Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels Heeseong Shin et.al. 2409.19846 null
2024-09-27 ProMerge: Prompt and Merge for Unsupervised Instance Segmentation Dylan Li et.al. 2409.18961 null
2024-09-27 Excavating in the Wild: The GOOSE-Ex Dataset for Semantic Segmentation Raphael Hagmanns et.al. 2409.18788 null
2024-09-27 Learning from Pattern Completion: Self-supervised Controllable Generation Zhiqiang Chen et.al. 2409.18694 link
2024-09-27 Reducing Semantic Ambiguity In Domain Adaptive Semantic Segmentation Via Probabilistic Prototypical Pixel Contrast Xiaoke Hao et.al. 2409.18543 link
2024-10-01 Get It For Free: Radar Segmentation without Expert Labels and Its Application in Odometry and Localization Siru Li et.al. 2409.18434 null
2024-09-27 Search3D: Hierarchical Open-Vocabulary 3D Segmentation Ayca Takmaz et.al. 2409.18431 null
2024-09-26 Efficient Microscopic Image Instance Segmentation for Food Crystal Quality Control Xiaoyu Ji et.al. 2409.18291 null
2024-09-26 Amodal Instance Segmentation with Diffusion Shape Prior Estimation Minh Tran et.al. 2409.18256 null
2024-09-26 Hierarchical End-to-End Autonomous Driving: Integrating BEV Perception with Deep Reinforcement Learning Siyi Lu et.al. 2409.17659 null
2024-09-26 Global-Local Medical SAM Adaptor Based on Full Adaption Meng Wang et.al. 2409.17486 null
2024-09-25 VL4AD: Vision-Language Models Improve Pixel-wise Anomaly Detection Liangyu Zhong et.al. 2409.17330 null
2024-09-25 2024 BRAVO Challenge Track 1 1st Place Report: Evaluating Robustness of Vision Foundation Models for Semantic Segmentation Tommie Kerssies et.al. 2409.17208 link
2024-09-25 WasteGAN: Data Augmentation for Robotic Waste Sorting through Generative Adversarial Networks Alberto Bacchin et.al. 2409.16999 link
2024-09-25 Going Beyond U-Net: Assessing Vision Transformers for Semantic Segmentation in Microscopy Image Analysis Illia Tsiporenko et.al. 2409.16940 null
2024-09-24 A novel open-source ultrasound dataset with deep learning benchmarks for spinal cord injury localization and anatomical segmentation Avisha Kumar et.al. 2409.16441 null
2024-09-24 Instance Segmentation of Reinforced Concrete Bridges with Synthetic Point Clouds Asad Ur Rahman et.al. 2409.16381 null
2024-09-24 Semantic Refocused Tuning for Open-Vocabulary Panoptic Segmentation Yong Xien Chng et.al. 2409.16278 null
2024-09-24 Fields of The World: A Machine Learning Benchmark Dataset For Global Agricultural Field Boundary Segmentation Hannah Kerner et.al. 2409.16252 link
2024-09-24 Deep Learning for Precision Agriculture: Post-Spraying Evaluation and Deposition Estimation Harry Rogers et.al. 2409.16213 link
2024-09-24 Potential Field as Scene Affordance for Behavior Change-Based Visual Risk Object Identification Pang-Yuan Pao et.al. 2409.15846 null
2024-09-24 Layer-wise Model Merging for Unsupervised Domain Adaptation in Segmentation Tasks Roberto Alcover-Couso et.al. 2409.15813 null
2024-09-24 DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation Soojin Jang et.al. 2409.15801 null
2024-09-24 Autonomous Hiking Trail Navigation via Semantic Segmentation and Geometric Analysis Camndon Reed et.al. 2409.15671 null
2024-09-23 Adapting Segment Anything Model for Unseen Object Instance Segmentation Rui Cao et.al. 2409.15481 null
2024-09-23 ZeroSCD: Zero-Shot Street Scene Change Detection Shyam Sundar Kannan et.al. 2409.15255 null
2024-09-23 Diffusion-based RGB-D Semantic Segmentation with Deformable Attention Transformer Minh Bui et.al. 2409.15117 null
2024-09-18 Applications of Knowledge Distillation in Remote Sensing: A Survey Yassine Himeur et.al. 2409.12111 null
2024-09-18 Panoptic-Depth Forecasting Juana Valeria Hurtado et.al. 2409.12008 null
2024-09-18 Particle-based Instance-aware Semantic Occupancy Mapping in Dynamic Environments Gang Chen et.al. 2409.11975 null
2024-09-17 Uncertainty and Prediction Quality Estimation for Semantic Segmentation via Graph Neural Networks Edgar Heinert et.al. 2409.11373 null
2024-09-17 MSDNet: Multi-Scale Decoder for Few-Shot Semantic Segmentation via Transformer-Guided Prototyping Amirreza Fateh et.al. 2409.11316 link
2024-09-17 Generalized Few-Shot Semantic Segmentation in Remote Sensing: Challenge and Benchmark Clifford Broni-Bediako et.al. 2409.11227 link
2024-09-17 HS3-Bench: A Benchmark and Strong Baseline for Hyperspectral Semantic Segmentation in Driving Scenarios Nick Theisen et.al. 2409.11205 link
2024-09-16 Are Deep Learning Models Robust to Partial Object Occlusion in Visual Recognition Tasks? Kaleb Kassaw et.al. 2409.10775 null
2024-09-16 Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning Amin Karimi Monsefi et.al. 2409.10362 null
2024-09-16 BAFNet: Bilateral Attention Fusion Network for Lightweight Semantic Segmentation of Urban Remote Sensing Images Wentao Wang et.al. 2409.10269 null
2024-09-15 Semantic2D: A Semantic Dataset for 2D Lidar Semantic Segmentation Zhanteng Xie et.al. 2409.09899 null
2024-09-15 Resolving Inconsistent Semantics in Multi-Dataset Image Segmentation Qilong Zhangli et.al. 2409.09893 null
2024-09-15 High Definition Map Mapping and Update: A General Overview and Future Directions Benny Wijaya et.al. 2409.09726 null
2024-09-14 One missing piece in Vision and Language: A Survey on Comics Understanding Emanuele Vivoli et.al. 2409.09502 link
2024-09-14 Multi-Scale Grouped Prototypes for Interpretable Semantic Segmentation Hugo Porta et.al. 2409.09497 null
2024-09-14 LACOSTE: Exploiting stereo and temporal contexts for surgical instrument segmentation Qiyuan Wang et.al. 2409.09360 null
2024-09-16 QueryCAD: Grounded Question Answering for CAD Models Claudius Kienle et.al. 2409.08704 null
2024-09-13 AWF: Adaptive Weight Fusion for Enhanced Class Incremental Semantic Segmentation Zechao Sun et.al. 2409.08516 null
2024-09-13 VistaFormer: Scalable Vision Transformers for Satellite Image Time Series Segmentation Ezra MacDonald et.al. 2409.08461 link
2024-09-12 Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding Hongyu Li et.al. 2409.08251 null
2024-09-12 Bayesian Self-Training for Semi-Supervised 3D Segmentation Ozan Unal et.al. 2409.08102 null
2024-09-12 Depth Matters: Exploring Deep Interactions of RGB-D for Semantic Segmentation in Traffic Scenes Siyu Chen et.al. 2409.07995 null
**2