diff --git a/README.md b/README.md index 178d0d0d0a0..7a3dcbce706 100644 --- a/README.md +++ b/README.md @@ -1,10 +1,44 @@ # arxiv-daily - Automated deployment @ 2024-10-22 20:33:19 Asia/Shanghai + Automated deployment @ 2024-10-22 21:03:35 Asia/Shanghai > Welcome to contribute! Add your topics and keywords in [`topic.yml`](https://github.com/beiyuouo/arxiv-daily/blob/main/database/topic.yml). > You can also view historical data through the [storage](https://github.com/beiyuouo/arxiv-daily/blob/main/database/storage). ## 3D Vision +### Point Cloud Registration +|Publish Date|Title|Authors|PDF|Code| +| :---: | :---: | :---: | :---: | :---: | +|**2024-10-21**|**The period-luminosity relation of long-period variables in the Large Magellanic Cloud observed with ATLAS**|Daniel Hey et.al.|[2410.16273v1](http://arxiv.org/abs/2410.16273v1)|null| +|**2024-10-21**|**SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree**|Shuangrui Ding et.al.|[2410.16268v1](http://arxiv.org/abs/2410.16268v1)|[link](https://github.com/mark12ding/sam2long)| +|**2024-10-21**|**Hyperbolicity in scalar-Gauss-Bonnet gravity: a gauge invariant study for spherical evolution**|Farid Thaalba et.al.|[2410.16264v1](http://arxiv.org/abs/2410.16264v1)|null| +|**2024-10-21**|**Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos**|Gengshan Yang et.al.|[2410.16259v1](http://arxiv.org/abs/2410.16259v1)|null| +|**2024-10-21**|**Nonlinear stability of extremal Reissner-Nordström black holes in spherical symmetry**|Yannis Angelopoulos et.al.|[2410.16234v1](http://arxiv.org/abs/2410.16234v1)|null| +|**2024-10-21**|**Managing Bandwidth: The Key to Cloud-Assisted Autonomous Driving**|Alexander Krentsel et.al.|[2410.16227v1](http://arxiv.org/abs/2410.16227v1)|null| +|**2024-10-21**|**IR finite correlation functions in de Sitter space, a smooth massless limit, and an autonomous equation**|Alexander Kamenshchik et.al.|[2410.16226v1](http://arxiv.org/abs/2410.16226v1)|null| +|**2024-10-21**|**PuLsE: Accurate and Robust Ultrasound-based Continuous Heart-Rate Monitoring on a Wrist-Worn IoT Device**|Marco Giordano et.al.|[2410.16219v1](http://arxiv.org/abs/2410.16219v1)|null| +|**2024-10-21**|**Locating the QCD critical point from first principles through contours of constant entropy density**|Hitansh Shah et.al.|[2410.16206v1](http://arxiv.org/abs/2410.16206v1)|null| +|**2024-10-21**|**Deformation of Matrix Geometry via Landau Level Evolution**|Kazuki Hasebe et.al.|[2410.16193v1](http://arxiv.org/abs/2410.16193v1)|null| +|**2024-10-21**|**Quantum Algorithms for Non-smooth Non-convex Optimization**|Chengchang Liu et.al.|[2410.16189v1](http://arxiv.org/abs/2410.16189v1)|null| +|**2024-10-21**|**Reconciling the kinematical constraint with the JIMWLK evolution equation: correlation functions non-local in rapidity**|Piotr Korcyl et.al.|[2410.16180v1](http://arxiv.org/abs/2410.16180v1)|null| +|**2024-10-21**|**AdChain: Decentralized Header Bidding**|Behkish Nassirzadeh et.al.|[2410.16141v1](http://arxiv.org/abs/2410.16141v1)|null| +|**2024-10-21**|**SMART: Self-learning Meta-strategy Agent for Reasoning Tasks**|Rongxing Liu et.al.|[2410.16128v1](http://arxiv.org/abs/2410.16128v1)|[link](https://github.com/kumar-shridhar/smart)| +|**2024-10-21**|**Ultra-High-Energy Gamma-Ray Bubble around Microquasar V4641 Sgr**|R. Alfaro et.al.|[2410.16117v1](http://arxiv.org/abs/2410.16117v1)|null| +|**2024-10-21**|**Dynamic Biases of Static Panel Data Estimators**|Sylvia Klosin et.al.|[2410.16112v1](http://arxiv.org/abs/2410.16112v1)|null| +|**2024-10-21**|**In-situ observations of the three-dimensional energy cascade rate and Yaglom flux in the Earth's magnetosheath**|Francesco Pecora et.al.|[2410.16099v1](http://arxiv.org/abs/2410.16099v1)|null| +|**2024-10-21**|**Holographic Weyl Anomaly in 8d from General Higher Curvature Gravity**|Fei-Yu Chen et.al.|[2410.16097v1](http://arxiv.org/abs/2410.16097v1)|null| +|**2024-10-21**|**Final Report for CHESS: Cloud, High-Performance Computing, and Edge for Science and Security**|Nathan Tallent et.al.|[2410.16093v1](http://arxiv.org/abs/2410.16093v1)|null| +|**2024-10-21**|**The $Λ_c^+\toηπ^+Λ$ reaction and the $Λ a_0^+(980)$ and $π^+Λ(1670)$ contributions**|Man-Yu Duan et.al.|[2410.16078v1](http://arxiv.org/abs/2410.16078v1)|null| +|**2024-10-21**|**Stochastic Exploration of Real Varieties via Variety Distributions**|David Kahle et.al.|[2410.16071v1](http://arxiv.org/abs/2410.16071v1)|null| +|**2024-10-21**|**Hubble Space Telescope Observations of Nearby Type 1 Quasars. I. Characterisation of the Extended [O III] 5007Å Emission**|Anna Trindade Falcão et.al.|[2410.16053v1](http://arxiv.org/abs/2410.16053v1)|null| +|**2024-10-21**|**Near-Optimal Algorithm for Non-Stationary Kernelized Bandits**|Shogo Iwazaki et.al.|[2410.16052v1](http://arxiv.org/abs/2410.16052v1)|null| +|**2024-10-21**|**Energy-Momentum tensor correlators in $φ^4$ theory I: The spin-zero sector**|Nikos Irges et.al.|[2410.16040v1](http://arxiv.org/abs/2410.16040v1)|null| +|**2024-10-21**|**Local well-posedness and blow-up in the energy space for the 2D NLS with point interaction**|Luigi Forcella et.al.|[2410.16039v1](http://arxiv.org/abs/2410.16039v1)|null| +|**2024-10-21**|**HyperDrive: Scheduling Serverless Functions in the Edge-Cloud-Space 3D Continuum**|Thomas Pusztai et.al.|[2410.16026v1](http://arxiv.org/abs/2410.16026v1)|null| +|**2024-10-21**|**Coexistent Topological and Chiral Phonons in Chiral RhGe: An ab initio study**|P. V. Sreenivasa Reddy et.al.|[2410.16000v1](http://arxiv.org/abs/2410.16000v1)|null| +|**2024-10-21**|**The essential m-dissipativity for degenerate infinite dimensional stochastic Hamiltonian systems and applications**|Benedikt Eisenhuth et.al.|[2410.15993v1](http://arxiv.org/abs/2410.15993v1)|null| +|**2024-10-21**|**Interaction of the Prominence Plasma within the Magnetic Cloud of an ICME with the Earth's Bow Shock**|Hadi Madanian et.al.|[2410.15989v1](http://arxiv.org/abs/2410.15989v1)|null| +|**2024-10-21**|**A Critical Drift-Diffusion Equation: Connections to the Diffusion on $\textbf{SL}(2)$**|Peter Morfe et.al.|[2410.15983v1](http://arxiv.org/abs/2410.15983v1)|null| + ### Point Cloud Completion |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | @@ -39,39 +73,39 @@ |**2024-10-21**|**Final Report for CHESS: Cloud, High-Performance Computing, and Edge for Science and Security**|Nathan Tallent et.al.|[2410.16093v1](http://arxiv.org/abs/2410.16093v1)|null| |**2024-10-21**|**The $Λ_c^+\toηπ^+Λ$ reaction and the $Λ a_0^+(980)$ and $π^+Λ(1670)$ contributions**|Man-Yu Duan et.al.|[2410.16078v1](http://arxiv.org/abs/2410.16078v1)|null| -### 3D Object Tracking +### Point Cloud |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | |**2024-10-21**|**The period-luminosity relation of long-period variables in the Large Magellanic Cloud observed with ATLAS**|Daniel Hey et.al.|[2410.16273v1](http://arxiv.org/abs/2410.16273v1)|null| -|**2024-10-21**|**Cosmic Shimmering: the Gravitational Wave Signal of Time-Resolved Cosmic Shear Observations**|Giorgio Mentasti et.al.|[2410.16274v1](http://arxiv.org/abs/2410.16274v1)|null| -|**2024-10-21**|**FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors**|Chin-Yang Lin et.al.|[2410.16271v1](http://arxiv.org/abs/2410.16271v1)|null| -|**2024-10-21**|**MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors**|Honghua Chen et.al.|[2410.16272v1](http://arxiv.org/abs/2410.16272v1)|null| |**2024-10-21**|**SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree**|Shuangrui Ding et.al.|[2410.16268v1](http://arxiv.org/abs/2410.16268v1)|[link](https://github.com/mark12ding/sam2long)| -|**2024-10-21**|**3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors**|Xi Liu et.al.|[2410.16266v1](http://arxiv.org/abs/2410.16266v1)|null| |**2024-10-21**|**Hyperbolicity in scalar-Gauss-Bonnet gravity: a gauge invariant study for spherical evolution**|Farid Thaalba et.al.|[2410.16264v1](http://arxiv.org/abs/2410.16264v1)|null| -|**2024-10-21**|**Characterizing the Effect of Electrode Shift & Sensor Reapplication on Common sEMG Features in Lower Limb Muscles**|Fraser Douglas et.al.|[2410.16262v1](http://arxiv.org/abs/2410.16262v1)|null| |**2024-10-21**|**Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos**|Gengshan Yang et.al.|[2410.16259v1](http://arxiv.org/abs/2410.16259v1)|null| -|**2024-10-21**|**Revisiting Deep Feature Reconstruction for Logical and Structural Industrial Anomaly Detection**|Sukanya Patra et.al.|[2410.16255v1](http://arxiv.org/abs/2410.16255v1)|[link](https://github.com/sukanyapatra1997/ulsad-2024)| -|**2024-10-21**|**Composing Optimized Stepsize Schedules for Gradient Descent**|Benjamin Grimmer et.al.|[2410.16249v1](http://arxiv.org/abs/2410.16249v1)|null| -|**2024-10-21**|**Hyperparameter Optimisation in Deep Learning from Ensemble Methods: Applications to Proton Structure**|Juan Cruz-Martinez et.al.|[2410.16248v1](http://arxiv.org/abs/2410.16248v1)|null| -|**2024-10-21**|**About maximal antichains in a product of two chains:A catch-all note**|Denis Bouyssou et.al.|[2410.16243v1](http://arxiv.org/abs/2410.16243v1)|null| -|**2024-10-21**|**Deep Radiomics Detection of Clinically Significant Prostate Cancer on Multicenter MRI: Initial Comparison to PI-RADS Assessment**|G. A. Nketiah et.al.|[2410.16238v1](http://arxiv.org/abs/2410.16238v1)|null| -|**2024-10-21**|**IBGP: Imperfect Byzantine Generals Problem for Zero-Shot Robustness in Communicative Multi-Agent Systems**|Yihuan Mao et.al.|[2410.16237v1](http://arxiv.org/abs/2410.16237v1)|null| +|**2024-10-21**|**Nonlinear stability of extremal Reissner-Nordström black holes in spherical symmetry**|Yannis Angelopoulos et.al.|[2410.16234v1](http://arxiv.org/abs/2410.16234v1)|null| |**2024-10-21**|**Managing Bandwidth: The Key to Cloud-Assisted Autonomous Driving**|Alexander Krentsel et.al.|[2410.16227v1](http://arxiv.org/abs/2410.16227v1)|null| -|**2024-10-21**|**CiteClick: A Browser Extension for Real-Time Scholar Citation Tracking**|Nishat Raihan et.al.|[2410.16211v1](http://arxiv.org/abs/2410.16211v1)|null| -|**2024-10-21**|**Comparative analysis of 3D-CNN models, GARCH-ANN, and VAR models for determining equity prices**|Sydney Anuyah Mary Akinyemi et.al.|[2410.16205v1](http://arxiv.org/abs/2410.16205v1)|null| +|**2024-10-21**|**IR finite correlation functions in de Sitter space, a smooth massless limit, and an autonomous equation**|Alexander Kamenshchik et.al.|[2410.16226v1](http://arxiv.org/abs/2410.16226v1)|null| +|**2024-10-21**|**PuLsE: Accurate and Robust Ultrasound-based Continuous Heart-Rate Monitoring on a Wrist-Worn IoT Device**|Marco Giordano et.al.|[2410.16219v1](http://arxiv.org/abs/2410.16219v1)|null| +|**2024-10-21**|**Locating the QCD critical point from first principles through contours of constant entropy density**|Hitansh Shah et.al.|[2410.16206v1](http://arxiv.org/abs/2410.16206v1)|null| +|**2024-10-21**|**Deformation of Matrix Geometry via Landau Level Evolution**|Kazuki Hasebe et.al.|[2410.16193v1](http://arxiv.org/abs/2410.16193v1)|null| |**2024-10-21**|**Quantum Algorithms for Non-smooth Non-convex Optimization**|Chengchang Liu et.al.|[2410.16189v1](http://arxiv.org/abs/2410.16189v1)|null| -|**2024-10-21**|**Computing Inverses of Stieltjes Transforms of Probability Measures**|James Chen et.al.|[2410.16178v1](http://arxiv.org/abs/2410.16178v1)|null| -|**2024-10-21**|**The Interplay Between Physical Activity, Protein Consumption, and Sleep Quality in Muscle Protein Synthesis**|Ayush Devkota et.al.|[2410.16169v1](http://arxiv.org/abs/2410.16169v1)|null| -|**2024-10-21**|**Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models**|Yufei Zhan et.al.|[2410.16163v1](http://arxiv.org/abs/2410.16163v1)|[link](https://github.com/jefferyzhan/griffon)| -|**2024-10-21**|**PODTILE: Facilitating Podcast Episode Browsing with Auto-generated Chapters**|Azin Ghazimatin et.al.|[2410.16148v1](http://arxiv.org/abs/2410.16148v1)|null| -|**2024-10-21**|**Can Large Audio-Language Models Truly Hear? Tackling Hallucinations with Multi-Task Assessment and Stepwise Audio Reasoning**|Chun-Yi Kuan et.al.|[2410.16130v1](http://arxiv.org/abs/2410.16130v1)|null| -|**2024-10-21**|**Simulating quantum emitters in arbitrary photonic environments using FDTD: beyond the semi-classical regime**|Qingyi Zhou et.al.|[2410.16118v1](http://arxiv.org/abs/2410.16118v1)|null| -|**2024-10-21**|**DUMBO: Making durable read-only transactions fly on hardware transactional memory**|João Barreto et.al.|[2410.16110v1](http://arxiv.org/abs/2410.16110v1)|null| -|**2024-10-21**|**Semantics of Sets of Programs**|Jinwoo Kim et.al.|[2410.16102v1](http://arxiv.org/abs/2410.16102v1)|null| -|**2024-10-21**|**Defending Against Attack on the Cloned: In-Band Active Man-in-the-Middle Detection for the Signal Protocol**|Wil Liam Teng et.al.|[2410.16098v1](http://arxiv.org/abs/2410.16098v1)|null| +|**2024-10-21**|**Reconciling the kinematical constraint with the JIMWLK evolution equation: correlation functions non-local in rapidity**|Piotr Korcyl et.al.|[2410.16180v1](http://arxiv.org/abs/2410.16180v1)|null| +|**2024-10-21**|**AdChain: Decentralized Header Bidding**|Behkish Nassirzadeh et.al.|[2410.16141v1](http://arxiv.org/abs/2410.16141v1)|null| +|**2024-10-21**|**SMART: Self-learning Meta-strategy Agent for Reasoning Tasks**|Rongxing Liu et.al.|[2410.16128v1](http://arxiv.org/abs/2410.16128v1)|[link](https://github.com/kumar-shridhar/smart)| +|**2024-10-21**|**Ultra-High-Energy Gamma-Ray Bubble around Microquasar V4641 Sgr**|R. Alfaro et.al.|[2410.16117v1](http://arxiv.org/abs/2410.16117v1)|null| +|**2024-10-21**|**Dynamic Biases of Static Panel Data Estimators**|Sylvia Klosin et.al.|[2410.16112v1](http://arxiv.org/abs/2410.16112v1)|null| +|**2024-10-21**|**In-situ observations of the three-dimensional energy cascade rate and Yaglom flux in the Earth's magnetosheath**|Francesco Pecora et.al.|[2410.16099v1](http://arxiv.org/abs/2410.16099v1)|null| +|**2024-10-21**|**Holographic Weyl Anomaly in 8d from General Higher Curvature Gravity**|Fei-Yu Chen et.al.|[2410.16097v1](http://arxiv.org/abs/2410.16097v1)|null| |**2024-10-21**|**Final Report for CHESS: Cloud, High-Performance Computing, and Edge for Science and Security**|Nathan Tallent et.al.|[2410.16093v1](http://arxiv.org/abs/2410.16093v1)|null| -|**2024-10-21**|**Multi-Sensor Fusion for UAV Classification Based on Feature Maps of Image and Radar Data**|Nikos Sakellariou et.al.|[2410.16089v1](http://arxiv.org/abs/2410.16089v1)|null| +|**2024-10-21**|**The $Λ_c^+\toηπ^+Λ$ reaction and the $Λ a_0^+(980)$ and $π^+Λ(1670)$ contributions**|Man-Yu Duan et.al.|[2410.16078v1](http://arxiv.org/abs/2410.16078v1)|null| +|**2024-10-21**|**Stochastic Exploration of Real Varieties via Variety Distributions**|David Kahle et.al.|[2410.16071v1](http://arxiv.org/abs/2410.16071v1)|null| +|**2024-10-21**|**Hubble Space Telescope Observations of Nearby Type 1 Quasars. I. Characterisation of the Extended [O III] 5007Å Emission**|Anna Trindade Falcão et.al.|[2410.16053v1](http://arxiv.org/abs/2410.16053v1)|null| +|**2024-10-21**|**Near-Optimal Algorithm for Non-Stationary Kernelized Bandits**|Shogo Iwazaki et.al.|[2410.16052v1](http://arxiv.org/abs/2410.16052v1)|null| +|**2024-10-21**|**Energy-Momentum tensor correlators in $φ^4$ theory I: The spin-zero sector**|Nikos Irges et.al.|[2410.16040v1](http://arxiv.org/abs/2410.16040v1)|null| +|**2024-10-21**|**Local well-posedness and blow-up in the energy space for the 2D NLS with point interaction**|Luigi Forcella et.al.|[2410.16039v1](http://arxiv.org/abs/2410.16039v1)|null| +|**2024-10-21**|**HyperDrive: Scheduling Serverless Functions in the Edge-Cloud-Space 3D Continuum**|Thomas Pusztai et.al.|[2410.16026v1](http://arxiv.org/abs/2410.16026v1)|null| +|**2024-10-21**|**Coexistent Topological and Chiral Phonons in Chiral RhGe: An ab initio study**|P. V. Sreenivasa Reddy et.al.|[2410.16000v1](http://arxiv.org/abs/2410.16000v1)|null| +|**2024-10-21**|**The essential m-dissipativity for degenerate infinite dimensional stochastic Hamiltonian systems and applications**|Benedikt Eisenhuth et.al.|[2410.15993v1](http://arxiv.org/abs/2410.15993v1)|null| +|**2024-10-21**|**Interaction of the Prominence Plasma within the Magnetic Cloud of an ICME with the Earth's Bow Shock**|Hadi Madanian et.al.|[2410.15989v1](http://arxiv.org/abs/2410.15989v1)|null| +|**2024-10-21**|**A Critical Drift-Diffusion Equation: Connections to the Diffusion on $\textbf{SL}(2)$**|Peter Morfe et.al.|[2410.15983v1](http://arxiv.org/abs/2410.15983v1)|null| ### 3D Object Detection |Publish Date|Title|Authors|PDF|Code| @@ -141,39 +175,39 @@ |**2024-10-21**|**Energy-Momentum tensor correlators in $φ^4$ theory I: The spin-zero sector**|Nikos Irges et.al.|[2410.16040v1](http://arxiv.org/abs/2410.16040v1)|null| |**2024-10-21**|**Local well-posedness and blow-up in the energy space for the 2D NLS with point interaction**|Luigi Forcella et.al.|[2410.16039v1](http://arxiv.org/abs/2410.16039v1)|null| -### Point Cloud +### 3D Object Tracking |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | |**2024-10-21**|**The period-luminosity relation of long-period variables in the Large Magellanic Cloud observed with ATLAS**|Daniel Hey et.al.|[2410.16273v1](http://arxiv.org/abs/2410.16273v1)|null| +|**2024-10-21**|**Cosmic Shimmering: the Gravitational Wave Signal of Time-Resolved Cosmic Shear Observations**|Giorgio Mentasti et.al.|[2410.16274v1](http://arxiv.org/abs/2410.16274v1)|null| +|**2024-10-21**|**FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors**|Chin-Yang Lin et.al.|[2410.16271v1](http://arxiv.org/abs/2410.16271v1)|null| +|**2024-10-21**|**MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors**|Honghua Chen et.al.|[2410.16272v1](http://arxiv.org/abs/2410.16272v1)|null| |**2024-10-21**|**SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree**|Shuangrui Ding et.al.|[2410.16268v1](http://arxiv.org/abs/2410.16268v1)|[link](https://github.com/mark12ding/sam2long)| +|**2024-10-21**|**3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors**|Xi Liu et.al.|[2410.16266v1](http://arxiv.org/abs/2410.16266v1)|null| |**2024-10-21**|**Hyperbolicity in scalar-Gauss-Bonnet gravity: a gauge invariant study for spherical evolution**|Farid Thaalba et.al.|[2410.16264v1](http://arxiv.org/abs/2410.16264v1)|null| +|**2024-10-21**|**Characterizing the Effect of Electrode Shift & Sensor Reapplication on Common sEMG Features in Lower Limb Muscles**|Fraser Douglas et.al.|[2410.16262v1](http://arxiv.org/abs/2410.16262v1)|null| |**2024-10-21**|**Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos**|Gengshan Yang et.al.|[2410.16259v1](http://arxiv.org/abs/2410.16259v1)|null| -|**2024-10-21**|**Nonlinear stability of extremal Reissner-Nordström black holes in spherical symmetry**|Yannis Angelopoulos et.al.|[2410.16234v1](http://arxiv.org/abs/2410.16234v1)|null| +|**2024-10-21**|**Revisiting Deep Feature Reconstruction for Logical and Structural Industrial Anomaly Detection**|Sukanya Patra et.al.|[2410.16255v1](http://arxiv.org/abs/2410.16255v1)|[link](https://github.com/sukanyapatra1997/ulsad-2024)| +|**2024-10-21**|**Composing Optimized Stepsize Schedules for Gradient Descent**|Benjamin Grimmer et.al.|[2410.16249v1](http://arxiv.org/abs/2410.16249v1)|null| +|**2024-10-21**|**Hyperparameter Optimisation in Deep Learning from Ensemble Methods: Applications to Proton Structure**|Juan Cruz-Martinez et.al.|[2410.16248v1](http://arxiv.org/abs/2410.16248v1)|null| +|**2024-10-21**|**About maximal antichains in a product of two chains:A catch-all note**|Denis Bouyssou et.al.|[2410.16243v1](http://arxiv.org/abs/2410.16243v1)|null| +|**2024-10-21**|**Deep Radiomics Detection of Clinically Significant Prostate Cancer on Multicenter MRI: Initial Comparison to PI-RADS Assessment**|G. A. Nketiah et.al.|[2410.16238v1](http://arxiv.org/abs/2410.16238v1)|null| +|**2024-10-21**|**IBGP: Imperfect Byzantine Generals Problem for Zero-Shot Robustness in Communicative Multi-Agent Systems**|Yihuan Mao et.al.|[2410.16237v1](http://arxiv.org/abs/2410.16237v1)|null| |**2024-10-21**|**Managing Bandwidth: The Key to Cloud-Assisted Autonomous Driving**|Alexander Krentsel et.al.|[2410.16227v1](http://arxiv.org/abs/2410.16227v1)|null| -|**2024-10-21**|**IR finite correlation functions in de Sitter space, a smooth massless limit, and an autonomous equation**|Alexander Kamenshchik et.al.|[2410.16226v1](http://arxiv.org/abs/2410.16226v1)|null| -|**2024-10-21**|**PuLsE: Accurate and Robust Ultrasound-based Continuous Heart-Rate Monitoring on a Wrist-Worn IoT Device**|Marco Giordano et.al.|[2410.16219v1](http://arxiv.org/abs/2410.16219v1)|null| -|**2024-10-21**|**Locating the QCD critical point from first principles through contours of constant entropy density**|Hitansh Shah et.al.|[2410.16206v1](http://arxiv.org/abs/2410.16206v1)|null| -|**2024-10-21**|**Deformation of Matrix Geometry via Landau Level Evolution**|Kazuki Hasebe et.al.|[2410.16193v1](http://arxiv.org/abs/2410.16193v1)|null| +|**2024-10-21**|**CiteClick: A Browser Extension for Real-Time Scholar Citation Tracking**|Nishat Raihan et.al.|[2410.16211v1](http://arxiv.org/abs/2410.16211v1)|null| +|**2024-10-21**|**Comparative analysis of 3D-CNN models, GARCH-ANN, and VAR models for determining equity prices**|Sydney Anuyah Mary Akinyemi et.al.|[2410.16205v1](http://arxiv.org/abs/2410.16205v1)|null| |**2024-10-21**|**Quantum Algorithms for Non-smooth Non-convex Optimization**|Chengchang Liu et.al.|[2410.16189v1](http://arxiv.org/abs/2410.16189v1)|null| -|**2024-10-21**|**Reconciling the kinematical constraint with the JIMWLK evolution equation: correlation functions non-local in rapidity**|Piotr Korcyl et.al.|[2410.16180v1](http://arxiv.org/abs/2410.16180v1)|null| -|**2024-10-21**|**AdChain: Decentralized Header Bidding**|Behkish Nassirzadeh et.al.|[2410.16141v1](http://arxiv.org/abs/2410.16141v1)|null| -|**2024-10-21**|**SMART: Self-learning Meta-strategy Agent for Reasoning Tasks**|Rongxing Liu et.al.|[2410.16128v1](http://arxiv.org/abs/2410.16128v1)|[link](https://github.com/kumar-shridhar/smart)| -|**2024-10-21**|**Ultra-High-Energy Gamma-Ray Bubble around Microquasar V4641 Sgr**|R. Alfaro et.al.|[2410.16117v1](http://arxiv.org/abs/2410.16117v1)|null| -|**2024-10-21**|**Dynamic Biases of Static Panel Data Estimators**|Sylvia Klosin et.al.|[2410.16112v1](http://arxiv.org/abs/2410.16112v1)|null| -|**2024-10-21**|**In-situ observations of the three-dimensional energy cascade rate and Yaglom flux in the Earth's magnetosheath**|Francesco Pecora et.al.|[2410.16099v1](http://arxiv.org/abs/2410.16099v1)|null| -|**2024-10-21**|**Holographic Weyl Anomaly in 8d from General Higher Curvature Gravity**|Fei-Yu Chen et.al.|[2410.16097v1](http://arxiv.org/abs/2410.16097v1)|null| +|**2024-10-21**|**Computing Inverses of Stieltjes Transforms of Probability Measures**|James Chen et.al.|[2410.16178v1](http://arxiv.org/abs/2410.16178v1)|null| +|**2024-10-21**|**The Interplay Between Physical Activity, Protein Consumption, and Sleep Quality in Muscle Protein Synthesis**|Ayush Devkota et.al.|[2410.16169v1](http://arxiv.org/abs/2410.16169v1)|null| +|**2024-10-21**|**Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models**|Yufei Zhan et.al.|[2410.16163v1](http://arxiv.org/abs/2410.16163v1)|[link](https://github.com/jefferyzhan/griffon)| +|**2024-10-21**|**PODTILE: Facilitating Podcast Episode Browsing with Auto-generated Chapters**|Azin Ghazimatin et.al.|[2410.16148v1](http://arxiv.org/abs/2410.16148v1)|null| +|**2024-10-21**|**Can Large Audio-Language Models Truly Hear? Tackling Hallucinations with Multi-Task Assessment and Stepwise Audio Reasoning**|Chun-Yi Kuan et.al.|[2410.16130v1](http://arxiv.org/abs/2410.16130v1)|null| +|**2024-10-21**|**Simulating quantum emitters in arbitrary photonic environments using FDTD: beyond the semi-classical regime**|Qingyi Zhou et.al.|[2410.16118v1](http://arxiv.org/abs/2410.16118v1)|null| +|**2024-10-21**|**DUMBO: Making durable read-only transactions fly on hardware transactional memory**|João Barreto et.al.|[2410.16110v1](http://arxiv.org/abs/2410.16110v1)|null| +|**2024-10-21**|**Semantics of Sets of Programs**|Jinwoo Kim et.al.|[2410.16102v1](http://arxiv.org/abs/2410.16102v1)|null| +|**2024-10-21**|**Defending Against Attack on the Cloned: In-Band Active Man-in-the-Middle Detection for the Signal Protocol**|Wil Liam Teng et.al.|[2410.16098v1](http://arxiv.org/abs/2410.16098v1)|null| |**2024-10-21**|**Final Report for CHESS: Cloud, High-Performance Computing, and Edge for Science and Security**|Nathan Tallent et.al.|[2410.16093v1](http://arxiv.org/abs/2410.16093v1)|null| -|**2024-10-21**|**The $Λ_c^+\toηπ^+Λ$ reaction and the $Λ a_0^+(980)$ and $π^+Λ(1670)$ contributions**|Man-Yu Duan et.al.|[2410.16078v1](http://arxiv.org/abs/2410.16078v1)|null| -|**2024-10-21**|**Stochastic Exploration of Real Varieties via Variety Distributions**|David Kahle et.al.|[2410.16071v1](http://arxiv.org/abs/2410.16071v1)|null| -|**2024-10-21**|**Hubble Space Telescope Observations of Nearby Type 1 Quasars. I. Characterisation of the Extended [O III] 5007Å Emission**|Anna Trindade Falcão et.al.|[2410.16053v1](http://arxiv.org/abs/2410.16053v1)|null| -|**2024-10-21**|**Near-Optimal Algorithm for Non-Stationary Kernelized Bandits**|Shogo Iwazaki et.al.|[2410.16052v1](http://arxiv.org/abs/2410.16052v1)|null| -|**2024-10-21**|**Energy-Momentum tensor correlators in $φ^4$ theory I: The spin-zero sector**|Nikos Irges et.al.|[2410.16040v1](http://arxiv.org/abs/2410.16040v1)|null| -|**2024-10-21**|**Local well-posedness and blow-up in the energy space for the 2D NLS with point interaction**|Luigi Forcella et.al.|[2410.16039v1](http://arxiv.org/abs/2410.16039v1)|null| -|**2024-10-21**|**HyperDrive: Scheduling Serverless Functions in the Edge-Cloud-Space 3D Continuum**|Thomas Pusztai et.al.|[2410.16026v1](http://arxiv.org/abs/2410.16026v1)|null| -|**2024-10-21**|**Coexistent Topological and Chiral Phonons in Chiral RhGe: An ab initio study**|P. V. Sreenivasa Reddy et.al.|[2410.16000v1](http://arxiv.org/abs/2410.16000v1)|null| -|**2024-10-21**|**The essential m-dissipativity for degenerate infinite dimensional stochastic Hamiltonian systems and applications**|Benedikt Eisenhuth et.al.|[2410.15993v1](http://arxiv.org/abs/2410.15993v1)|null| -|**2024-10-21**|**Interaction of the Prominence Plasma within the Magnetic Cloud of an ICME with the Earth's Bow Shock**|Hadi Madanian et.al.|[2410.15989v1](http://arxiv.org/abs/2410.15989v1)|null| -|**2024-10-21**|**A Critical Drift-Diffusion Equation: Connections to the Diffusion on $\textbf{SL}(2)$**|Peter Morfe et.al.|[2410.15983v1](http://arxiv.org/abs/2410.15983v1)|null| +|**2024-10-21**|**Multi-Sensor Fusion for UAV Classification Based on Feature Maps of Image and Radar Data**|Nikos Sakellariou et.al.|[2410.16089v1](http://arxiv.org/abs/2410.16089v1)|null| ### Point Cloud Segmentation |Publish Date|Title|Authors|PDF|Code| @@ -209,351 +243,179 @@ |**2024-10-21**|**Local well-posedness and blow-up in the energy space for the 2D NLS with point interaction**|Luigi Forcella et.al.|[2410.16039v1](http://arxiv.org/abs/2410.16039v1)|null| |**2024-10-21**|**HyperDrive: Scheduling Serverless Functions in the Edge-Cloud-Space 3D Continuum**|Thomas Pusztai et.al.|[2410.16026v1](http://arxiv.org/abs/2410.16026v1)|null| -### Point Cloud Registration -|Publish Date|Title|Authors|PDF|Code| -| :---: | :---: | :---: | :---: | :---: | -|**2024-10-21**|**The period-luminosity relation of long-period variables in the Large Magellanic Cloud observed with ATLAS**|Daniel Hey et.al.|[2410.16273v1](http://arxiv.org/abs/2410.16273v1)|null| -|**2024-10-21**|**SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree**|Shuangrui Ding et.al.|[2410.16268v1](http://arxiv.org/abs/2410.16268v1)|[link](https://github.com/mark12ding/sam2long)| -|**2024-10-21**|**Hyperbolicity in scalar-Gauss-Bonnet gravity: a gauge invariant study for spherical evolution**|Farid Thaalba et.al.|[2410.16264v1](http://arxiv.org/abs/2410.16264v1)|null| -|**2024-10-21**|**Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos**|Gengshan Yang et.al.|[2410.16259v1](http://arxiv.org/abs/2410.16259v1)|null| -|**2024-10-21**|**Nonlinear stability of extremal Reissner-Nordström black holes in spherical symmetry**|Yannis Angelopoulos et.al.|[2410.16234v1](http://arxiv.org/abs/2410.16234v1)|null| -|**2024-10-21**|**Managing Bandwidth: The Key to Cloud-Assisted Autonomous Driving**|Alexander Krentsel et.al.|[2410.16227v1](http://arxiv.org/abs/2410.16227v1)|null| -|**2024-10-21**|**IR finite correlation functions in de Sitter space, a smooth massless limit, and an autonomous equation**|Alexander Kamenshchik et.al.|[2410.16226v1](http://arxiv.org/abs/2410.16226v1)|null| -|**2024-10-21**|**PuLsE: Accurate and Robust Ultrasound-based Continuous Heart-Rate Monitoring on a Wrist-Worn IoT Device**|Marco Giordano et.al.|[2410.16219v1](http://arxiv.org/abs/2410.16219v1)|null| -|**2024-10-21**|**Locating the QCD critical point from first principles through contours of constant entropy density**|Hitansh Shah et.al.|[2410.16206v1](http://arxiv.org/abs/2410.16206v1)|null| -|**2024-10-21**|**Deformation of Matrix Geometry via Landau Level Evolution**|Kazuki Hasebe et.al.|[2410.16193v1](http://arxiv.org/abs/2410.16193v1)|null| -|**2024-10-21**|**Quantum Algorithms for Non-smooth Non-convex Optimization**|Chengchang Liu et.al.|[2410.16189v1](http://arxiv.org/abs/2410.16189v1)|null| -|**2024-10-21**|**Reconciling the kinematical constraint with the JIMWLK evolution equation: correlation functions non-local in rapidity**|Piotr Korcyl et.al.|[2410.16180v1](http://arxiv.org/abs/2410.16180v1)|null| -|**2024-10-21**|**AdChain: Decentralized Header Bidding**|Behkish Nassirzadeh et.al.|[2410.16141v1](http://arxiv.org/abs/2410.16141v1)|null| -|**2024-10-21**|**SMART: Self-learning Meta-strategy Agent for Reasoning Tasks**|Rongxing Liu et.al.|[2410.16128v1](http://arxiv.org/abs/2410.16128v1)|[link](https://github.com/kumar-shridhar/smart)| -|**2024-10-21**|**Ultra-High-Energy Gamma-Ray Bubble around Microquasar V4641 Sgr**|R. Alfaro et.al.|[2410.16117v1](http://arxiv.org/abs/2410.16117v1)|null| -|**2024-10-21**|**Dynamic Biases of Static Panel Data Estimators**|Sylvia Klosin et.al.|[2410.16112v1](http://arxiv.org/abs/2410.16112v1)|null| -|**2024-10-21**|**In-situ observations of the three-dimensional energy cascade rate and Yaglom flux in the Earth's magnetosheath**|Francesco Pecora et.al.|[2410.16099v1](http://arxiv.org/abs/2410.16099v1)|null| -|**2024-10-21**|**Holographic Weyl Anomaly in 8d from General Higher Curvature Gravity**|Fei-Yu Chen et.al.|[2410.16097v1](http://arxiv.org/abs/2410.16097v1)|null| -|**2024-10-21**|**Final Report for CHESS: Cloud, High-Performance Computing, and Edge for Science and Security**|Nathan Tallent et.al.|[2410.16093v1](http://arxiv.org/abs/2410.16093v1)|null| -|**2024-10-21**|**The $Λ_c^+\toηπ^+Λ$ reaction and the $Λ a_0^+(980)$ and $π^+Λ(1670)$ contributions**|Man-Yu Duan et.al.|[2410.16078v1](http://arxiv.org/abs/2410.16078v1)|null| -|**2024-10-21**|**Stochastic Exploration of Real Varieties via Variety Distributions**|David Kahle et.al.|[2410.16071v1](http://arxiv.org/abs/2410.16071v1)|null| -|**2024-10-21**|**Hubble Space Telescope Observations of Nearby Type 1 Quasars. I. Characterisation of the Extended [O III] 5007Å Emission**|Anna Trindade Falcão et.al.|[2410.16053v1](http://arxiv.org/abs/2410.16053v1)|null| -|**2024-10-21**|**Near-Optimal Algorithm for Non-Stationary Kernelized Bandits**|Shogo Iwazaki et.al.|[2410.16052v1](http://arxiv.org/abs/2410.16052v1)|null| -|**2024-10-21**|**Energy-Momentum tensor correlators in $φ^4$ theory I: The spin-zero sector**|Nikos Irges et.al.|[2410.16040v1](http://arxiv.org/abs/2410.16040v1)|null| -|**2024-10-21**|**Local well-posedness and blow-up in the energy space for the 2D NLS with point interaction**|Luigi Forcella et.al.|[2410.16039v1](http://arxiv.org/abs/2410.16039v1)|null| -|**2024-10-21**|**HyperDrive: Scheduling Serverless Functions in the Edge-Cloud-Space 3D Continuum**|Thomas Pusztai et.al.|[2410.16026v1](http://arxiv.org/abs/2410.16026v1)|null| -|**2024-10-21**|**Coexistent Topological and Chiral Phonons in Chiral RhGe: An ab initio study**|P. V. Sreenivasa Reddy et.al.|[2410.16000v1](http://arxiv.org/abs/2410.16000v1)|null| -|**2024-10-21**|**The essential m-dissipativity for degenerate infinite dimensional stochastic Hamiltonian systems and applications**|Benedikt Eisenhuth et.al.|[2410.15993v1](http://arxiv.org/abs/2410.15993v1)|null| -|**2024-10-21**|**Interaction of the Prominence Plasma within the Magnetic Cloud of an ICME with the Earth's Bow Shock**|Hadi Madanian et.al.|[2410.15989v1](http://arxiv.org/abs/2410.15989v1)|null| -|**2024-10-21**|**A Critical Drift-Diffusion Equation: Connections to the Diffusion on $\textbf{SL}(2)$**|Peter Morfe et.al.|[2410.15983v1](http://arxiv.org/abs/2410.15983v1)|null| - -## Computer Vision +## Federated Learning -### Image Classification +### Federated Learning |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | -|**2024-10-21**|**MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors**|Honghua Chen et.al.|[2410.16272v1](http://arxiv.org/abs/2410.16272v1)|null| -|**2024-10-21**|**Imaging supermoire relaxation and conductive domain walls in helical trilayer graphene**|Jesse C. Hoke et.al.|[2410.16269v1](http://arxiv.org/abs/2410.16269v1)|null| -|**2024-10-21**|**SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree**|Shuangrui Ding et.al.|[2410.16268v1](http://arxiv.org/abs/2410.16268v1)|[link](https://github.com/mark12ding/sam2long)| -|**2024-10-21**|**3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors**|Xi Liu et.al.|[2410.16266v1](http://arxiv.org/abs/2410.16266v1)|null| -|**2024-10-21**|**Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance**|Zhangwei Gao et.al.|[2410.16261v1](http://arxiv.org/abs/2410.16261v1)|[link](https://github.com/opengvlab/internvl)| +|**2024-10-21**|**FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors**|Chin-Yang Lin et.al.|[2410.16271v1](http://arxiv.org/abs/2410.16271v1)|null| +|**2024-10-21**|**xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs**|Michael S. Ryoo et.al.|[2410.16267v1](http://arxiv.org/abs/2410.16267v1)|null| +|**2024-10-21**|**Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos**|Gengshan Yang et.al.|[2410.16259v1](http://arxiv.org/abs/2410.16259v1)|null| |**2024-10-21**|**Elucidating the design space of language models for image generation**|Xuantong Liu et.al.|[2410.16257v1](http://arxiv.org/abs/2410.16257v1)|null| |**2024-10-21**|**Revisiting Deep Feature Reconstruction for Logical and Structural Industrial Anomaly Detection**|Sukanya Patra et.al.|[2410.16255v1](http://arxiv.org/abs/2410.16255v1)|[link](https://github.com/sukanyapatra1997/ulsad-2024)| |**2024-10-21**|**Distribution Learning with Valid Outputs Beyond the Worst-Case**|Nick Rittler et.al.|[2410.16253v1](http://arxiv.org/abs/2410.16253v1)|null| +|**2024-10-21**|**Random Spin Committee Approach For Smooth Interatomic Potentials**|Vlad Cărare et.al.|[2410.16252v1](http://arxiv.org/abs/2410.16252v1)|null| +|**2024-10-21**|**Hyperparameter Optimisation in Deep Learning from Ensemble Methods: Applications to Proton Structure**|Juan Cruz-Martinez et.al.|[2410.16248v1](http://arxiv.org/abs/2410.16248v1)|null| |**2024-10-21**|**Implicit Regularization for Tubal Tensor Factorizations via Gradient Descent**|Santhosh Karnik et.al.|[2410.16247v1](http://arxiv.org/abs/2410.16247v1)|null| -|**2024-10-21**|**Serendipitous detection of an intense X-ray flare in the weak-line T Tauri star KM Ori with SRG/eROSITA**|Savithri H. Ezhikode et.al.|[2410.16241v1](http://arxiv.org/abs/2410.16241v1)|null| +|**2024-10-21**|**Analyzing Context Contributions in LLM-based Machine Translation**|Emmanouil Zaranis et.al.|[2410.16246v1](http://arxiv.org/abs/2410.16246v1)|null| +|**2024-10-21**|**Nonlinear Magnetics Model for Permanent Magnet Synchronous Machines Capturing Saturation and Temperature Effects**|Kishan Srinivasan et.al.|[2410.16240v1](http://arxiv.org/abs/2410.16240v1)|null| |**2024-10-21**|**MoRE: Multi-Modal Contrastive Pre-training with Transformers on X-Rays, ECGs, and Diagnostic Report**|Samrajya Thapa et.al.|[2410.16239v1](http://arxiv.org/abs/2410.16239v1)|null| |**2024-10-21**|**Deep Radiomics Detection of Clinically Significant Prostate Cancer on Multicenter MRI: Initial Comparison to PI-RADS Assessment**|G. A. Nketiah et.al.|[2410.16238v1](http://arxiv.org/abs/2410.16238v1)|null| -|**2024-10-21**|**Gas puff imaging of plasma turbulence in the magnetic island scrape-off layer of W7-X**|S. G. Baek et.al.|[2410.16194v1](http://arxiv.org/abs/2410.16194v1)|null| +|**2024-10-21**|**ToW: Thoughts of Words Improve Reasoning in Large Language Models**|Zhikun Xu et.al.|[2410.16235v1](http://arxiv.org/abs/2410.16235v1)|null| +|**2024-10-21**|**Building A Coding Assistant via the Retrieval-Augmented Language Model**|Xinze Li et.al.|[2410.16229v1](http://arxiv.org/abs/2410.16229v1)|null| +|**2024-10-21**|**A Realistic Threat Model for Large Language Model Jailbreaks**|Valentyn Boreiko et.al.|[2410.16222v1](http://arxiv.org/abs/2410.16222v1)|null| +|**2024-10-21**|**On Creating an English-Thai Code-switched Machine Translation in Medical Domain**|Parinthapat Pengpun et.al.|[2410.16221v1](http://arxiv.org/abs/2410.16221v1)|null| +|**2024-10-21**|**Sample Optimal and Memory Efficient Quantum State Tomography**|Yanglin Hu et.al.|[2410.16220v1](http://arxiv.org/abs/2410.16220v1)|null| +|**2024-10-21**|**Pre-training Distillation for Large Language Models: A Design Space Exploration**|Hao Peng et.al.|[2410.16215v1](http://arxiv.org/abs/2410.16215v1)|null| +|**2024-10-21**|**Asymmetries in Financial Spillovers**|Florian Huber et.al.|[2410.16214v1](http://arxiv.org/abs/2410.16214v1)|null| +|**2024-10-21**|**Comprehensive benchmarking of large language models for RNA secondary structure prediction**|L. I. Zablocki et.al.|[2410.16212v1](http://arxiv.org/abs/2410.16212v1)|[link](https://github.com/sinc-lab/rna-llm-folding)| +|**2024-10-21**|**Compute-Constrained Data Selection**|Junjie Oscar Yin et.al.|[2410.16208v1](http://arxiv.org/abs/2410.16208v1)|[link](https://github.com/oseyosey/ccds)| +|**2024-10-21**|**CoT-TL: Low-Resource Temporal Knowledge Representation of Planning Instructions Using Chain-of-Thought Reasoning**|Kumar Manas et.al.|[2410.16207v1](http://arxiv.org/abs/2410.16207v1)|null| +|**2024-10-21**|**Systematic Review: Text Processing Algorithms in Machine Learning and Deep Learning for Mental Health Detection on Social Media**|Yuchen Cao et.al.|[2410.16204v1](http://arxiv.org/abs/2410.16204v1)|null| +|**2024-10-21**|**Theoretical Limitations of Ensembles in the Age of Overparameterization**|Niclas Dern et.al.|[2410.16201v1](http://arxiv.org/abs/2410.16201v1)|null| +|**2024-10-21**|**Improve Vision Language Model Chain-of-thought Reasoning**|Ruohong Zhang et.al.|[2410.16198v1](http://arxiv.org/abs/2410.16198v1)|null| +|**2024-10-21**|**A Trust-Region Method for Graphical Stein Variational Inference**|Liam Pavlovic et.al.|[2410.16195v1](http://arxiv.org/abs/2410.16195v1)|null| |**2024-10-21**|**Training Better Deep Learning Models Using Human Saliency**|Aidan Boyd et.al.|[2410.16190v1](http://arxiv.org/abs/2410.16190v1)|null| -|**2024-10-21**|**A Framework for Evaluating Predictive Models Using Synthetic Image Covariates and Longitudinal Data**|Simon Deltadahl et.al.|[2410.16177v1](http://arxiv.org/abs/2410.16177v1)|null| -|**2024-10-21**|**Correcting for Selection Biases in the Determination of the Hubble Constant from Time-Delay Cosmography**|ian Li et.al.|[2410.16171v1](http://arxiv.org/abs/2410.16171v1)|null| -|**2024-10-21**|**Beyond Filtering: Adaptive Image-Text Quality Enhancement for MLLM Pretraining**|Han Huang et.al.|[2410.16166v1](http://arxiv.org/abs/2410.16166v1)|[link](https://github.com/hanhuang22/aitqe)| -|**2024-10-21**|**Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models**|Yufei Zhan et.al.|[2410.16163v1](http://arxiv.org/abs/2410.16163v1)|[link](https://github.com/jefferyzhan/griffon)| -|**2024-10-21**|**Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models**|Giannis Daras et.al.|[2410.16152v1](http://arxiv.org/abs/2410.16152v1)|null| -|**2024-10-21**|**Denoising Hyperbolic-Valued Data by Relaxed Regularizations**|Robert Beinert et.al.|[2410.16149v1](http://arxiv.org/abs/2410.16149v1)|null| -|**2024-10-21**|**An Explainable Contrastive-based Dilated Convolutional Network with Transformer for Pediatric Pneumonia Detection**|Chandravardhan Singh Raghaw et.al.|[2410.16143v1](http://arxiv.org/abs/2410.16143v1)|null| -|**2024-10-21**|**Classification and dilation for $q$-commuting $2 \times 2$ scalar matrices**|Sourav Pal et.al.|[2410.16134v1](http://arxiv.org/abs/2410.16134v1)|null| +|**2024-10-21**|**Interpretable Prediction Rule Ensembles in the Presence of Missing Data**|Vincent Schroeder et.al.|[2410.16187v1](http://arxiv.org/abs/2410.16187v1)|null| +|**2024-10-21**|**RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style**|Yantao Liu et.al.|[2410.16184v1](http://arxiv.org/abs/2410.16184v1)|[link](https://github.com/thu-keg/rm-bench)| + +### Optimization +|Publish Date|Title|Authors|PDF|Code| +| :---: | :---: | :---: | :---: | :---: | +|**2024-10-21**|**DMM: Distributed Matrix Mechanism for Differentially-Private Federated Learning using Packed Secret Sharing**|Alexander Bienstock et.al.|[2410.16161v1](http://arxiv.org/abs/2410.16161v1)|null| |**2024-10-21**|**Extracting Spatiotemporal Data from Gradients with Large Language Models**|Lele Zheng et.al.|[2410.16121v1](http://arxiv.org/abs/2410.16121v1)|null| -|**2024-10-21**|**Ultra-High-Energy Gamma-Ray Bubble around Microquasar V4641 Sgr**|R. Alfaro et.al.|[2410.16117v1](http://arxiv.org/abs/2410.16117v1)|null| -|**2024-10-21**|**Multimodal Flare Forecasting with Deep Learning**|Grégoire Francisco et.al.|[2410.16116v1](http://arxiv.org/abs/2410.16116v1)|null| -|**2024-10-21**|**Increasing Interpretability of Neural Networks By Approximating Human Visual Saliency**|Aidan Boyd et.al.|[2410.16115v1](http://arxiv.org/abs/2410.16115v1)|null| -|**2024-10-21**|**Addressing Spectral Bias of Deep Neural Networks by Multi-Grade Deep Learning**|Ronglong Fang et.al.|[2410.16105v1](http://arxiv.org/abs/2410.16105v1)|[link](https://github.com/ronglong-fang/addressingspectralbiasviamgdl)| -|**2024-10-21**|**LMHaze: Intensity-aware Image Dehazing with a Large-scale Multi-intensity Real Haze Dataset**|Ruikun Zhang et.al.|[2410.16095v1](http://arxiv.org/abs/2410.16095v1)|[link](https://github.com/wangzrk/LMHaze)| -|**2024-10-21**|**Multi-Sensor Fusion for UAV Classification Based on Feature Maps of Image and Radar Data**|Nikos Sakellariou et.al.|[2410.16089v1](http://arxiv.org/abs/2410.16089v1)|null| -|**2024-10-21**|**Deep Multimodal Representation Learning for Stellar Spectra**|Tobias Buck et.al.|[2410.16081v1](http://arxiv.org/abs/2410.16081v1)|null| +|**2024-10-21**|**Distributed Learning for UAV Swarms**|Chen Hu et.al.|[2410.15882v1](http://arxiv.org/abs/2410.15882v1)|null| +|**2024-10-21**|**Geographical Node Clustering and Grouping to Guarantee Data IIDness in Federated Learning**|Minkwon Lee et.al.|[2410.15693v1](http://arxiv.org/abs/2410.15693v1)|null| +|**2024-10-21**|**Federated Learning with MMD-based Early Stopping for Adaptive GNSS Interference Classification**|Nishant S. Gaikwad et.al.|[2410.15681v1](http://arxiv.org/abs/2410.15681v1)|null| +|**2024-10-20**|**MIRA: A Method of Federated MultI-Task Learning for LaRge LAnguage Models**|Ahmed Elbakary et.al.|[2410.15524v1](http://arxiv.org/abs/2410.15524v1)|null| +|**2024-10-20**|**Bayesian data fusion for distributed learning**|Peng Wu et.al.|[2410.15473v1](http://arxiv.org/abs/2410.15473v1)|null| +|**2024-10-20**|**Tighter Performance Theory of FedExProx**|Wojciech Anyszka et.al.|[2410.15368v1](http://arxiv.org/abs/2410.15368v1)|null| +|**2024-10-19**|**DPVS-Shapley:Faster and Universal Contribution Evaluation Component in Federated Learning**|Ketin Yin et.al.|[2410.15093v1](http://arxiv.org/abs/2410.15093v1)|null| +|**2024-10-19**|**Personalized Federated Learning with Adaptive Feature Aggregation and Knowledge Transfer**|Keting Yin et.al.|[2410.15073v1](http://arxiv.org/abs/2410.15073v1)|null| +|**2024-10-19**|**Mind the Remaining: Mechanism Design for Robust Federated Unlearning**|Jiaqi Shao et.al.|[2410.15045v1](http://arxiv.org/abs/2410.15045v1)|null| +|**2024-10-18**|**FedSpaLLM: Federated Pruning of Large Language Models**|Guangji Bai et.al.|[2410.14852v1](http://arxiv.org/abs/2410.14852v1)|null| +|**2024-10-18**|**Assessing the Impact of AR-Assisted Warnings on Roadway Workers' Stress Under Different Workload Conditions**|Fatemeh Banani Ardecani et.al.|[2410.14537v1](http://arxiv.org/abs/2410.14537v1)|null| +|**2024-10-18**|**Personalizing Low-Rank Bayesian Neural Networks Via Federated Learning**|Boning Zhang et.al.|[2410.14390v1](http://arxiv.org/abs/2410.14390v1)|null| +|**2024-10-18**|**Comparative Evaluation of Clustered Federated Learning Method**|Michael Ben Ali et.al.|[2410.14212v1](http://arxiv.org/abs/2410.14212v1)|[link](https://github.com/leahcimali/Comparative-Evaluation-of-Clustered-Federated-Learning-Methods)| +|**2024-10-18**|**FedMSE: Federated learning for IoT network intrusion detection**|Van Tuan Nguyen et.al.|[2410.14121v1](http://arxiv.org/abs/2410.14121v1)|[link](https://github.com/dino-chiio/fedmse)| +|**2024-10-18**|**A Communication and Computation Efficient Fully First-order Method for Decentralized Bilevel Optimization**|Min Wen et.al.|[2410.14115v1](http://arxiv.org/abs/2410.14115v1)|null| +|**2024-10-17**|**FedPAE: Peer-Adaptive Ensemble Learning for Asynchronous and Model-Heterogeneous Federated Learning**|Brianna Mueller et.al.|[2410.14075v1](http://arxiv.org/abs/2410.14075v1)|null| +|**2024-10-17**|**Optimal Communication and Key Rate Region for Hierarchical Secure Aggregation with User Collusion**|Xiang Zhang et.al.|[2410.14035v2](http://arxiv.org/abs/2410.14035v2)|null| +|**2024-10-17**|**Conformal Prediction for Federated Graph Neural Networks with Missing Neighbor Information**|Ömer Faruk Akgül et.al.|[2410.14010v1](http://arxiv.org/abs/2410.14010v1)|null| +|**2024-10-17**|**DPFedBank: Crafting a Privacy-Preserving Federated Learning Framework for Financial Institutions with Policy Pillars**|Peilin He et.al.|[2410.13753v1](http://arxiv.org/abs/2410.13753v1)|null| +|**2024-10-17**|**On-device Federated Learning in Smartphones for Detecting Depression from Reddit Posts**|Mustofa Ahmed et.al.|[2410.13709v1](http://arxiv.org/abs/2410.13709v1)|null| +|**2024-10-17**|**Towards Satellite Non-IID Imagery: A Spectral Clustering-Assisted Federated Learning Approach**|Luyao Zou et.al.|[2410.13602v2](http://arxiv.org/abs/2410.13602v2)|null| +|**2024-10-17**|**Mathematical modelling to inform outbreak response vaccination**|Manjari Shankar et.al.|[2410.13923v1](http://arxiv.org/abs/2410.13923v1)|null| +|**2024-10-17**|**Breaking the Manual Annotation Bottleneck: Creating a Comprehensive Legal Case Criticality Dataset through Semi-Automated Labeling**|Ronja Stern et.al.|[2410.13460v1](http://arxiv.org/abs/2410.13460v1)|null| +|**2024-10-17**|**Unlocking Legal Knowledge: A Multilingual Dataset for Judicial Summarization in Switzerland**|Luca Rolshoven et.al.|[2410.13456v1](http://arxiv.org/abs/2410.13456v1)|null| +|**2024-10-17**|**Towards Formal Verification of Federated Learning Orchestration Protocols on Satellites**|Miroslav Popovic et.al.|[2410.13429v1](http://arxiv.org/abs/2410.13429v1)|null| +|**2024-10-17**|**FRAG: Toward Federated Vector Database Management for Collaborative and Secure Retrieval-Augmented Generation**|Dongfang Zhao et.al.|[2410.13272v1](http://arxiv.org/abs/2410.13272v1)|null| +|**2024-10-17**|**Cyber Attacks Prevention Towards Prosumer-based EV Charging Stations: An Edge-assisted Federated Prototype Knowledge Distillation Approach**|Luyao Zou et.al.|[2410.13260v2](http://arxiv.org/abs/2410.13260v2)|null| +|**2024-10-17**|**Investigating Effective Speaker Property Privacy Protection in Federated Learning for Speech Emotion Recognition**|Chao Tan et.al.|[2410.13221v1](http://arxiv.org/abs/2410.13221v1)|null| -### Object Detection +### Efficient |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | -|**2024-10-21**|**Cosmic Shimmering: the Gravitational Wave Signal of Time-Resolved Cosmic Shear Observations**|Giorgio Mentasti et.al.|[2410.16274v1](http://arxiv.org/abs/2410.16274v1)|null| -|**2024-10-21**|**MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors**|Honghua Chen et.al.|[2410.16272v1](http://arxiv.org/abs/2410.16272v1)|null| -|**2024-10-21**|**SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree**|Shuangrui Ding et.al.|[2410.16268v1](http://arxiv.org/abs/2410.16268v1)|[link](https://github.com/mark12ding/sam2long)| -|**2024-10-21**|**Revisiting Deep Feature Reconstruction for Logical and Structural Industrial Anomaly Detection**|Sukanya Patra et.al.|[2410.16255v1](http://arxiv.org/abs/2410.16255v1)|[link](https://github.com/sukanyapatra1997/ulsad-2024)| -|**2024-10-21**|**Composing Optimized Stepsize Schedules for Gradient Descent**|Benjamin Grimmer et.al.|[2410.16249v1](http://arxiv.org/abs/2410.16249v1)|null| -|**2024-10-21**|**Hyperparameter Optimisation in Deep Learning from Ensemble Methods: Applications to Proton Structure**|Juan Cruz-Martinez et.al.|[2410.16248v1](http://arxiv.org/abs/2410.16248v1)|null| -|**2024-10-21**|**About maximal antichains in a product of two chains:A catch-all note**|Denis Bouyssou et.al.|[2410.16243v1](http://arxiv.org/abs/2410.16243v1)|null| -|**2024-10-21**|**Deep Radiomics Detection of Clinically Significant Prostate Cancer on Multicenter MRI: Initial Comparison to PI-RADS Assessment**|G. A. Nketiah et.al.|[2410.16238v1](http://arxiv.org/abs/2410.16238v1)|null| -|**2024-10-21**|**IBGP: Imperfect Byzantine Generals Problem for Zero-Shot Robustness in Communicative Multi-Agent Systems**|Yihuan Mao et.al.|[2410.16237v1](http://arxiv.org/abs/2410.16237v1)|null| -|**2024-10-21**|**Quantum Algorithms for Non-smooth Non-convex Optimization**|Chengchang Liu et.al.|[2410.16189v1](http://arxiv.org/abs/2410.16189v1)|null| -|**2024-10-21**|**Search for $h_b(2P)\toγχ_{bJ}(1P)$ at $\sqrt{s} = 10.860$ GeV**|Belle Collaboration et.al.|[2410.16181v1](http://arxiv.org/abs/2410.16181v1)|null| -|**2024-10-21**|**Computing Inverses of Stieltjes Transforms of Probability Measures**|James Chen et.al.|[2410.16178v1](http://arxiv.org/abs/2410.16178v1)|null| -|**2024-10-21**|**The Interplay Between Physical Activity, Protein Consumption, and Sleep Quality in Muscle Protein Synthesis**|Ayush Devkota et.al.|[2410.16169v1](http://arxiv.org/abs/2410.16169v1)|null| -|**2024-10-21**|**Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models**|Yufei Zhan et.al.|[2410.16163v1](http://arxiv.org/abs/2410.16163v1)|[link](https://github.com/jefferyzhan/griffon)| -|**2024-10-21**|**Can Large Audio-Language Models Truly Hear? Tackling Hallucinations with Multi-Task Assessment and Stepwise Audio Reasoning**|Chun-Yi Kuan et.al.|[2410.16130v1](http://arxiv.org/abs/2410.16130v1)|null| -|**2024-10-21**|**Multi-Sensor Fusion for UAV Classification Based on Feature Maps of Image and Radar Data**|Nikos Sakellariou et.al.|[2410.16089v1](http://arxiv.org/abs/2410.16089v1)|null| -|**2024-10-21**|**Searching for Axion-Like Particles with X-ray Observations of Alpha Centauri**|Yu-Xuan Chen et.al.|[2410.16065v1](http://arxiv.org/abs/2410.16065v1)|null| -|**2024-10-21**|**Few-shot target-driven instance detection based on open-vocabulary object detection models**|Ben Crulis et.al.|[2410.16028v1](http://arxiv.org/abs/2410.16028v1)|null| -|**2024-10-21**|**Size and kinematics of the low-ionization broad emission line region from microlensing-induced line profile distortions in gravitationally lensed quasars**|Damien Hutsemékers et.al.|[2410.16018v1](http://arxiv.org/abs/2410.16018v1)|null| -|**2024-10-21**|**Differential Evolution-Based End-Fire Realized Gain Optimization of Active and Parasitic Arrays**|Rozita Konstantinou et.al.|[2410.16014v1](http://arxiv.org/abs/2410.16014v1)|null| -|**2024-10-21**|**A Critical Drift-Diffusion Equation: Connections to the Diffusion on $\textbf{SL}(2)$**|Peter Morfe et.al.|[2410.15983v1](http://arxiv.org/abs/2410.15983v1)|null| -|**2024-10-21**|**The Yang-Baxter equation, Leibniz algebras, racks and related algebraic structures**|Nanyan Xu et.al.|[2410.15972v1](http://arxiv.org/abs/2410.15972v1)|null| -|**2024-10-21**|**Partial Orientation Retrieval of Proteins From Coulomb Explosions**|Tomas André et.al.|[2410.15965v1](http://arxiv.org/abs/2410.15965v1)|null| -|**2024-10-21**|**Systematic Exploration of Dialogue Summarization Approaches for Reproducibility, Comparative Assessment, and Methodological Innovations for Advancing Natural Language Processing in Abstractive Summarization**|Yugandhar Reddy Gogireddy et.al.|[2410.15962v1](http://arxiv.org/abs/2410.15962v1)|null| -|**2024-10-21**|**CamI2V: Camera-Controlled Image-to-Video Diffusion Model**|Guangcong Zheng et.al.|[2410.15957v1](http://arxiv.org/abs/2410.15957v1)|[link](https://github.com/ZGCTroy/CamI2V)| -|**2024-10-21**|**Protein structure classification based on X-ray laser induced Coulomb explosion**|Tomas André et.al.|[2410.15934v1](http://arxiv.org/abs/2410.15934v1)|null| -|**2024-10-21**|**Mitigating Object Hallucination via Concentric Causal Attention**|Yun Xing et.al.|[2410.15926v1](http://arxiv.org/abs/2410.15926v1)|[link](https://github.com/xing0047/cca-llava)| -|**2024-10-21**|**Conceptual and practical approaches for investigating irreversible processes**|Dario Lucente et.al.|[2410.15925v1](http://arxiv.org/abs/2410.15925v1)|null| -|**2024-10-21**|**A Simpler Approach for Monotone Parametric Minimum Cut: Finding the Breakpoints in Order**|Arne Beines et.al.|[2410.15920v1](http://arxiv.org/abs/2410.15920v1)|null| -|**2024-10-21**|**On Poisson Distribution**|Dharmendra Kumar Singh et.al.|[2410.15914v1](http://arxiv.org/abs/2410.15914v1)|null| - -### Multi-Object Tracking -|Publish Date|Title|Authors|PDF|Code| -| :---: | :---: | :---: | :---: | :---: | -|**2024-10-21**|**Cosmic Shimmering: the Gravitational Wave Signal of Time-Resolved Cosmic Shear Observations**|Giorgio Mentasti et.al.|[2410.16274v1](http://arxiv.org/abs/2410.16274v1)|null| -|**2024-10-21**|**MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors**|Honghua Chen et.al.|[2410.16272v1](http://arxiv.org/abs/2410.16272v1)|null| -|**2024-10-21**|**SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree**|Shuangrui Ding et.al.|[2410.16268v1](http://arxiv.org/abs/2410.16268v1)|[link](https://github.com/mark12ding/sam2long)| -|**2024-10-21**|**xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs**|Michael S. Ryoo et.al.|[2410.16267v1](http://arxiv.org/abs/2410.16267v1)|null| -|**2024-10-21**|**Hyperbolicity in scalar-Gauss-Bonnet gravity: a gauge invariant study for spherical evolution**|Farid Thaalba et.al.|[2410.16264v1](http://arxiv.org/abs/2410.16264v1)|null| -|**2024-10-21**|**Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos**|Gengshan Yang et.al.|[2410.16259v1](http://arxiv.org/abs/2410.16259v1)|null| -|**2024-10-21**|**Revisiting Deep Feature Reconstruction for Logical and Structural Industrial Anomaly Detection**|Sukanya Patra et.al.|[2410.16255v1](http://arxiv.org/abs/2410.16255v1)|[link](https://github.com/sukanyapatra1997/ulsad-2024)| -|**2024-10-21**|**Composing Optimized Stepsize Schedules for Gradient Descent**|Benjamin Grimmer et.al.|[2410.16249v1](http://arxiv.org/abs/2410.16249v1)|null| -|**2024-10-21**|**Hyperparameter Optimisation in Deep Learning from Ensemble Methods: Applications to Proton Structure**|Juan Cruz-Martinez et.al.|[2410.16248v1](http://arxiv.org/abs/2410.16248v1)|null| -|**2024-10-21**|**About maximal antichains in a product of two chains:A catch-all note**|Denis Bouyssou et.al.|[2410.16243v1](http://arxiv.org/abs/2410.16243v1)|null| -|**2024-10-21**|**Deep Radiomics Detection of Clinically Significant Prostate Cancer on Multicenter MRI: Initial Comparison to PI-RADS Assessment**|G. A. Nketiah et.al.|[2410.16238v1](http://arxiv.org/abs/2410.16238v1)|null| -|**2024-10-21**|**IBGP: Imperfect Byzantine Generals Problem for Zero-Shot Robustness in Communicative Multi-Agent Systems**|Yihuan Mao et.al.|[2410.16237v1](http://arxiv.org/abs/2410.16237v1)|null| -|**2024-10-21**|**LLaVA-KD: A Framework of Distilling Multimodal Large Language Models**|Yuxuan Cai et.al.|[2410.16236v1](http://arxiv.org/abs/2410.16236v1)|null| -|**2024-10-21**|**CiteClick: A Browser Extension for Real-Time Scholar Citation Tracking**|Nishat Raihan et.al.|[2410.16211v1](http://arxiv.org/abs/2410.16211v1)|null| -|**2024-10-21**|**Training Better Deep Learning Models Using Human Saliency**|Aidan Boyd et.al.|[2410.16190v1](http://arxiv.org/abs/2410.16190v1)|null| -|**2024-10-21**|**Quantum Algorithms for Non-smooth Non-convex Optimization**|Chengchang Liu et.al.|[2410.16189v1](http://arxiv.org/abs/2410.16189v1)|null| -|**2024-10-21**|**Computing Inverses of Stieltjes Transforms of Probability Measures**|James Chen et.al.|[2410.16178v1](http://arxiv.org/abs/2410.16178v1)|null| -|**2024-10-21**|**A Framework for Evaluating Predictive Models Using Synthetic Image Covariates and Longitudinal Data**|Simon Deltadahl et.al.|[2410.16177v1](http://arxiv.org/abs/2410.16177v1)|null| -|**2024-10-21**|**The Interplay Between Physical Activity, Protein Consumption, and Sleep Quality in Muscle Protein Synthesis**|Ayush Devkota et.al.|[2410.16169v1](http://arxiv.org/abs/2410.16169v1)|null| -|**2024-10-21**|**Beyond Filtering: Adaptive Image-Text Quality Enhancement for MLLM Pretraining**|Han Huang et.al.|[2410.16166v1](http://arxiv.org/abs/2410.16166v1)|[link](https://github.com/hanhuang22/aitqe)| -|**2024-10-21**|**Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models**|Yufei Zhan et.al.|[2410.16163v1](http://arxiv.org/abs/2410.16163v1)|[link](https://github.com/jefferyzhan/griffon)| -|**2024-10-21**|**Networks: The Visual Language of Complexity**|Blai Vidiella et.al.|[2410.16158v1](http://arxiv.org/abs/2410.16158v1)|null| -|**2024-10-21**|**PODTILE: Facilitating Podcast Episode Browsing with Auto-generated Chapters**|Azin Ghazimatin et.al.|[2410.16148v1](http://arxiv.org/abs/2410.16148v1)|null| -|**2024-10-21**|**An Explainable Contrastive-based Dilated Convolutional Network with Transformer for Pediatric Pneumonia Detection**|Chandravardhan Singh Raghaw et.al.|[2410.16143v1](http://arxiv.org/abs/2410.16143v1)|null| -|**2024-10-21**|**Modeling dynamic neural activity by combining naturalistic video stimuli and stimulus-independent latent factors**|Finn Schmidt et.al.|[2410.16136v1](http://arxiv.org/abs/2410.16136v1)|null| -|**2024-10-21**|**Can Large Audio-Language Models Truly Hear? Tackling Hallucinations with Multi-Task Assessment and Stepwise Audio Reasoning**|Chun-Yi Kuan et.al.|[2410.16130v1](http://arxiv.org/abs/2410.16130v1)|null| -|**2024-10-21**|**Increasing Interpretability of Neural Networks By Approximating Human Visual Saliency**|Aidan Boyd et.al.|[2410.16115v1](http://arxiv.org/abs/2410.16115v1)|null| -|**2024-10-21**|**DUMBO: Making durable read-only transactions fly on hardware transactional memory**|João Barreto et.al.|[2410.16110v1](http://arxiv.org/abs/2410.16110v1)|null| -|**2024-10-21**|**Semantics of Sets of Programs**|Jinwoo Kim et.al.|[2410.16102v1](http://arxiv.org/abs/2410.16102v1)|null| -|**2024-10-21**|**Defending Against Attack on the Cloned: In-Band Active Man-in-the-Middle Detection for the Signal Protocol**|Wil Liam Teng et.al.|[2410.16098v1](http://arxiv.org/abs/2410.16098v1)|null| - -### Instance Segmentation -|Publish Date|Title|Authors|PDF|Code| -| :---: | :---: | :---: | :---: | :---: | -|**2024-10-21**|**SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree**|Shuangrui Ding et.al.|[2410.16268v1](http://arxiv.org/abs/2410.16268v1)|[link](https://github.com/mark12ding/sam2long)| -|**2024-10-21**|**Deep Radiomics Detection of Clinically Significant Prostate Cancer on Multicenter MRI: Initial Comparison to PI-RADS Assessment**|G. A. Nketiah et.al.|[2410.16238v1](http://arxiv.org/abs/2410.16238v1)|null| -|**2024-10-21**|**Role of obstacle softness in the diffusive behavior of active Particles**|Ankit Gupta et.al.|[2410.16223v1](http://arxiv.org/abs/2410.16223v1)|null| -|**2024-10-21**|**PODTILE: Facilitating Podcast Episode Browsing with Auto-generated Chapters**|Azin Ghazimatin et.al.|[2410.16148v1](http://arxiv.org/abs/2410.16148v1)|null| -|**2024-10-21**|**MNIST-Nd: a set of naturalistic datasets to benchmark clustering across dimensions**|Polina Turishcheva et.al.|[2410.16124v1](http://arxiv.org/abs/2410.16124v1)|null| -|**2024-10-21**|**ExDBN: Exact learning of Dynamic Bayesian Networks**|Pavel Rytíř et.al.|[2410.16100v1](http://arxiv.org/abs/2410.16100v1)|null| -|**2024-10-21**|**Orbifold singularity formation along ancient and immortal Ricci flows**|Alix Deruelle et.al.|[2410.16075v1](http://arxiv.org/abs/2410.16075v1)|null| -|**2024-10-21**|**Integrated Image-Text Based on Semi-supervised Learning for Small Sample Instance Segmentation**|Ruting Chi et.al.|[2410.16063v1](http://arxiv.org/abs/2410.16063v1)|null| -|**2024-10-21**|**Label Filling via Mixed Supervision for Medical Image Segmentation from Noisy Annotations**|Ming Li et.al.|[2410.16057v1](http://arxiv.org/abs/2410.16057v1)|null| -|**2024-10-21**|**Few-shot target-driven instance detection based on open-vocabulary object detection models**|Ben Crulis et.al.|[2410.16028v1](http://arxiv.org/abs/2410.16028v1)|null| -|**2024-10-21**|**CA*: Addressing Evaluation Pitfalls in Computation-Aware Latency for Simultaneous Speech Translation**|Xi Xu et.al.|[2410.16011v1](http://arxiv.org/abs/2410.16011v1)|null| -|**2024-10-21**|**Self-Explained Keywords Empower Large Language Models for Code Generation**|Lishui Fan et.al.|[2410.15966v1](http://arxiv.org/abs/2410.15966v1)|null| -|**2024-10-21**|**Redefining Finance: The Influence of Artificial Intelligence (AI) and Machine Learning (ML)**|Animesh Kumar et.al.|[2410.15951v1](http://arxiv.org/abs/2410.15951v1)|null| -|**2024-10-21**|**Focus on BEV: Self-calibrated Cycle View Transformation for Monocular Birds-Eye-View Segmentation**|Jiawei Zhao et.al.|[2410.15932v1](http://arxiv.org/abs/2410.15932v1)|null| -|**2024-10-21**|**Automatic Differentiation of Optimization Algorithms with Time-Varying Updates**|Sheheryar Mehmood et.al.|[2410.15923v1](http://arxiv.org/abs/2410.15923v1)|null| -|**2024-10-21**|**A Simpler Approach for Monotone Parametric Minimum Cut: Finding the Breakpoints in Order**|Arne Beines et.al.|[2410.15920v1](http://arxiv.org/abs/2410.15920v1)|null| -|**2024-10-21**|**Leveraging CORAL-Correlation Consistency Network for Semi-Supervised Left Atrium MRI Segmentation**|Xinze Li et.al.|[2410.15916v1](http://arxiv.org/abs/2410.15916v1)|[link](https://github.com/Powertony102/corn_official)| -|**2024-10-21**|**IGMaxHS -- An Incremental MaxSAT Solver with Support for XOR Clauses**|Ole Lübke et.al.|[2410.15897v1](http://arxiv.org/abs/2410.15897v1)|null| -|**2024-10-21**|**Foundation Models for Slide-level Cancer Subtyping in Digital Pathology**|Pablo Meseguer et.al.|[2410.15886v1](http://arxiv.org/abs/2410.15886v1)|null| -|**2024-10-21**|**How to Build a Pre-trained Multimodal model for Simultaneously Chatting and Decision-making?**|Zuojin Tang et.al.|[2410.15885v1](http://arxiv.org/abs/2410.15885v1)|null| -|**2024-10-21**|**MI-VisionShot: Few-shot adaptation of vision-language models for slide-level classification of histopathological images**|Pablo Meseguer et.al.|[2410.15881v1](http://arxiv.org/abs/2410.15881v1)|null| -|**2024-10-21**|**MultIHeaTS: A Fast and Stable Thermal Solver for Multilayered Planetary Surfaces**|Cyril Mergny et.al.|[2410.15834v1](http://arxiv.org/abs/2410.15834v1)|[link](https://github.com/cmergny/MultIHeaTS)| -|**2024-10-21**|**LiOn-XA: Unsupervised Domain Adaptation via LiDAR-Only Cross-Modal Adversarial Training**|Thomas Kreutz et.al.|[2410.15833v1](http://arxiv.org/abs/2410.15833v1)|[link](https://github.com/jensle97/lion-xa)| -|**2024-10-21**|**FusionLungNet: Multi-scale Fusion Convolution with Refinement Network for Lung CT Image Segmentation**|Sadjad Rezvani et.al.|[2410.15812v1](http://arxiv.org/abs/2410.15812v1)|null| -|**2024-10-21**|**Deep Learning and Data Augmentation for Detecting Self-Admitted Technical Debt**|Edi Sutoyo et.al.|[2410.15804v1](http://arxiv.org/abs/2410.15804v1)|[link](https://github.com/edisutoyo/satd-augmentation)| -|**2024-10-21**|**Habaek: High-performance water segmentation through dataset expansion and inductive bias optimization**|Hanseon Joo et.al.|[2410.15794v1](http://arxiv.org/abs/2410.15794v1)|[link](https://github.com/HanseonJoo/Habaek)| -|**2024-10-21**|**Improving Instance Optimization in Deformable Image Registration with Gradient Projection**|Yi Zhang et.al.|[2410.15767v1](http://arxiv.org/abs/2410.15767v1)|null| -|**2024-10-21**|**GIG: Graph Data Imputation With Graph Differential Dependencies**|Jiang Hua et.al.|[2410.15747v1](http://arxiv.org/abs/2410.15747v1)|null| -|**2024-10-21**|**Unleashing the Potential of Vision-Language Pre-Training for 3D Zero-Shot Lesion Segmentation via Mask-Attribute Alignment**|Yankai Jiang et.al.|[2410.15744v1](http://arxiv.org/abs/2410.15744v1)|null| -|**2024-10-21**|**A Fair Allocation is Approximately Optimal for Indivisible Chores, or Is It?**|Bo Li et.al.|[2410.15738v1](http://arxiv.org/abs/2410.15738v1)|null| - -### Object Tracking -|Publish Date|Title|Authors|PDF|Code| -| :---: | :---: | :---: | :---: | :---: | -|**2024-10-21**|**Cosmic Shimmering: the Gravitational Wave Signal of Time-Resolved Cosmic Shear Observations**|Giorgio Mentasti et.al.|[2410.16274v1](http://arxiv.org/abs/2410.16274v1)|null| -|**2024-10-21**|**MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors**|Honghua Chen et.al.|[2410.16272v1](http://arxiv.org/abs/2410.16272v1)|null| -|**2024-10-21**|**SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree**|Shuangrui Ding et.al.|[2410.16268v1](http://arxiv.org/abs/2410.16268v1)|[link](https://github.com/mark12ding/sam2long)| -|**2024-10-21**|**Hyperbolicity in scalar-Gauss-Bonnet gravity: a gauge invariant study for spherical evolution**|Farid Thaalba et.al.|[2410.16264v1](http://arxiv.org/abs/2410.16264v1)|null| -|**2024-10-21**|**Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos**|Gengshan Yang et.al.|[2410.16259v1](http://arxiv.org/abs/2410.16259v1)|null| -|**2024-10-21**|**Revisiting Deep Feature Reconstruction for Logical and Structural Industrial Anomaly Detection**|Sukanya Patra et.al.|[2410.16255v1](http://arxiv.org/abs/2410.16255v1)|[link](https://github.com/sukanyapatra1997/ulsad-2024)| -|**2024-10-21**|**Composing Optimized Stepsize Schedules for Gradient Descent**|Benjamin Grimmer et.al.|[2410.16249v1](http://arxiv.org/abs/2410.16249v1)|null| -|**2024-10-21**|**Hyperparameter Optimisation in Deep Learning from Ensemble Methods: Applications to Proton Structure**|Juan Cruz-Martinez et.al.|[2410.16248v1](http://arxiv.org/abs/2410.16248v1)|null| -|**2024-10-21**|**About maximal antichains in a product of two chains:A catch-all note**|Denis Bouyssou et.al.|[2410.16243v1](http://arxiv.org/abs/2410.16243v1)|null| -|**2024-10-21**|**Deep Radiomics Detection of Clinically Significant Prostate Cancer on Multicenter MRI: Initial Comparison to PI-RADS Assessment**|G. A. Nketiah et.al.|[2410.16238v1](http://arxiv.org/abs/2410.16238v1)|null| -|**2024-10-21**|**IBGP: Imperfect Byzantine Generals Problem for Zero-Shot Robustness in Communicative Multi-Agent Systems**|Yihuan Mao et.al.|[2410.16237v1](http://arxiv.org/abs/2410.16237v1)|null| -|**2024-10-21**|**CiteClick: A Browser Extension for Real-Time Scholar Citation Tracking**|Nishat Raihan et.al.|[2410.16211v1](http://arxiv.org/abs/2410.16211v1)|null| -|**2024-10-21**|**Quantum Algorithms for Non-smooth Non-convex Optimization**|Chengchang Liu et.al.|[2410.16189v1](http://arxiv.org/abs/2410.16189v1)|null| -|**2024-10-21**|**Computing Inverses of Stieltjes Transforms of Probability Measures**|James Chen et.al.|[2410.16178v1](http://arxiv.org/abs/2410.16178v1)|null| -|**2024-10-21**|**The Interplay Between Physical Activity, Protein Consumption, and Sleep Quality in Muscle Protein Synthesis**|Ayush Devkota et.al.|[2410.16169v1](http://arxiv.org/abs/2410.16169v1)|null| -|**2024-10-21**|**Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models**|Yufei Zhan et.al.|[2410.16163v1](http://arxiv.org/abs/2410.16163v1)|[link](https://github.com/jefferyzhan/griffon)| -|**2024-10-21**|**PODTILE: Facilitating Podcast Episode Browsing with Auto-generated Chapters**|Azin Ghazimatin et.al.|[2410.16148v1](http://arxiv.org/abs/2410.16148v1)|null| -|**2024-10-21**|**Can Large Audio-Language Models Truly Hear? Tackling Hallucinations with Multi-Task Assessment and Stepwise Audio Reasoning**|Chun-Yi Kuan et.al.|[2410.16130v1](http://arxiv.org/abs/2410.16130v1)|null| -|**2024-10-21**|**DUMBO: Making durable read-only transactions fly on hardware transactional memory**|João Barreto et.al.|[2410.16110v1](http://arxiv.org/abs/2410.16110v1)|null| -|**2024-10-21**|**Semantics of Sets of Programs**|Jinwoo Kim et.al.|[2410.16102v1](http://arxiv.org/abs/2410.16102v1)|null| -|**2024-10-21**|**Defending Against Attack on the Cloned: In-Band Active Man-in-the-Middle Detection for the Signal Protocol**|Wil Liam Teng et.al.|[2410.16098v1](http://arxiv.org/abs/2410.16098v1)|null| -|**2024-10-21**|**Multi-Sensor Fusion for UAV Classification Based on Feature Maps of Image and Radar Data**|Nikos Sakellariou et.al.|[2410.16089v1](http://arxiv.org/abs/2410.16089v1)|null| -|**2024-10-21**|**Few-shot target-driven instance detection based on open-vocabulary object detection models**|Ben Crulis et.al.|[2410.16028v1](http://arxiv.org/abs/2410.16028v1)|null| -|**2024-10-21**|**Size and kinematics of the low-ionization broad emission line region from microlensing-induced line profile distortions in gravitationally lensed quasars**|Damien Hutsemékers et.al.|[2410.16018v1](http://arxiv.org/abs/2410.16018v1)|null| -|**2024-10-21**|**Differential Evolution-Based End-Fire Realized Gain Optimization of Active and Parasitic Arrays**|Rozita Konstantinou et.al.|[2410.16014v1](http://arxiv.org/abs/2410.16014v1)|null| -|**2024-10-21**|**Surprising Patterns in Musical Influence Networks**|Flavio Figueiredo et.al.|[2410.15996v1](http://arxiv.org/abs/2410.15996v1)|null| -|**2024-10-21**|**A Critical Drift-Diffusion Equation: Connections to the Diffusion on $\textbf{SL}(2)$**|Peter Morfe et.al.|[2410.15983v1](http://arxiv.org/abs/2410.15983v1)|null| -|**2024-10-21**|**The Yang-Baxter equation, Leibniz algebras, racks and related algebraic structures**|Nanyan Xu et.al.|[2410.15972v1](http://arxiv.org/abs/2410.15972v1)|null| -|**2024-10-21**|**An Efficient Local Optimizer-Tracking Solver for Differential-Algebriac Equations with Optimization Criteria**|Alexander Fleming et.al.|[2410.15963v1](http://arxiv.org/abs/2410.15963v1)|null| -|**2024-10-21**|**Systematic Exploration of Dialogue Summarization Approaches for Reproducibility, Comparative Assessment, and Methodological Innovations for Advancing Natural Language Processing in Abstractive Summarization**|Yugandhar Reddy Gogireddy et.al.|[2410.15962v1](http://arxiv.org/abs/2410.15962v1)|null| - -### Keypoint Detection -|Publish Date|Title|Authors|PDF|Code| -| :---: | :---: | :---: | :---: | :---: | -|**2024-10-21**|**Accelerating Discovery of Extreme Lattice Thermal Conductivity by Crystal Attention Graph Neural Network (CATGNN) Using Chemical Bonding Intuitive Descriptors**|Mohammed Al-Fahdi et.al.|[2410.16066v1](http://arxiv.org/abs/2410.16066v1)|null| -|**2024-10-21**|**RANSAC Back to SOTA: A Two-stage Consensus Filtering for Real-time 3D Registration**|Pengcheng Shi et.al.|[2410.15682v1](http://arxiv.org/abs/2410.15682v1)|[link](https://github.com/shipc-ai/tcf)| -|**2024-10-19**|**The shape of the brain's connections is predictive of cognitive performance: an explainable machine learning study**|Yui Lo et.al.|[2410.15108v1](http://arxiv.org/abs/2410.15108v1)|null| -|**2024-10-18**|**Sim2real Cattle Joint Estimation in 3D point clouds**|Okour Mohammad et.al.|[2410.14419v1](http://arxiv.org/abs/2410.14419v1)|null| -|**2024-10-18**|**Functionality Determinants and Pore-level Quantitative Structure-Activity Relationship (QSAR) Approach for Water Permeation Rate in Aquaporins**|Juan José Galano-Frutos et.al.|[2410.14355v1](http://arxiv.org/abs/2410.14355v1)|null| -|**2024-10-18**|**2D-3D Deformable Image Registration of Histology Slide and Micro-CT with ML-based Initialization**|Junan Chen et.al.|[2410.14343v1](http://arxiv.org/abs/2410.14343v1)|null| -|**2024-10-17**|**Navigation maps of the material space for automated self-driving labs of the future**|Daniel E Widdowson et.al.|[2410.13796v1](http://arxiv.org/abs/2410.13796v1)|null| -|**2024-10-17**|**DualQuat-LOAM: LiDAR Odometry and Mapping parametrized on Dual Quaternions**|Edison P. Velasco-Sánchez et.al.|[2410.13541v1](http://arxiv.org/abs/2410.13541v1)|null| -|**2024-10-16**|**PND-Net: Plant Nutrition Deficiency and Disease Classification using Graph Convolutional Network**|Asish Bera et.al.|[2410.12742v1](http://arxiv.org/abs/2410.12742v1)|null| -|**2024-10-16**|**RAFA-Net: Region Attention Network For Food Items And Agricultural Stress Recognition**|Asish Bera et.al.|[2410.12718v1](http://arxiv.org/abs/2410.12718v1)|null| -|**2024-10-16**|**Optimizing Multi-Task Learning for Accurate Spacecraft Pose Estimation**|Francesco Evangelisti et.al.|[2410.12679v1](http://arxiv.org/abs/2410.12679v1)|null| -|**2024-10-16**|**AdaCropFollow: Self-Supervised Online Adaptation for Visual Under-Canopy Navigation**|Arun N. Sivakumar et.al.|[2410.12411v1](http://arxiv.org/abs/2410.12411v1)|null| -|**2024-10-15**|**Stochastic 3D reconstruction of cracked polycrystalline NMC particles using 2D SEM data**|Philipp Rieder et.al.|[2410.12020v1](http://arxiv.org/abs/2410.12020v1)|null| -|**2024-10-15**|**NavTopo: Leveraging Topological Maps For Autonomous Navigation Of a Mobile Robot**|Kirill Muravyev et.al.|[2410.11492v1](http://arxiv.org/abs/2410.11492v1)|null| -|**2024-10-14**|**Assessing Bias in Metric Models for LLM Open-Ended Generation Bias Benchmarks**|Nathaniel Demchak et.al.|[2410.11059v1](http://arxiv.org/abs/2410.11059v1)|null| -|**2024-10-14**|**3DArticCyclists: Generating Simulated Dynamic 3D Cyclists for Human-Object Interaction (HOI) and Autonomous Driving Applications**|Eduardo R. Corral-Soto et.al.|[2410.10782v1](http://arxiv.org/abs/2410.10782v1)|null| -|**2024-10-14**|**A Consistency-Aware Spot-Guided Transformer for Versatile and Hierarchical Point Cloud Registration**|Renlang Huang et.al.|[2410.10295v1](http://arxiv.org/abs/2410.10295v1)|[link](https://github.com/renlanghuang/cast)| -|**2024-10-13**|**Occluded Human Pose Estimation based on Limb Joint Augmentation**|Gangtao Han et.al.|[2410.09885v1](http://arxiv.org/abs/2410.09885v1)|null| -|**2024-10-13**|**Text4Seg: Reimagining Image Segmentation as Text Generation**|Mengcheng Lan et.al.|[2410.09855v1](http://arxiv.org/abs/2410.09855v1)|[link](https://github.com/mc-lan/text4seg)| -|**2024-10-12**|**Many-body Expansion Based Machine Learning Models for Octahedral Transition Metal Complexes**|Ralf Meyer et.al.|[2410.09659v1](http://arxiv.org/abs/2410.09659v1)|[link](https://github.com/hjkgrp/many_body_ml)| -|**2024-10-12**|**Objective Measurements of Voice Quality**|Hira Dhamyal et.al.|[2410.09578v1](http://arxiv.org/abs/2410.09578v1)|null| -|**2024-10-12**|**Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence**|Felipe Cadar et.al.|[2410.09533v1](http://arxiv.org/abs/2410.09533v1)|[link](https://github.com/verlab/DescriptorReasoning_ACCV_2024)| -|**2024-10-11**|**Enhanced Kalman with Adaptive Appearance Motion SORT for Grounded Generic Multiple Object Tracking**|Duy Le Dinh Anh et.al.|[2410.09243v1](http://arxiv.org/abs/2410.09243v1)|null| -|**2024-10-11**|**Facial Chick Sexing: An Automated Chick Sexing System From Chick Facial Image**|Marta Veganzones Rodriguez et.al.|[2410.09155v1](http://arxiv.org/abs/2410.09155v1)|null| -|**2024-10-11**|**Learning Spatial Bimanual Action Models Based on Affordance Regions and Human Demonstrations**|Björn S. Plonka et.al.|[2410.08848v1](http://arxiv.org/abs/2410.08848v1)|null| -|**2024-10-11**|**Data-Driven Discovery of the Origins of UV Absorption in Alpha-3C Protein**|Germaine Neza Hozana et.al.|[2410.08624v1](http://arxiv.org/abs/2410.08624v1)|null| -|**2024-10-11**|**Study on (r,s)- Generalised Transformation Graphs, A Novel Perspective Based on Transformation Graphs**|Parvez Ali et.al.|[2410.09122v1](http://arxiv.org/abs/2410.09122v1)|null| -|**2024-10-10**|**Electronic structure prediction of medium and high entropy alloys across composition space**|Shashank Pathrudkar et.al.|[2410.08294v1](http://arxiv.org/abs/2410.08294v1)|null| -|**2024-10-10**|**Using Lagrangian descriptors to calculate the Maslov index of periodic orbits**|J. Montes et.al.|[2410.08171v1](http://arxiv.org/abs/2410.08171v1)|null| -|**2024-10-10**|**Invisibility Cloak: Disappearance under Human Pose Estimation via Backdoor Attacks**|Minxing Zhang et.al.|[2410.07670v1](http://arxiv.org/abs/2410.07670v1)|null| - -### Image Matching -|Publish Date|Title|Authors|PDF|Code| -| :---: | :---: | :---: | :---: | :---: | -|**2024-10-21**|**MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors**|Honghua Chen et.al.|[2410.16272v1](http://arxiv.org/abs/2410.16272v1)|null| -|**2024-10-21**|**Imaging supermoire relaxation and conductive domain walls in helical trilayer graphene**|Jesse C. Hoke et.al.|[2410.16269v1](http://arxiv.org/abs/2410.16269v1)|null| -|**2024-10-21**|**SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree**|Shuangrui Ding et.al.|[2410.16268v1](http://arxiv.org/abs/2410.16268v1)|[link](https://github.com/mark12ding/sam2long)| -|**2024-10-21**|**3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors**|Xi Liu et.al.|[2410.16266v1](http://arxiv.org/abs/2410.16266v1)|null| -|**2024-10-21**|**Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance**|Zhangwei Gao et.al.|[2410.16261v1](http://arxiv.org/abs/2410.16261v1)|[link](https://github.com/opengvlab/internvl)| -|**2024-10-21**|**Elucidating the design space of language models for image generation**|Xuantong Liu et.al.|[2410.16257v1](http://arxiv.org/abs/2410.16257v1)|null| -|**2024-10-21**|**Revisiting Deep Feature Reconstruction for Logical and Structural Industrial Anomaly Detection**|Sukanya Patra et.al.|[2410.16255v1](http://arxiv.org/abs/2410.16255v1)|[link](https://github.com/sukanyapatra1997/ulsad-2024)| -|**2024-10-21**|**Distribution Learning with Valid Outputs Beyond the Worst-Case**|Nick Rittler et.al.|[2410.16253v1](http://arxiv.org/abs/2410.16253v1)|null| -|**2024-10-21**|**Composing Optimized Stepsize Schedules for Gradient Descent**|Benjamin Grimmer et.al.|[2410.16249v1](http://arxiv.org/abs/2410.16249v1)|null| -|**2024-10-21**|**Implicit Regularization for Tubal Tensor Factorizations via Gradient Descent**|Santhosh Karnik et.al.|[2410.16247v1](http://arxiv.org/abs/2410.16247v1)|null| -|**2024-10-21**|**Deep Radiomics Detection of Clinically Significant Prostate Cancer on Multicenter MRI: Initial Comparison to PI-RADS Assessment**|G. A. Nketiah et.al.|[2410.16238v1](http://arxiv.org/abs/2410.16238v1)|null| -|**2024-10-21**|**Gas puff imaging of plasma turbulence in the magnetic island scrape-off layer of W7-X**|S. G. Baek et.al.|[2410.16194v1](http://arxiv.org/abs/2410.16194v1)|null| -|**2024-10-21**|**Training Better Deep Learning Models Using Human Saliency**|Aidan Boyd et.al.|[2410.16190v1](http://arxiv.org/abs/2410.16190v1)|null| -|**2024-10-21**|**Interpretable Prediction Rule Ensembles in the Presence of Missing Data**|Vincent Schroeder et.al.|[2410.16187v1](http://arxiv.org/abs/2410.16187v1)|null| -|**2024-10-21**|**A Framework for Evaluating Predictive Models Using Synthetic Image Covariates and Longitudinal Data**|Simon Deltadahl et.al.|[2410.16177v1](http://arxiv.org/abs/2410.16177v1)|null| -|**2024-10-21**|**Correcting for Selection Biases in the Determination of the Hubble Constant from Time-Delay Cosmography**|ian Li et.al.|[2410.16171v1](http://arxiv.org/abs/2410.16171v1)|null| -|**2024-10-21**|**Beyond Filtering: Adaptive Image-Text Quality Enhancement for MLLM Pretraining**|Han Huang et.al.|[2410.16166v1](http://arxiv.org/abs/2410.16166v1)|[link](https://github.com/hanhuang22/aitqe)| -|**2024-10-21**|**Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models**|Yufei Zhan et.al.|[2410.16163v1](http://arxiv.org/abs/2410.16163v1)|[link](https://github.com/jefferyzhan/griffon)| -|**2024-10-21**|**Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models**|Giannis Daras et.al.|[2410.16152v1](http://arxiv.org/abs/2410.16152v1)|null| -|**2024-10-21**|**Denoising Hyperbolic-Valued Data by Relaxed Regularizations**|Robert Beinert et.al.|[2410.16149v1](http://arxiv.org/abs/2410.16149v1)|null| -|**2024-10-21**|**Holographic dictionary for Lifshitz and hyperscaling violating black holes**|Wan Cong et.al.|[2410.16145v1](http://arxiv.org/abs/2410.16145v1)|null| -|**2024-10-21**|**An Explainable Contrastive-based Dilated Convolutional Network with Transformer for Pediatric Pneumonia Detection**|Chandravardhan Singh Raghaw et.al.|[2410.16143v1](http://arxiv.org/abs/2410.16143v1)|null| +|**2024-10-21**|**DMM: Distributed Matrix Mechanism for Differentially-Private Federated Learning using Packed Secret Sharing**|Alexander Bienstock et.al.|[2410.16161v1](http://arxiv.org/abs/2410.16161v1)|null| |**2024-10-21**|**Extracting Spatiotemporal Data from Gradients with Large Language Models**|Lele Zheng et.al.|[2410.16121v1](http://arxiv.org/abs/2410.16121v1)|null| -|**2024-10-21**|**Ultra-High-Energy Gamma-Ray Bubble around Microquasar V4641 Sgr**|R. Alfaro et.al.|[2410.16117v1](http://arxiv.org/abs/2410.16117v1)|null| -|**2024-10-21**|**Multimodal Flare Forecasting with Deep Learning**|Grégoire Francisco et.al.|[2410.16116v1](http://arxiv.org/abs/2410.16116v1)|null| -|**2024-10-21**|**Increasing Interpretability of Neural Networks By Approximating Human Visual Saliency**|Aidan Boyd et.al.|[2410.16115v1](http://arxiv.org/abs/2410.16115v1)|null| -|**2024-10-21**|**Do LLMs write like humans? Variation in grammatical and rhetorical styles**|Alex Reinhart et.al.|[2410.16107v1](http://arxiv.org/abs/2410.16107v1)|null| -|**2024-10-21**|**Addressing Spectral Bias of Deep Neural Networks by Multi-Grade Deep Learning**|Ronglong Fang et.al.|[2410.16105v1](http://arxiv.org/abs/2410.16105v1)|[link](https://github.com/ronglong-fang/addressingspectralbiasviamgdl)| -|**2024-10-21**|**LMHaze: Intensity-aware Image Dehazing with a Large-scale Multi-intensity Real Haze Dataset**|Ruikun Zhang et.al.|[2410.16095v1](http://arxiv.org/abs/2410.16095v1)|[link](https://github.com/wangzrk/LMHaze)| -|**2024-10-21**|**Streaming and Communication Complexity of Load-Balancing via Matching Contractors**|Sepehr Assadi et.al.|[2410.16094v1](http://arxiv.org/abs/2410.16094v1)|null| +|**2024-10-21**|**Distributed Learning for UAV Swarms**|Chen Hu et.al.|[2410.15882v1](http://arxiv.org/abs/2410.15882v1)|null| +|**2024-10-21**|**Geographical Node Clustering and Grouping to Guarantee Data IIDness in Federated Learning**|Minkwon Lee et.al.|[2410.15693v1](http://arxiv.org/abs/2410.15693v1)|null| +|**2024-10-21**|**Federated Learning with MMD-based Early Stopping for Adaptive GNSS Interference Classification**|Nishant S. Gaikwad et.al.|[2410.15681v1](http://arxiv.org/abs/2410.15681v1)|null| +|**2024-10-20**|**MIRA: A Method of Federated MultI-Task Learning for LaRge LAnguage Models**|Ahmed Elbakary et.al.|[2410.15524v1](http://arxiv.org/abs/2410.15524v1)|null| +|**2024-10-20**|**Bayesian data fusion for distributed learning**|Peng Wu et.al.|[2410.15473v1](http://arxiv.org/abs/2410.15473v1)|null| +|**2024-10-20**|**Tighter Performance Theory of FedExProx**|Wojciech Anyszka et.al.|[2410.15368v1](http://arxiv.org/abs/2410.15368v1)|null| +|**2024-10-19**|**DPVS-Shapley:Faster and Universal Contribution Evaluation Component in Federated Learning**|Ketin Yin et.al.|[2410.15093v1](http://arxiv.org/abs/2410.15093v1)|null| +|**2024-10-19**|**Personalized Federated Learning with Adaptive Feature Aggregation and Knowledge Transfer**|Keting Yin et.al.|[2410.15073v1](http://arxiv.org/abs/2410.15073v1)|null| +|**2024-10-19**|**Mind the Remaining: Mechanism Design for Robust Federated Unlearning**|Jiaqi Shao et.al.|[2410.15045v1](http://arxiv.org/abs/2410.15045v1)|null| +|**2024-10-18**|**FedSpaLLM: Federated Pruning of Large Language Models**|Guangji Bai et.al.|[2410.14852v1](http://arxiv.org/abs/2410.14852v1)|null| +|**2024-10-18**|**Assessing the Impact of AR-Assisted Warnings on Roadway Workers' Stress Under Different Workload Conditions**|Fatemeh Banani Ardecani et.al.|[2410.14537v1](http://arxiv.org/abs/2410.14537v1)|null| +|**2024-10-18**|**Personalizing Low-Rank Bayesian Neural Networks Via Federated Learning**|Boning Zhang et.al.|[2410.14390v1](http://arxiv.org/abs/2410.14390v1)|null| +|**2024-10-18**|**Comparative Evaluation of Clustered Federated Learning Method**|Michael Ben Ali et.al.|[2410.14212v1](http://arxiv.org/abs/2410.14212v1)|[link](https://github.com/leahcimali/Comparative-Evaluation-of-Clustered-Federated-Learning-Methods)| +|**2024-10-18**|**FedMSE: Federated learning for IoT network intrusion detection**|Van Tuan Nguyen et.al.|[2410.14121v1](http://arxiv.org/abs/2410.14121v1)|[link](https://github.com/dino-chiio/fedmse)| +|**2024-10-18**|**A Communication and Computation Efficient Fully First-order Method for Decentralized Bilevel Optimization**|Min Wen et.al.|[2410.14115v1](http://arxiv.org/abs/2410.14115v1)|null| +|**2024-10-17**|**FedPAE: Peer-Adaptive Ensemble Learning for Asynchronous and Model-Heterogeneous Federated Learning**|Brianna Mueller et.al.|[2410.14075v1](http://arxiv.org/abs/2410.14075v1)|null| +|**2024-10-17**|**Optimal Communication and Key Rate Region for Hierarchical Secure Aggregation with User Collusion**|Xiang Zhang et.al.|[2410.14035v2](http://arxiv.org/abs/2410.14035v2)|null| +|**2024-10-17**|**Conformal Prediction for Federated Graph Neural Networks with Missing Neighbor Information**|Ömer Faruk Akgül et.al.|[2410.14010v1](http://arxiv.org/abs/2410.14010v1)|null| +|**2024-10-17**|**DPFedBank: Crafting a Privacy-Preserving Federated Learning Framework for Financial Institutions with Policy Pillars**|Peilin He et.al.|[2410.13753v1](http://arxiv.org/abs/2410.13753v1)|null| +|**2024-10-17**|**On-device Federated Learning in Smartphones for Detecting Depression from Reddit Posts**|Mustofa Ahmed et.al.|[2410.13709v1](http://arxiv.org/abs/2410.13709v1)|null| +|**2024-10-17**|**Towards Satellite Non-IID Imagery: A Spectral Clustering-Assisted Federated Learning Approach**|Luyao Zou et.al.|[2410.13602v2](http://arxiv.org/abs/2410.13602v2)|null| +|**2024-10-17**|**Mathematical modelling to inform outbreak response vaccination**|Manjari Shankar et.al.|[2410.13923v1](http://arxiv.org/abs/2410.13923v1)|null| +|**2024-10-17**|**Breaking the Manual Annotation Bottleneck: Creating a Comprehensive Legal Case Criticality Dataset through Semi-Automated Labeling**|Ronja Stern et.al.|[2410.13460v1](http://arxiv.org/abs/2410.13460v1)|null| +|**2024-10-17**|**Unlocking Legal Knowledge: A Multilingual Dataset for Judicial Summarization in Switzerland**|Luca Rolshoven et.al.|[2410.13456v1](http://arxiv.org/abs/2410.13456v1)|null| +|**2024-10-17**|**Towards Formal Verification of Federated Learning Orchestration Protocols on Satellites**|Miroslav Popovic et.al.|[2410.13429v1](http://arxiv.org/abs/2410.13429v1)|null| +|**2024-10-17**|**FRAG: Toward Federated Vector Database Management for Collaborative and Secure Retrieval-Augmented Generation**|Dongfang Zhao et.al.|[2410.13272v1](http://arxiv.org/abs/2410.13272v1)|null| +|**2024-10-17**|**Cyber Attacks Prevention Towards Prosumer-based EV Charging Stations: An Edge-assisted Federated Prototype Knowledge Distillation Approach**|Luyao Zou et.al.|[2410.13260v2](http://arxiv.org/abs/2410.13260v2)|null| +|**2024-10-17**|**Investigating Effective Speaker Property Privacy Protection in Federated Learning for Speech Emotion Recognition**|Chao Tan et.al.|[2410.13221v1](http://arxiv.org/abs/2410.13221v1)|null| -### Semantic Segmentation +### Communication |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | -|**2024-10-21**|**SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree**|Shuangrui Ding et.al.|[2410.16268v1](http://arxiv.org/abs/2410.16268v1)|[link](https://github.com/mark12ding/sam2long)| -|**2024-10-21**|**Deep Radiomics Detection of Clinically Significant Prostate Cancer on Multicenter MRI: Initial Comparison to PI-RADS Assessment**|G. A. Nketiah et.al.|[2410.16238v1](http://arxiv.org/abs/2410.16238v1)|null| -|**2024-10-21**|**ToW: Thoughts of Words Improve Reasoning in Large Language Models**|Zhikun Xu et.al.|[2410.16235v1](http://arxiv.org/abs/2410.16235v1)|null| -|**2024-10-21**|**Building A Coding Assistant via the Retrieval-Augmented Language Model**|Xinze Li et.al.|[2410.16229v1](http://arxiv.org/abs/2410.16229v1)|null| -|**2024-10-21**|**Comprehensive benchmarking of large language models for RNA secondary structure prediction**|L. I. Zablocki et.al.|[2410.16212v1](http://arxiv.org/abs/2410.16212v1)|[link](https://github.com/sinc-lab/rna-llm-folding)| -|**2024-10-21**|**CoT-TL: Low-Resource Temporal Knowledge Representation of Planning Instructions Using Chain-of-Thought Reasoning**|Kumar Manas et.al.|[2410.16207v1](http://arxiv.org/abs/2410.16207v1)|null| -|**2024-10-21**|**Beyond Filtering: Adaptive Image-Text Quality Enhancement for MLLM Pretraining**|Han Huang et.al.|[2410.16166v1](http://arxiv.org/abs/2410.16166v1)|[link](https://github.com/hanhuang22/aitqe)| -|**2024-10-21**|**Limpeh ga li gong: Challenges in Singlish Annotations**|Lynnette Hui Xian Ng et.al.|[2410.16156v1](http://arxiv.org/abs/2410.16156v1)|null| -|**2024-10-21**|**PODTILE: Facilitating Podcast Episode Browsing with Auto-generated Chapters**|Azin Ghazimatin et.al.|[2410.16148v1](http://arxiv.org/abs/2410.16148v1)|null| -|**2024-10-21**|**Towards Combating Frequency Simplicity-biased Learning for Domain Generalization**|Xilin He et.al.|[2410.16146v1](http://arxiv.org/abs/2410.16146v1)|[link](https://github.com/c0notsilly/advfrequency)| -|**2024-10-21**|**Semantics of Sets of Programs**|Jinwoo Kim et.al.|[2410.16102v1](http://arxiv.org/abs/2410.16102v1)|null| -|**2024-10-21**|**ChangeGuard: Validating Code Changes via Pairwise Learning-Guided Execution**|Lars Gröninger et.al.|[2410.16092v1](http://arxiv.org/abs/2410.16092v1)|null| -|**2024-10-21**|**Integrated Image-Text Based on Semi-supervised Learning for Small Sample Instance Segmentation**|Ruting Chi et.al.|[2410.16063v1](http://arxiv.org/abs/2410.16063v1)|null| -|**2024-10-21**|**Label Filling via Mixed Supervision for Medical Image Segmentation from Noisy Annotations**|Ming Li et.al.|[2410.16057v1](http://arxiv.org/abs/2410.16057v1)|null| -|**2024-10-21**|**Continuous Speech Synthesis using per-token Latent Diffusion**|Arnon Turetzky et.al.|[2410.16048v1](http://arxiv.org/abs/2410.16048v1)|null| -|**2024-10-21**|**CA*: Addressing Evaluation Pitfalls in Computation-Aware Latency for Simultaneous Speech Translation**|Xi Xu et.al.|[2410.16011v1](http://arxiv.org/abs/2410.16011v1)|null| -|**2024-10-21**|**Visual Representation Learning Guided By Multi-modal Prior Knowledge**|Hongkuan Zhou et.al.|[2410.15981v1](http://arxiv.org/abs/2410.15981v1)|null| -|**2024-10-21**|**Fundamental sequences based on localization**|Gunnar Wilken et.al.|[2410.15953v1](http://arxiv.org/abs/2410.15953v1)|null| -|**2024-10-21**|**Redefining Finance: The Influence of Artificial Intelligence (AI) and Machine Learning (ML)**|Animesh Kumar et.al.|[2410.15951v1](http://arxiv.org/abs/2410.15951v1)|null| -|**2024-10-21**|**Focus on BEV: Self-calibrated Cycle View Transformation for Monocular Birds-Eye-View Segmentation**|Jiawei Zhao et.al.|[2410.15932v1](http://arxiv.org/abs/2410.15932v1)|null| -|**2024-10-21**|**Centrality-aware Product Retrieval and Ranking**|Hadeel Saadany et.al.|[2410.15930v1](http://arxiv.org/abs/2410.15930v1)|null| -|**2024-10-21**|**Leveraging CORAL-Correlation Consistency Network for Semi-Supervised Left Atrium MRI Segmentation**|Xinze Li et.al.|[2410.15916v1](http://arxiv.org/abs/2410.15916v1)|[link](https://github.com/Powertony102/corn_official)| -|**2024-10-21**|**How to Build a Pre-trained Multimodal model for Simultaneously Chatting and Decision-making?**|Zuojin Tang et.al.|[2410.15885v1](http://arxiv.org/abs/2410.15885v1)|null| -|**2024-10-21**|**Principles of semantic and functional efficiency in grammatical patterning**|Emily Cheng et.al.|[2410.15865v1](http://arxiv.org/abs/2410.15865v1)|null| -|**2024-10-21**|**LiOn-XA: Unsupervised Domain Adaptation via LiDAR-Only Cross-Modal Adversarial Training**|Thomas Kreutz et.al.|[2410.15833v1](http://arxiv.org/abs/2410.15833v1)|[link](https://github.com/jensle97/lion-xa)| -|**2024-10-21**|**Large Language Models Empower Personalized Valuation in Auction**|Jie Sun et.al.|[2410.15817v1](http://arxiv.org/abs/2410.15817v1)|null| -|**2024-10-21**|**Kaninfradet3D:A Road-side Camera-LiDAR Fusion 3D Perception Model based on Nonlinear Feature Extraction and Intrinsic Correlation**|Pei Liu et.al.|[2410.15814v1](http://arxiv.org/abs/2410.15814v1)|null| -|**2024-10-21**|**FusionLungNet: Multi-scale Fusion Convolution with Refinement Network for Lung CT Image Segmentation**|Sadjad Rezvani et.al.|[2410.15812v1](http://arxiv.org/abs/2410.15812v1)|null| -|**2024-10-21**|**Data-Efficient CLIP-Powered Dual-Branch Networks for Source-Free Unsupervised Domain Adaptation**|Yongguang Li et.al.|[2410.15811v1](http://arxiv.org/abs/2410.15811v1)|null| -|**2024-10-21**|**Habaek: High-performance water segmentation through dataset expansion and inductive bias optimization**|Hanseon Joo et.al.|[2410.15794v1](http://arxiv.org/abs/2410.15794v1)|[link](https://github.com/HanseonJoo/Habaek)| - -## Federated Learning +|**2024-10-21**|**DMM: Distributed Matrix Mechanism for Differentially-Private Federated Learning using Packed Secret Sharing**|Alexander Bienstock et.al.|[2410.16161v1](http://arxiv.org/abs/2410.16161v1)|null| +|**2024-10-21**|**Extracting Spatiotemporal Data from Gradients with Large Language Models**|Lele Zheng et.al.|[2410.16121v1](http://arxiv.org/abs/2410.16121v1)|null| +|**2024-10-21**|**Distributed Learning for UAV Swarms**|Chen Hu et.al.|[2410.15882v1](http://arxiv.org/abs/2410.15882v1)|null| +|**2024-10-21**|**Geographical Node Clustering and Grouping to Guarantee Data IIDness in Federated Learning**|Minkwon Lee et.al.|[2410.15693v1](http://arxiv.org/abs/2410.15693v1)|null| +|**2024-10-21**|**Federated Learning with MMD-based Early Stopping for Adaptive GNSS Interference Classification**|Nishant S. Gaikwad et.al.|[2410.15681v1](http://arxiv.org/abs/2410.15681v1)|null| +|**2024-10-20**|**MIRA: A Method of Federated MultI-Task Learning for LaRge LAnguage Models**|Ahmed Elbakary et.al.|[2410.15524v1](http://arxiv.org/abs/2410.15524v1)|null| +|**2024-10-20**|**Bayesian data fusion for distributed learning**|Peng Wu et.al.|[2410.15473v1](http://arxiv.org/abs/2410.15473v1)|null| +|**2024-10-20**|**Tighter Performance Theory of FedExProx**|Wojciech Anyszka et.al.|[2410.15368v1](http://arxiv.org/abs/2410.15368v1)|null| +|**2024-10-19**|**DPVS-Shapley:Faster and Universal Contribution Evaluation Component in Federated Learning**|Ketin Yin et.al.|[2410.15093v1](http://arxiv.org/abs/2410.15093v1)|null| +|**2024-10-19**|**Personalized Federated Learning with Adaptive Feature Aggregation and Knowledge Transfer**|Keting Yin et.al.|[2410.15073v1](http://arxiv.org/abs/2410.15073v1)|null| +|**2024-10-19**|**Mind the Remaining: Mechanism Design for Robust Federated Unlearning**|Jiaqi Shao et.al.|[2410.15045v1](http://arxiv.org/abs/2410.15045v1)|null| +|**2024-10-18**|**FedSpaLLM: Federated Pruning of Large Language Models**|Guangji Bai et.al.|[2410.14852v1](http://arxiv.org/abs/2410.14852v1)|null| +|**2024-10-18**|**Assessing the Impact of AR-Assisted Warnings on Roadway Workers' Stress Under Different Workload Conditions**|Fatemeh Banani Ardecani et.al.|[2410.14537v1](http://arxiv.org/abs/2410.14537v1)|null| +|**2024-10-18**|**Personalizing Low-Rank Bayesian Neural Networks Via Federated Learning**|Boning Zhang et.al.|[2410.14390v1](http://arxiv.org/abs/2410.14390v1)|null| +|**2024-10-18**|**Comparative Evaluation of Clustered Federated Learning Method**|Michael Ben Ali et.al.|[2410.14212v1](http://arxiv.org/abs/2410.14212v1)|[link](https://github.com/leahcimali/Comparative-Evaluation-of-Clustered-Federated-Learning-Methods)| +|**2024-10-18**|**FedMSE: Federated learning for IoT network intrusion detection**|Van Tuan Nguyen et.al.|[2410.14121v1](http://arxiv.org/abs/2410.14121v1)|[link](https://github.com/dino-chiio/fedmse)| +|**2024-10-18**|**A Communication and Computation Efficient Fully First-order Method for Decentralized Bilevel Optimization**|Min Wen et.al.|[2410.14115v1](http://arxiv.org/abs/2410.14115v1)|null| +|**2024-10-17**|**FedPAE: Peer-Adaptive Ensemble Learning for Asynchronous and Model-Heterogeneous Federated Learning**|Brianna Mueller et.al.|[2410.14075v1](http://arxiv.org/abs/2410.14075v1)|null| +|**2024-10-17**|**Optimal Communication and Key Rate Region for Hierarchical Secure Aggregation with User Collusion**|Xiang Zhang et.al.|[2410.14035v2](http://arxiv.org/abs/2410.14035v2)|null| +|**2024-10-17**|**Conformal Prediction for Federated Graph Neural Networks with Missing Neighbor Information**|Ömer Faruk Akgül et.al.|[2410.14010v1](http://arxiv.org/abs/2410.14010v1)|null| +|**2024-10-17**|**DPFedBank: Crafting a Privacy-Preserving Federated Learning Framework for Financial Institutions with Policy Pillars**|Peilin He et.al.|[2410.13753v1](http://arxiv.org/abs/2410.13753v1)|null| +|**2024-10-17**|**On-device Federated Learning in Smartphones for Detecting Depression from Reddit Posts**|Mustofa Ahmed et.al.|[2410.13709v1](http://arxiv.org/abs/2410.13709v1)|null| +|**2024-10-17**|**Towards Satellite Non-IID Imagery: A Spectral Clustering-Assisted Federated Learning Approach**|Luyao Zou et.al.|[2410.13602v2](http://arxiv.org/abs/2410.13602v2)|null| +|**2024-10-17**|**Mathematical modelling to inform outbreak response vaccination**|Manjari Shankar et.al.|[2410.13923v1](http://arxiv.org/abs/2410.13923v1)|null| +|**2024-10-17**|**Breaking the Manual Annotation Bottleneck: Creating a Comprehensive Legal Case Criticality Dataset through Semi-Automated Labeling**|Ronja Stern et.al.|[2410.13460v1](http://arxiv.org/abs/2410.13460v1)|null| +|**2024-10-17**|**Unlocking Legal Knowledge: A Multilingual Dataset for Judicial Summarization in Switzerland**|Luca Rolshoven et.al.|[2410.13456v1](http://arxiv.org/abs/2410.13456v1)|null| +|**2024-10-17**|**Towards Formal Verification of Federated Learning Orchestration Protocols on Satellites**|Miroslav Popovic et.al.|[2410.13429v1](http://arxiv.org/abs/2410.13429v1)|null| +|**2024-10-17**|**FRAG: Toward Federated Vector Database Management for Collaborative and Secure Retrieval-Augmented Generation**|Dongfang Zhao et.al.|[2410.13272v1](http://arxiv.org/abs/2410.13272v1)|null| +|**2024-10-17**|**Cyber Attacks Prevention Towards Prosumer-based EV Charging Stations: An Edge-assisted Federated Prototype Knowledge Distillation Approach**|Luyao Zou et.al.|[2410.13260v2](http://arxiv.org/abs/2410.13260v2)|null| +|**2024-10-17**|**Investigating Effective Speaker Property Privacy Protection in Federated Learning for Speech Emotion Recognition**|Chao Tan et.al.|[2410.13221v1](http://arxiv.org/abs/2410.13221v1)|null| -### Federated Learning +### Framework |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | -|**2024-10-21**|**FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors**|Chin-Yang Lin et.al.|[2410.16271v1](http://arxiv.org/abs/2410.16271v1)|null| -|**2024-10-21**|**xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs**|Michael S. Ryoo et.al.|[2410.16267v1](http://arxiv.org/abs/2410.16267v1)|null| -|**2024-10-21**|**Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos**|Gengshan Yang et.al.|[2410.16259v1](http://arxiv.org/abs/2410.16259v1)|null| -|**2024-10-21**|**Elucidating the design space of language models for image generation**|Xuantong Liu et.al.|[2410.16257v1](http://arxiv.org/abs/2410.16257v1)|null| -|**2024-10-21**|**Revisiting Deep Feature Reconstruction for Logical and Structural Industrial Anomaly Detection**|Sukanya Patra et.al.|[2410.16255v1](http://arxiv.org/abs/2410.16255v1)|[link](https://github.com/sukanyapatra1997/ulsad-2024)| -|**2024-10-21**|**Distribution Learning with Valid Outputs Beyond the Worst-Case**|Nick Rittler et.al.|[2410.16253v1](http://arxiv.org/abs/2410.16253v1)|null| -|**2024-10-21**|**Random Spin Committee Approach For Smooth Interatomic Potentials**|Vlad Cărare et.al.|[2410.16252v1](http://arxiv.org/abs/2410.16252v1)|null| -|**2024-10-21**|**Hyperparameter Optimisation in Deep Learning from Ensemble Methods: Applications to Proton Structure**|Juan Cruz-Martinez et.al.|[2410.16248v1](http://arxiv.org/abs/2410.16248v1)|null| -|**2024-10-21**|**Implicit Regularization for Tubal Tensor Factorizations via Gradient Descent**|Santhosh Karnik et.al.|[2410.16247v1](http://arxiv.org/abs/2410.16247v1)|null| -|**2024-10-21**|**Analyzing Context Contributions in LLM-based Machine Translation**|Emmanouil Zaranis et.al.|[2410.16246v1](http://arxiv.org/abs/2410.16246v1)|null| -|**2024-10-21**|**Nonlinear Magnetics Model for Permanent Magnet Synchronous Machines Capturing Saturation and Temperature Effects**|Kishan Srinivasan et.al.|[2410.16240v1](http://arxiv.org/abs/2410.16240v1)|null| -|**2024-10-21**|**MoRE: Multi-Modal Contrastive Pre-training with Transformers on X-Rays, ECGs, and Diagnostic Report**|Samrajya Thapa et.al.|[2410.16239v1](http://arxiv.org/abs/2410.16239v1)|null| -|**2024-10-21**|**Deep Radiomics Detection of Clinically Significant Prostate Cancer on Multicenter MRI: Initial Comparison to PI-RADS Assessment**|G. A. Nketiah et.al.|[2410.16238v1](http://arxiv.org/abs/2410.16238v1)|null| -|**2024-10-21**|**ToW: Thoughts of Words Improve Reasoning in Large Language Models**|Zhikun Xu et.al.|[2410.16235v1](http://arxiv.org/abs/2410.16235v1)|null| -|**2024-10-21**|**Building A Coding Assistant via the Retrieval-Augmented Language Model**|Xinze Li et.al.|[2410.16229v1](http://arxiv.org/abs/2410.16229v1)|null| -|**2024-10-21**|**A Realistic Threat Model for Large Language Model Jailbreaks**|Valentyn Boreiko et.al.|[2410.16222v1](http://arxiv.org/abs/2410.16222v1)|null| -|**2024-10-21**|**On Creating an English-Thai Code-switched Machine Translation in Medical Domain**|Parinthapat Pengpun et.al.|[2410.16221v1](http://arxiv.org/abs/2410.16221v1)|null| -|**2024-10-21**|**Sample Optimal and Memory Efficient Quantum State Tomography**|Yanglin Hu et.al.|[2410.16220v1](http://arxiv.org/abs/2410.16220v1)|null| -|**2024-10-21**|**Pre-training Distillation for Large Language Models: A Design Space Exploration**|Hao Peng et.al.|[2410.16215v1](http://arxiv.org/abs/2410.16215v1)|null| -|**2024-10-21**|**Asymmetries in Financial Spillovers**|Florian Huber et.al.|[2410.16214v1](http://arxiv.org/abs/2410.16214v1)|null| -|**2024-10-21**|**Comprehensive benchmarking of large language models for RNA secondary structure prediction**|L. I. Zablocki et.al.|[2410.16212v1](http://arxiv.org/abs/2410.16212v1)|[link](https://github.com/sinc-lab/rna-llm-folding)| -|**2024-10-21**|**Compute-Constrained Data Selection**|Junjie Oscar Yin et.al.|[2410.16208v1](http://arxiv.org/abs/2410.16208v1)|[link](https://github.com/oseyosey/ccds)| -|**2024-10-21**|**CoT-TL: Low-Resource Temporal Knowledge Representation of Planning Instructions Using Chain-of-Thought Reasoning**|Kumar Manas et.al.|[2410.16207v1](http://arxiv.org/abs/2410.16207v1)|null| -|**2024-10-21**|**Systematic Review: Text Processing Algorithms in Machine Learning and Deep Learning for Mental Health Detection on Social Media**|Yuchen Cao et.al.|[2410.16204v1](http://arxiv.org/abs/2410.16204v1)|null| -|**2024-10-21**|**Theoretical Limitations of Ensembles in the Age of Overparameterization**|Niclas Dern et.al.|[2410.16201v1](http://arxiv.org/abs/2410.16201v1)|null| -|**2024-10-21**|**Improve Vision Language Model Chain-of-thought Reasoning**|Ruohong Zhang et.al.|[2410.16198v1](http://arxiv.org/abs/2410.16198v1)|null| -|**2024-10-21**|**A Trust-Region Method for Graphical Stein Variational Inference**|Liam Pavlovic et.al.|[2410.16195v1](http://arxiv.org/abs/2410.16195v1)|null| -|**2024-10-21**|**Training Better Deep Learning Models Using Human Saliency**|Aidan Boyd et.al.|[2410.16190v1](http://arxiv.org/abs/2410.16190v1)|null| -|**2024-10-21**|**Interpretable Prediction Rule Ensembles in the Presence of Missing Data**|Vincent Schroeder et.al.|[2410.16187v1](http://arxiv.org/abs/2410.16187v1)|null| -|**2024-10-21**|**RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style**|Yantao Liu et.al.|[2410.16184v1](http://arxiv.org/abs/2410.16184v1)|[link](https://github.com/thu-keg/rm-bench)| +|**2024-10-21**|**DMM: Distributed Matrix Mechanism for Differentially-Private Federated Learning using Packed Secret Sharing**|Alexander Bienstock et.al.|[2410.16161v1](http://arxiv.org/abs/2410.16161v1)|null| +|**2024-10-21**|**Extracting Spatiotemporal Data from Gradients with Large Language Models**|Lele Zheng et.al.|[2410.16121v1](http://arxiv.org/abs/2410.16121v1)|null| +|**2024-10-21**|**Distributed Learning for UAV Swarms**|Chen Hu et.al.|[2410.15882v1](http://arxiv.org/abs/2410.15882v1)|null| +|**2024-10-21**|**Geographical Node Clustering and Grouping to Guarantee Data IIDness in Federated Learning**|Minkwon Lee et.al.|[2410.15693v1](http://arxiv.org/abs/2410.15693v1)|null| +|**2024-10-21**|**Federated Learning with MMD-based Early Stopping for Adaptive GNSS Interference Classification**|Nishant S. Gaikwad et.al.|[2410.15681v1](http://arxiv.org/abs/2410.15681v1)|null| +|**2024-10-20**|**MIRA: A Method of Federated MultI-Task Learning for LaRge LAnguage Models**|Ahmed Elbakary et.al.|[2410.15524v1](http://arxiv.org/abs/2410.15524v1)|null| +|**2024-10-20**|**Bayesian data fusion for distributed learning**|Peng Wu et.al.|[2410.15473v1](http://arxiv.org/abs/2410.15473v1)|null| +|**2024-10-20**|**Tighter Performance Theory of FedExProx**|Wojciech Anyszka et.al.|[2410.15368v1](http://arxiv.org/abs/2410.15368v1)|null| +|**2024-10-19**|**DPVS-Shapley:Faster and Universal Contribution Evaluation Component in Federated Learning**|Ketin Yin et.al.|[2410.15093v1](http://arxiv.org/abs/2410.15093v1)|null| +|**2024-10-19**|**Personalized Federated Learning with Adaptive Feature Aggregation and Knowledge Transfer**|Keting Yin et.al.|[2410.15073v1](http://arxiv.org/abs/2410.15073v1)|null| +|**2024-10-19**|**Mind the Remaining: Mechanism Design for Robust Federated Unlearning**|Jiaqi Shao et.al.|[2410.15045v1](http://arxiv.org/abs/2410.15045v1)|null| +|**2024-10-18**|**FedSpaLLM: Federated Pruning of Large Language Models**|Guangji Bai et.al.|[2410.14852v1](http://arxiv.org/abs/2410.14852v1)|null| +|**2024-10-18**|**Assessing the Impact of AR-Assisted Warnings on Roadway Workers' Stress Under Different Workload Conditions**|Fatemeh Banani Ardecani et.al.|[2410.14537v1](http://arxiv.org/abs/2410.14537v1)|null| +|**2024-10-18**|**Personalizing Low-Rank Bayesian Neural Networks Via Federated Learning**|Boning Zhang et.al.|[2410.14390v1](http://arxiv.org/abs/2410.14390v1)|null| +|**2024-10-18**|**Comparative Evaluation of Clustered Federated Learning Method**|Michael Ben Ali et.al.|[2410.14212v1](http://arxiv.org/abs/2410.14212v1)|[link](https://github.com/leahcimali/Comparative-Evaluation-of-Clustered-Federated-Learning-Methods)| +|**2024-10-18**|**FedMSE: Federated learning for IoT network intrusion detection**|Van Tuan Nguyen et.al.|[2410.14121v1](http://arxiv.org/abs/2410.14121v1)|[link](https://github.com/dino-chiio/fedmse)| +|**2024-10-18**|**A Communication and Computation Efficient Fully First-order Method for Decentralized Bilevel Optimization**|Min Wen et.al.|[2410.14115v1](http://arxiv.org/abs/2410.14115v1)|null| +|**2024-10-17**|**FedPAE: Peer-Adaptive Ensemble Learning for Asynchronous and Model-Heterogeneous Federated Learning**|Brianna Mueller et.al.|[2410.14075v1](http://arxiv.org/abs/2410.14075v1)|null| +|**2024-10-17**|**Optimal Communication and Key Rate Region for Hierarchical Secure Aggregation with User Collusion**|Xiang Zhang et.al.|[2410.14035v2](http://arxiv.org/abs/2410.14035v2)|null| +|**2024-10-17**|**Conformal Prediction for Federated Graph Neural Networks with Missing Neighbor Information**|Ömer Faruk Akgül et.al.|[2410.14010v1](http://arxiv.org/abs/2410.14010v1)|null| +|**2024-10-17**|**DPFedBank: Crafting a Privacy-Preserving Federated Learning Framework for Financial Institutions with Policy Pillars**|Peilin He et.al.|[2410.13753v1](http://arxiv.org/abs/2410.13753v1)|null| +|**2024-10-17**|**On-device Federated Learning in Smartphones for Detecting Depression from Reddit Posts**|Mustofa Ahmed et.al.|[2410.13709v1](http://arxiv.org/abs/2410.13709v1)|null| +|**2024-10-17**|**Towards Satellite Non-IID Imagery: A Spectral Clustering-Assisted Federated Learning Approach**|Luyao Zou et.al.|[2410.13602v2](http://arxiv.org/abs/2410.13602v2)|null| +|**2024-10-17**|**Mathematical modelling to inform outbreak response vaccination**|Manjari Shankar et.al.|[2410.13923v1](http://arxiv.org/abs/2410.13923v1)|null| +|**2024-10-17**|**Breaking the Manual Annotation Bottleneck: Creating a Comprehensive Legal Case Criticality Dataset through Semi-Automated Labeling**|Ronja Stern et.al.|[2410.13460v1](http://arxiv.org/abs/2410.13460v1)|null| +|**2024-10-17**|**Unlocking Legal Knowledge: A Multilingual Dataset for Judicial Summarization in Switzerland**|Luca Rolshoven et.al.|[2410.13456v1](http://arxiv.org/abs/2410.13456v1)|null| +|**2024-10-17**|**Towards Formal Verification of Federated Learning Orchestration Protocols on Satellites**|Miroslav Popovic et.al.|[2410.13429v1](http://arxiv.org/abs/2410.13429v1)|null| +|**2024-10-17**|**FRAG: Toward Federated Vector Database Management for Collaborative and Secure Retrieval-Augmented Generation**|Dongfang Zhao et.al.|[2410.13272v1](http://arxiv.org/abs/2410.13272v1)|null| +|**2024-10-17**|**Cyber Attacks Prevention Towards Prosumer-based EV Charging Stations: An Edge-assisted Federated Prototype Knowledge Distillation Approach**|Luyao Zou et.al.|[2410.13260v2](http://arxiv.org/abs/2410.13260v2)|null| +|**2024-10-17**|**Investigating Effective Speaker Property Privacy Protection in Federated Learning for Speech Emotion Recognition**|Chao Tan et.al.|[2410.13221v1](http://arxiv.org/abs/2410.13221v1)|null| -### Heterogeneous +### Benchmark |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | |**2024-10-21**|**DMM: Distributed Matrix Mechanism for Differentially-Private Federated Learning using Packed Secret Sharing**|Alexander Bienstock et.al.|[2410.16161v1](http://arxiv.org/abs/2410.16161v1)|null| @@ -621,7 +483,7 @@ |**2024-10-19**|**Audio Processing using Pattern Recognition for Music Genre Classification**|Sivangi Chatterjee et.al.|[2410.14990v1](http://arxiv.org/abs/2410.14990v1)|null| |**2024-10-19**|**Taming the Long Tail in Human Mobility Prediction**|Xiaohang Xu et.al.|[2410.14970v1](http://arxiv.org/abs/2410.14970v1)|[link](https://github.com/yukayo/lotnext)| -### Benchmark +### Heterogeneous |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | |**2024-10-21**|**DMM: Distributed Matrix Mechanism for Differentially-Private Federated Learning using Packed Secret Sharing**|Alexander Bienstock et.al.|[2410.16161v1](http://arxiv.org/abs/2410.16161v1)|null| @@ -655,7 +517,7 @@ |**2024-10-17**|**Cyber Attacks Prevention Towards Prosumer-based EV Charging Stations: An Edge-assisted Federated Prototype Knowledge Distillation Approach**|Luyao Zou et.al.|[2410.13260v2](http://arxiv.org/abs/2410.13260v2)|null| |**2024-10-17**|**Investigating Effective Speaker Property Privacy Protection in Federated Learning for Speech Emotion Recognition**|Chao Tan et.al.|[2410.13221v1](http://arxiv.org/abs/2410.13221v1)|null| -### Optimization +### Dataset |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | |**2024-10-21**|**DMM: Distributed Matrix Mechanism for Differentially-Private Federated Learning using Packed Secret Sharing**|Alexander Bienstock et.al.|[2410.16161v1](http://arxiv.org/abs/2410.16161v1)|null| @@ -689,7 +551,7 @@ |**2024-10-17**|**Cyber Attacks Prevention Towards Prosumer-based EV Charging Stations: An Edge-assisted Federated Prototype Knowledge Distillation Approach**|Luyao Zou et.al.|[2410.13260v2](http://arxiv.org/abs/2410.13260v2)|null| |**2024-10-17**|**Investigating Effective Speaker Property Privacy Protection in Federated Learning for Speech Emotion Recognition**|Chao Tan et.al.|[2410.13221v1](http://arxiv.org/abs/2410.13221v1)|null| -### Communication +### Asynchronous |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | |**2024-10-21**|**DMM: Distributed Matrix Mechanism for Differentially-Private Federated Learning using Packed Secret Sharing**|Alexander Bienstock et.al.|[2410.16161v1](http://arxiv.org/abs/2410.16161v1)|null| @@ -723,7 +585,7 @@ |**2024-10-17**|**Cyber Attacks Prevention Towards Prosumer-based EV Charging Stations: An Edge-assisted Federated Prototype Knowledge Distillation Approach**|Luyao Zou et.al.|[2410.13260v2](http://arxiv.org/abs/2410.13260v2)|null| |**2024-10-17**|**Investigating Effective Speaker Property Privacy Protection in Federated Learning for Speech Emotion Recognition**|Chao Tan et.al.|[2410.13221v1](http://arxiv.org/abs/2410.13221v1)|null| -### Framework +### Privacy |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | |**2024-10-21**|**DMM: Distributed Matrix Mechanism for Differentially-Private Federated Learning using Packed Secret Sharing**|Alexander Bienstock et.al.|[2410.16161v1](http://arxiv.org/abs/2410.16161v1)|null| @@ -757,145 +619,283 @@ |**2024-10-17**|**Cyber Attacks Prevention Towards Prosumer-based EV Charging Stations: An Edge-assisted Federated Prototype Knowledge Distillation Approach**|Luyao Zou et.al.|[2410.13260v2](http://arxiv.org/abs/2410.13260v2)|null| |**2024-10-17**|**Investigating Effective Speaker Property Privacy Protection in Federated Learning for Speech Emotion Recognition**|Chao Tan et.al.|[2410.13221v1](http://arxiv.org/abs/2410.13221v1)|null| -### Efficient +## Computer Vision + +### Image Matching +|Publish Date|Title|Authors|PDF|Code| +| :---: | :---: | :---: | :---: | :---: | +|**2024-10-21**|**MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors**|Honghua Chen et.al.|[2410.16272v1](http://arxiv.org/abs/2410.16272v1)|null| +|**2024-10-21**|**Imaging supermoire relaxation and conductive domain walls in helical trilayer graphene**|Jesse C. Hoke et.al.|[2410.16269v1](http://arxiv.org/abs/2410.16269v1)|null| +|**2024-10-21**|**SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree**|Shuangrui Ding et.al.|[2410.16268v1](http://arxiv.org/abs/2410.16268v1)|[link](https://github.com/mark12ding/sam2long)| +|**2024-10-21**|**3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors**|Xi Liu et.al.|[2410.16266v1](http://arxiv.org/abs/2410.16266v1)|null| +|**2024-10-21**|**Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance**|Zhangwei Gao et.al.|[2410.16261v1](http://arxiv.org/abs/2410.16261v1)|[link](https://github.com/opengvlab/internvl)| +|**2024-10-21**|**Elucidating the design space of language models for image generation**|Xuantong Liu et.al.|[2410.16257v1](http://arxiv.org/abs/2410.16257v1)|null| +|**2024-10-21**|**Revisiting Deep Feature Reconstruction for Logical and Structural Industrial Anomaly Detection**|Sukanya Patra et.al.|[2410.16255v1](http://arxiv.org/abs/2410.16255v1)|[link](https://github.com/sukanyapatra1997/ulsad-2024)| +|**2024-10-21**|**Distribution Learning with Valid Outputs Beyond the Worst-Case**|Nick Rittler et.al.|[2410.16253v1](http://arxiv.org/abs/2410.16253v1)|null| +|**2024-10-21**|**Composing Optimized Stepsize Schedules for Gradient Descent**|Benjamin Grimmer et.al.|[2410.16249v1](http://arxiv.org/abs/2410.16249v1)|null| +|**2024-10-21**|**Implicit Regularization for Tubal Tensor Factorizations via Gradient Descent**|Santhosh Karnik et.al.|[2410.16247v1](http://arxiv.org/abs/2410.16247v1)|null| +|**2024-10-21**|**Deep Radiomics Detection of Clinically Significant Prostate Cancer on Multicenter MRI: Initial Comparison to PI-RADS Assessment**|G. A. Nketiah et.al.|[2410.16238v1](http://arxiv.org/abs/2410.16238v1)|null| +|**2024-10-21**|**Gas puff imaging of plasma turbulence in the magnetic island scrape-off layer of W7-X**|S. G. Baek et.al.|[2410.16194v1](http://arxiv.org/abs/2410.16194v1)|null| +|**2024-10-21**|**Training Better Deep Learning Models Using Human Saliency**|Aidan Boyd et.al.|[2410.16190v1](http://arxiv.org/abs/2410.16190v1)|null| +|**2024-10-21**|**Interpretable Prediction Rule Ensembles in the Presence of Missing Data**|Vincent Schroeder et.al.|[2410.16187v1](http://arxiv.org/abs/2410.16187v1)|null| +|**2024-10-21**|**A Framework for Evaluating Predictive Models Using Synthetic Image Covariates and Longitudinal Data**|Simon Deltadahl et.al.|[2410.16177v1](http://arxiv.org/abs/2410.16177v1)|null| +|**2024-10-21**|**Correcting for Selection Biases in the Determination of the Hubble Constant from Time-Delay Cosmography**|ian Li et.al.|[2410.16171v1](http://arxiv.org/abs/2410.16171v1)|null| +|**2024-10-21**|**Beyond Filtering: Adaptive Image-Text Quality Enhancement for MLLM Pretraining**|Han Huang et.al.|[2410.16166v1](http://arxiv.org/abs/2410.16166v1)|[link](https://github.com/hanhuang22/aitqe)| +|**2024-10-21**|**Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models**|Yufei Zhan et.al.|[2410.16163v1](http://arxiv.org/abs/2410.16163v1)|[link](https://github.com/jefferyzhan/griffon)| +|**2024-10-21**|**Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models**|Giannis Daras et.al.|[2410.16152v1](http://arxiv.org/abs/2410.16152v1)|null| +|**2024-10-21**|**Denoising Hyperbolic-Valued Data by Relaxed Regularizations**|Robert Beinert et.al.|[2410.16149v1](http://arxiv.org/abs/2410.16149v1)|null| +|**2024-10-21**|**Holographic dictionary for Lifshitz and hyperscaling violating black holes**|Wan Cong et.al.|[2410.16145v1](http://arxiv.org/abs/2410.16145v1)|null| +|**2024-10-21**|**An Explainable Contrastive-based Dilated Convolutional Network with Transformer for Pediatric Pneumonia Detection**|Chandravardhan Singh Raghaw et.al.|[2410.16143v1](http://arxiv.org/abs/2410.16143v1)|null| +|**2024-10-21**|**Extracting Spatiotemporal Data from Gradients with Large Language Models**|Lele Zheng et.al.|[2410.16121v1](http://arxiv.org/abs/2410.16121v1)|null| +|**2024-10-21**|**Ultra-High-Energy Gamma-Ray Bubble around Microquasar V4641 Sgr**|R. Alfaro et.al.|[2410.16117v1](http://arxiv.org/abs/2410.16117v1)|null| +|**2024-10-21**|**Multimodal Flare Forecasting with Deep Learning**|Grégoire Francisco et.al.|[2410.16116v1](http://arxiv.org/abs/2410.16116v1)|null| +|**2024-10-21**|**Increasing Interpretability of Neural Networks By Approximating Human Visual Saliency**|Aidan Boyd et.al.|[2410.16115v1](http://arxiv.org/abs/2410.16115v1)|null| +|**2024-10-21**|**Do LLMs write like humans? Variation in grammatical and rhetorical styles**|Alex Reinhart et.al.|[2410.16107v1](http://arxiv.org/abs/2410.16107v1)|null| +|**2024-10-21**|**Addressing Spectral Bias of Deep Neural Networks by Multi-Grade Deep Learning**|Ronglong Fang et.al.|[2410.16105v1](http://arxiv.org/abs/2410.16105v1)|[link](https://github.com/ronglong-fang/addressingspectralbiasviamgdl)| +|**2024-10-21**|**LMHaze: Intensity-aware Image Dehazing with a Large-scale Multi-intensity Real Haze Dataset**|Ruikun Zhang et.al.|[2410.16095v1](http://arxiv.org/abs/2410.16095v1)|[link](https://github.com/wangzrk/LMHaze)| +|**2024-10-21**|**Streaming and Communication Complexity of Load-Balancing via Matching Contractors**|Sepehr Assadi et.al.|[2410.16094v1](http://arxiv.org/abs/2410.16094v1)|null| + +### Keypoint Detection +|Publish Date|Title|Authors|PDF|Code| +| :---: | :---: | :---: | :---: | :---: | +|**2024-10-21**|**Accelerating Discovery of Extreme Lattice Thermal Conductivity by Crystal Attention Graph Neural Network (CATGNN) Using Chemical Bonding Intuitive Descriptors**|Mohammed Al-Fahdi et.al.|[2410.16066v1](http://arxiv.org/abs/2410.16066v1)|null| +|**2024-10-21**|**RANSAC Back to SOTA: A Two-stage Consensus Filtering for Real-time 3D Registration**|Pengcheng Shi et.al.|[2410.15682v1](http://arxiv.org/abs/2410.15682v1)|[link](https://github.com/shipc-ai/tcf)| +|**2024-10-19**|**The shape of the brain's connections is predictive of cognitive performance: an explainable machine learning study**|Yui Lo et.al.|[2410.15108v1](http://arxiv.org/abs/2410.15108v1)|null| +|**2024-10-18**|**Sim2real Cattle Joint Estimation in 3D point clouds**|Okour Mohammad et.al.|[2410.14419v1](http://arxiv.org/abs/2410.14419v1)|null| +|**2024-10-18**|**Functionality Determinants and Pore-level Quantitative Structure-Activity Relationship (QSAR) Approach for Water Permeation Rate in Aquaporins**|Juan José Galano-Frutos et.al.|[2410.14355v1](http://arxiv.org/abs/2410.14355v1)|null| +|**2024-10-18**|**2D-3D Deformable Image Registration of Histology Slide and Micro-CT with ML-based Initialization**|Junan Chen et.al.|[2410.14343v1](http://arxiv.org/abs/2410.14343v1)|null| +|**2024-10-17**|**Navigation maps of the material space for automated self-driving labs of the future**|Daniel E Widdowson et.al.|[2410.13796v1](http://arxiv.org/abs/2410.13796v1)|null| +|**2024-10-17**|**DualQuat-LOAM: LiDAR Odometry and Mapping parametrized on Dual Quaternions**|Edison P. Velasco-Sánchez et.al.|[2410.13541v1](http://arxiv.org/abs/2410.13541v1)|null| +|**2024-10-16**|**PND-Net: Plant Nutrition Deficiency and Disease Classification using Graph Convolutional Network**|Asish Bera et.al.|[2410.12742v1](http://arxiv.org/abs/2410.12742v1)|null| +|**2024-10-16**|**RAFA-Net: Region Attention Network For Food Items And Agricultural Stress Recognition**|Asish Bera et.al.|[2410.12718v1](http://arxiv.org/abs/2410.12718v1)|null| +|**2024-10-16**|**Optimizing Multi-Task Learning for Accurate Spacecraft Pose Estimation**|Francesco Evangelisti et.al.|[2410.12679v1](http://arxiv.org/abs/2410.12679v1)|null| +|**2024-10-16**|**AdaCropFollow: Self-Supervised Online Adaptation for Visual Under-Canopy Navigation**|Arun N. Sivakumar et.al.|[2410.12411v1](http://arxiv.org/abs/2410.12411v1)|null| +|**2024-10-15**|**Stochastic 3D reconstruction of cracked polycrystalline NMC particles using 2D SEM data**|Philipp Rieder et.al.|[2410.12020v1](http://arxiv.org/abs/2410.12020v1)|null| +|**2024-10-15**|**NavTopo: Leveraging Topological Maps For Autonomous Navigation Of a Mobile Robot**|Kirill Muravyev et.al.|[2410.11492v1](http://arxiv.org/abs/2410.11492v1)|null| +|**2024-10-14**|**Assessing Bias in Metric Models for LLM Open-Ended Generation Bias Benchmarks**|Nathaniel Demchak et.al.|[2410.11059v1](http://arxiv.org/abs/2410.11059v1)|null| +|**2024-10-14**|**3DArticCyclists: Generating Simulated Dynamic 3D Cyclists for Human-Object Interaction (HOI) and Autonomous Driving Applications**|Eduardo R. Corral-Soto et.al.|[2410.10782v1](http://arxiv.org/abs/2410.10782v1)|null| +|**2024-10-14**|**A Consistency-Aware Spot-Guided Transformer for Versatile and Hierarchical Point Cloud Registration**|Renlang Huang et.al.|[2410.10295v1](http://arxiv.org/abs/2410.10295v1)|[link](https://github.com/renlanghuang/cast)| +|**2024-10-13**|**Occluded Human Pose Estimation based on Limb Joint Augmentation**|Gangtao Han et.al.|[2410.09885v1](http://arxiv.org/abs/2410.09885v1)|null| +|**2024-10-13**|**Text4Seg: Reimagining Image Segmentation as Text Generation**|Mengcheng Lan et.al.|[2410.09855v1](http://arxiv.org/abs/2410.09855v1)|[link](https://github.com/mc-lan/text4seg)| +|**2024-10-12**|**Many-body Expansion Based Machine Learning Models for Octahedral Transition Metal Complexes**|Ralf Meyer et.al.|[2410.09659v1](http://arxiv.org/abs/2410.09659v1)|[link](https://github.com/hjkgrp/many_body_ml)| +|**2024-10-12**|**Objective Measurements of Voice Quality**|Hira Dhamyal et.al.|[2410.09578v1](http://arxiv.org/abs/2410.09578v1)|null| +|**2024-10-12**|**Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence**|Felipe Cadar et.al.|[2410.09533v1](http://arxiv.org/abs/2410.09533v1)|[link](https://github.com/verlab/DescriptorReasoning_ACCV_2024)| +|**2024-10-11**|**Enhanced Kalman with Adaptive Appearance Motion SORT for Grounded Generic Multiple Object Tracking**|Duy Le Dinh Anh et.al.|[2410.09243v1](http://arxiv.org/abs/2410.09243v1)|null| +|**2024-10-11**|**Facial Chick Sexing: An Automated Chick Sexing System From Chick Facial Image**|Marta Veganzones Rodriguez et.al.|[2410.09155v1](http://arxiv.org/abs/2410.09155v1)|null| +|**2024-10-11**|**Learning Spatial Bimanual Action Models Based on Affordance Regions and Human Demonstrations**|Björn S. Plonka et.al.|[2410.08848v1](http://arxiv.org/abs/2410.08848v1)|null| +|**2024-10-11**|**Data-Driven Discovery of the Origins of UV Absorption in Alpha-3C Protein**|Germaine Neza Hozana et.al.|[2410.08624v1](http://arxiv.org/abs/2410.08624v1)|null| +|**2024-10-11**|**Study on (r,s)- Generalised Transformation Graphs, A Novel Perspective Based on Transformation Graphs**|Parvez Ali et.al.|[2410.09122v1](http://arxiv.org/abs/2410.09122v1)|null| +|**2024-10-10**|**Electronic structure prediction of medium and high entropy alloys across composition space**|Shashank Pathrudkar et.al.|[2410.08294v1](http://arxiv.org/abs/2410.08294v1)|null| +|**2024-10-10**|**Using Lagrangian descriptors to calculate the Maslov index of periodic orbits**|J. Montes et.al.|[2410.08171v1](http://arxiv.org/abs/2410.08171v1)|null| +|**2024-10-10**|**Invisibility Cloak: Disappearance under Human Pose Estimation via Backdoor Attacks**|Minxing Zhang et.al.|[2410.07670v1](http://arxiv.org/abs/2410.07670v1)|null| + +### Semantic Segmentation +|Publish Date|Title|Authors|PDF|Code| +| :---: | :---: | :---: | :---: | :---: | +|**2024-10-21**|**SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree**|Shuangrui Ding et.al.|[2410.16268v1](http://arxiv.org/abs/2410.16268v1)|[link](https://github.com/mark12ding/sam2long)| +|**2024-10-21**|**Deep Radiomics Detection of Clinically Significant Prostate Cancer on Multicenter MRI: Initial Comparison to PI-RADS Assessment**|G. A. Nketiah et.al.|[2410.16238v1](http://arxiv.org/abs/2410.16238v1)|null| +|**2024-10-21**|**ToW: Thoughts of Words Improve Reasoning in Large Language Models**|Zhikun Xu et.al.|[2410.16235v1](http://arxiv.org/abs/2410.16235v1)|null| +|**2024-10-21**|**Building A Coding Assistant via the Retrieval-Augmented Language Model**|Xinze Li et.al.|[2410.16229v1](http://arxiv.org/abs/2410.16229v1)|null| +|**2024-10-21**|**Comprehensive benchmarking of large language models for RNA secondary structure prediction**|L. I. Zablocki et.al.|[2410.16212v1](http://arxiv.org/abs/2410.16212v1)|[link](https://github.com/sinc-lab/rna-llm-folding)| +|**2024-10-21**|**CoT-TL: Low-Resource Temporal Knowledge Representation of Planning Instructions Using Chain-of-Thought Reasoning**|Kumar Manas et.al.|[2410.16207v1](http://arxiv.org/abs/2410.16207v1)|null| +|**2024-10-21**|**Beyond Filtering: Adaptive Image-Text Quality Enhancement for MLLM Pretraining**|Han Huang et.al.|[2410.16166v1](http://arxiv.org/abs/2410.16166v1)|[link](https://github.com/hanhuang22/aitqe)| +|**2024-10-21**|**Limpeh ga li gong: Challenges in Singlish Annotations**|Lynnette Hui Xian Ng et.al.|[2410.16156v1](http://arxiv.org/abs/2410.16156v1)|null| +|**2024-10-21**|**PODTILE: Facilitating Podcast Episode Browsing with Auto-generated Chapters**|Azin Ghazimatin et.al.|[2410.16148v1](http://arxiv.org/abs/2410.16148v1)|null| +|**2024-10-21**|**Towards Combating Frequency Simplicity-biased Learning for Domain Generalization**|Xilin He et.al.|[2410.16146v1](http://arxiv.org/abs/2410.16146v1)|[link](https://github.com/c0notsilly/advfrequency)| +|**2024-10-21**|**Semantics of Sets of Programs**|Jinwoo Kim et.al.|[2410.16102v1](http://arxiv.org/abs/2410.16102v1)|null| +|**2024-10-21**|**ChangeGuard: Validating Code Changes via Pairwise Learning-Guided Execution**|Lars Gröninger et.al.|[2410.16092v1](http://arxiv.org/abs/2410.16092v1)|null| +|**2024-10-21**|**Integrated Image-Text Based on Semi-supervised Learning for Small Sample Instance Segmentation**|Ruting Chi et.al.|[2410.16063v1](http://arxiv.org/abs/2410.16063v1)|null| +|**2024-10-21**|**Label Filling via Mixed Supervision for Medical Image Segmentation from Noisy Annotations**|Ming Li et.al.|[2410.16057v1](http://arxiv.org/abs/2410.16057v1)|null| +|**2024-10-21**|**Continuous Speech Synthesis using per-token Latent Diffusion**|Arnon Turetzky et.al.|[2410.16048v1](http://arxiv.org/abs/2410.16048v1)|null| +|**2024-10-21**|**CA*: Addressing Evaluation Pitfalls in Computation-Aware Latency for Simultaneous Speech Translation**|Xi Xu et.al.|[2410.16011v1](http://arxiv.org/abs/2410.16011v1)|null| +|**2024-10-21**|**Visual Representation Learning Guided By Multi-modal Prior Knowledge**|Hongkuan Zhou et.al.|[2410.15981v1](http://arxiv.org/abs/2410.15981v1)|null| +|**2024-10-21**|**Fundamental sequences based on localization**|Gunnar Wilken et.al.|[2410.15953v1](http://arxiv.org/abs/2410.15953v1)|null| +|**2024-10-21**|**Redefining Finance: The Influence of Artificial Intelligence (AI) and Machine Learning (ML)**|Animesh Kumar et.al.|[2410.15951v1](http://arxiv.org/abs/2410.15951v1)|null| +|**2024-10-21**|**Focus on BEV: Self-calibrated Cycle View Transformation for Monocular Birds-Eye-View Segmentation**|Jiawei Zhao et.al.|[2410.15932v1](http://arxiv.org/abs/2410.15932v1)|null| +|**2024-10-21**|**Centrality-aware Product Retrieval and Ranking**|Hadeel Saadany et.al.|[2410.15930v1](http://arxiv.org/abs/2410.15930v1)|null| +|**2024-10-21**|**Leveraging CORAL-Correlation Consistency Network for Semi-Supervised Left Atrium MRI Segmentation**|Xinze Li et.al.|[2410.15916v1](http://arxiv.org/abs/2410.15916v1)|[link](https://github.com/Powertony102/corn_official)| +|**2024-10-21**|**How to Build a Pre-trained Multimodal model for Simultaneously Chatting and Decision-making?**|Zuojin Tang et.al.|[2410.15885v1](http://arxiv.org/abs/2410.15885v1)|null| +|**2024-10-21**|**Principles of semantic and functional efficiency in grammatical patterning**|Emily Cheng et.al.|[2410.15865v1](http://arxiv.org/abs/2410.15865v1)|null| +|**2024-10-21**|**LiOn-XA: Unsupervised Domain Adaptation via LiDAR-Only Cross-Modal Adversarial Training**|Thomas Kreutz et.al.|[2410.15833v1](http://arxiv.org/abs/2410.15833v1)|[link](https://github.com/jensle97/lion-xa)| +|**2024-10-21**|**Large Language Models Empower Personalized Valuation in Auction**|Jie Sun et.al.|[2410.15817v1](http://arxiv.org/abs/2410.15817v1)|null| +|**2024-10-21**|**Kaninfradet3D:A Road-side Camera-LiDAR Fusion 3D Perception Model based on Nonlinear Feature Extraction and Intrinsic Correlation**|Pei Liu et.al.|[2410.15814v1](http://arxiv.org/abs/2410.15814v1)|null| +|**2024-10-21**|**FusionLungNet: Multi-scale Fusion Convolution with Refinement Network for Lung CT Image Segmentation**|Sadjad Rezvani et.al.|[2410.15812v1](http://arxiv.org/abs/2410.15812v1)|null| +|**2024-10-21**|**Data-Efficient CLIP-Powered Dual-Branch Networks for Source-Free Unsupervised Domain Adaptation**|Yongguang Li et.al.|[2410.15811v1](http://arxiv.org/abs/2410.15811v1)|null| +|**2024-10-21**|**Habaek: High-performance water segmentation through dataset expansion and inductive bias optimization**|Hanseon Joo et.al.|[2410.15794v1](http://arxiv.org/abs/2410.15794v1)|[link](https://github.com/HanseonJoo/Habaek)| + +### Image Classification +|Publish Date|Title|Authors|PDF|Code| +| :---: | :---: | :---: | :---: | :---: | +|**2024-10-21**|**MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors**|Honghua Chen et.al.|[2410.16272v1](http://arxiv.org/abs/2410.16272v1)|null| +|**2024-10-21**|**Imaging supermoire relaxation and conductive domain walls in helical trilayer graphene**|Jesse C. Hoke et.al.|[2410.16269v1](http://arxiv.org/abs/2410.16269v1)|null| +|**2024-10-21**|**SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree**|Shuangrui Ding et.al.|[2410.16268v1](http://arxiv.org/abs/2410.16268v1)|[link](https://github.com/mark12ding/sam2long)| +|**2024-10-21**|**3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors**|Xi Liu et.al.|[2410.16266v1](http://arxiv.org/abs/2410.16266v1)|null| +|**2024-10-21**|**Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance**|Zhangwei Gao et.al.|[2410.16261v1](http://arxiv.org/abs/2410.16261v1)|[link](https://github.com/opengvlab/internvl)| +|**2024-10-21**|**Elucidating the design space of language models for image generation**|Xuantong Liu et.al.|[2410.16257v1](http://arxiv.org/abs/2410.16257v1)|null| +|**2024-10-21**|**Revisiting Deep Feature Reconstruction for Logical and Structural Industrial Anomaly Detection**|Sukanya Patra et.al.|[2410.16255v1](http://arxiv.org/abs/2410.16255v1)|[link](https://github.com/sukanyapatra1997/ulsad-2024)| +|**2024-10-21**|**Distribution Learning with Valid Outputs Beyond the Worst-Case**|Nick Rittler et.al.|[2410.16253v1](http://arxiv.org/abs/2410.16253v1)|null| +|**2024-10-21**|**Implicit Regularization for Tubal Tensor Factorizations via Gradient Descent**|Santhosh Karnik et.al.|[2410.16247v1](http://arxiv.org/abs/2410.16247v1)|null| +|**2024-10-21**|**Serendipitous detection of an intense X-ray flare in the weak-line T Tauri star KM Ori with SRG/eROSITA**|Savithri H. Ezhikode et.al.|[2410.16241v1](http://arxiv.org/abs/2410.16241v1)|null| +|**2024-10-21**|**MoRE: Multi-Modal Contrastive Pre-training with Transformers on X-Rays, ECGs, and Diagnostic Report**|Samrajya Thapa et.al.|[2410.16239v1](http://arxiv.org/abs/2410.16239v1)|null| +|**2024-10-21**|**Deep Radiomics Detection of Clinically Significant Prostate Cancer on Multicenter MRI: Initial Comparison to PI-RADS Assessment**|G. A. Nketiah et.al.|[2410.16238v1](http://arxiv.org/abs/2410.16238v1)|null| +|**2024-10-21**|**Gas puff imaging of plasma turbulence in the magnetic island scrape-off layer of W7-X**|S. G. Baek et.al.|[2410.16194v1](http://arxiv.org/abs/2410.16194v1)|null| +|**2024-10-21**|**Training Better Deep Learning Models Using Human Saliency**|Aidan Boyd et.al.|[2410.16190v1](http://arxiv.org/abs/2410.16190v1)|null| +|**2024-10-21**|**A Framework for Evaluating Predictive Models Using Synthetic Image Covariates and Longitudinal Data**|Simon Deltadahl et.al.|[2410.16177v1](http://arxiv.org/abs/2410.16177v1)|null| +|**2024-10-21**|**Correcting for Selection Biases in the Determination of the Hubble Constant from Time-Delay Cosmography**|ian Li et.al.|[2410.16171v1](http://arxiv.org/abs/2410.16171v1)|null| +|**2024-10-21**|**Beyond Filtering: Adaptive Image-Text Quality Enhancement for MLLM Pretraining**|Han Huang et.al.|[2410.16166v1](http://arxiv.org/abs/2410.16166v1)|[link](https://github.com/hanhuang22/aitqe)| +|**2024-10-21**|**Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models**|Yufei Zhan et.al.|[2410.16163v1](http://arxiv.org/abs/2410.16163v1)|[link](https://github.com/jefferyzhan/griffon)| +|**2024-10-21**|**Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models**|Giannis Daras et.al.|[2410.16152v1](http://arxiv.org/abs/2410.16152v1)|null| +|**2024-10-21**|**Denoising Hyperbolic-Valued Data by Relaxed Regularizations**|Robert Beinert et.al.|[2410.16149v1](http://arxiv.org/abs/2410.16149v1)|null| +|**2024-10-21**|**An Explainable Contrastive-based Dilated Convolutional Network with Transformer for Pediatric Pneumonia Detection**|Chandravardhan Singh Raghaw et.al.|[2410.16143v1](http://arxiv.org/abs/2410.16143v1)|null| +|**2024-10-21**|**Classification and dilation for $q$-commuting $2 \times 2$ scalar matrices**|Sourav Pal et.al.|[2410.16134v1](http://arxiv.org/abs/2410.16134v1)|null| +|**2024-10-21**|**Extracting Spatiotemporal Data from Gradients with Large Language Models**|Lele Zheng et.al.|[2410.16121v1](http://arxiv.org/abs/2410.16121v1)|null| +|**2024-10-21**|**Ultra-High-Energy Gamma-Ray Bubble around Microquasar V4641 Sgr**|R. Alfaro et.al.|[2410.16117v1](http://arxiv.org/abs/2410.16117v1)|null| +|**2024-10-21**|**Multimodal Flare Forecasting with Deep Learning**|Grégoire Francisco et.al.|[2410.16116v1](http://arxiv.org/abs/2410.16116v1)|null| +|**2024-10-21**|**Increasing Interpretability of Neural Networks By Approximating Human Visual Saliency**|Aidan Boyd et.al.|[2410.16115v1](http://arxiv.org/abs/2410.16115v1)|null| +|**2024-10-21**|**Addressing Spectral Bias of Deep Neural Networks by Multi-Grade Deep Learning**|Ronglong Fang et.al.|[2410.16105v1](http://arxiv.org/abs/2410.16105v1)|[link](https://github.com/ronglong-fang/addressingspectralbiasviamgdl)| +|**2024-10-21**|**LMHaze: Intensity-aware Image Dehazing with a Large-scale Multi-intensity Real Haze Dataset**|Ruikun Zhang et.al.|[2410.16095v1](http://arxiv.org/abs/2410.16095v1)|[link](https://github.com/wangzrk/LMHaze)| +|**2024-10-21**|**Multi-Sensor Fusion for UAV Classification Based on Feature Maps of Image and Radar Data**|Nikos Sakellariou et.al.|[2410.16089v1](http://arxiv.org/abs/2410.16089v1)|null| +|**2024-10-21**|**Deep Multimodal Representation Learning for Stellar Spectra**|Tobias Buck et.al.|[2410.16081v1](http://arxiv.org/abs/2410.16081v1)|null| + +### Instance Segmentation |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | -|**2024-10-21**|**DMM: Distributed Matrix Mechanism for Differentially-Private Federated Learning using Packed Secret Sharing**|Alexander Bienstock et.al.|[2410.16161v1](http://arxiv.org/abs/2410.16161v1)|null| -|**2024-10-21**|**Extracting Spatiotemporal Data from Gradients with Large Language Models**|Lele Zheng et.al.|[2410.16121v1](http://arxiv.org/abs/2410.16121v1)|null| -|**2024-10-21**|**Distributed Learning for UAV Swarms**|Chen Hu et.al.|[2410.15882v1](http://arxiv.org/abs/2410.15882v1)|null| -|**2024-10-21**|**Geographical Node Clustering and Grouping to Guarantee Data IIDness in Federated Learning**|Minkwon Lee et.al.|[2410.15693v1](http://arxiv.org/abs/2410.15693v1)|null| -|**2024-10-21**|**Federated Learning with MMD-based Early Stopping for Adaptive GNSS Interference Classification**|Nishant S. Gaikwad et.al.|[2410.15681v1](http://arxiv.org/abs/2410.15681v1)|null| -|**2024-10-20**|**MIRA: A Method of Federated MultI-Task Learning for LaRge LAnguage Models**|Ahmed Elbakary et.al.|[2410.15524v1](http://arxiv.org/abs/2410.15524v1)|null| -|**2024-10-20**|**Bayesian data fusion for distributed learning**|Peng Wu et.al.|[2410.15473v1](http://arxiv.org/abs/2410.15473v1)|null| -|**2024-10-20**|**Tighter Performance Theory of FedExProx**|Wojciech Anyszka et.al.|[2410.15368v1](http://arxiv.org/abs/2410.15368v1)|null| -|**2024-10-19**|**DPVS-Shapley:Faster and Universal Contribution Evaluation Component in Federated Learning**|Ketin Yin et.al.|[2410.15093v1](http://arxiv.org/abs/2410.15093v1)|null| -|**2024-10-19**|**Personalized Federated Learning with Adaptive Feature Aggregation and Knowledge Transfer**|Keting Yin et.al.|[2410.15073v1](http://arxiv.org/abs/2410.15073v1)|null| -|**2024-10-19**|**Mind the Remaining: Mechanism Design for Robust Federated Unlearning**|Jiaqi Shao et.al.|[2410.15045v1](http://arxiv.org/abs/2410.15045v1)|null| -|**2024-10-18**|**FedSpaLLM: Federated Pruning of Large Language Models**|Guangji Bai et.al.|[2410.14852v1](http://arxiv.org/abs/2410.14852v1)|null| -|**2024-10-18**|**Assessing the Impact of AR-Assisted Warnings on Roadway Workers' Stress Under Different Workload Conditions**|Fatemeh Banani Ardecani et.al.|[2410.14537v1](http://arxiv.org/abs/2410.14537v1)|null| -|**2024-10-18**|**Personalizing Low-Rank Bayesian Neural Networks Via Federated Learning**|Boning Zhang et.al.|[2410.14390v1](http://arxiv.org/abs/2410.14390v1)|null| -|**2024-10-18**|**Comparative Evaluation of Clustered Federated Learning Method**|Michael Ben Ali et.al.|[2410.14212v1](http://arxiv.org/abs/2410.14212v1)|[link](https://github.com/leahcimali/Comparative-Evaluation-of-Clustered-Federated-Learning-Methods)| -|**2024-10-18**|**FedMSE: Federated learning for IoT network intrusion detection**|Van Tuan Nguyen et.al.|[2410.14121v1](http://arxiv.org/abs/2410.14121v1)|[link](https://github.com/dino-chiio/fedmse)| -|**2024-10-18**|**A Communication and Computation Efficient Fully First-order Method for Decentralized Bilevel Optimization**|Min Wen et.al.|[2410.14115v1](http://arxiv.org/abs/2410.14115v1)|null| -|**2024-10-17**|**FedPAE: Peer-Adaptive Ensemble Learning for Asynchronous and Model-Heterogeneous Federated Learning**|Brianna Mueller et.al.|[2410.14075v1](http://arxiv.org/abs/2410.14075v1)|null| -|**2024-10-17**|**Optimal Communication and Key Rate Region for Hierarchical Secure Aggregation with User Collusion**|Xiang Zhang et.al.|[2410.14035v2](http://arxiv.org/abs/2410.14035v2)|null| -|**2024-10-17**|**Conformal Prediction for Federated Graph Neural Networks with Missing Neighbor Information**|Ömer Faruk Akgül et.al.|[2410.14010v1](http://arxiv.org/abs/2410.14010v1)|null| -|**2024-10-17**|**DPFedBank: Crafting a Privacy-Preserving Federated Learning Framework for Financial Institutions with Policy Pillars**|Peilin He et.al.|[2410.13753v1](http://arxiv.org/abs/2410.13753v1)|null| -|**2024-10-17**|**On-device Federated Learning in Smartphones for Detecting Depression from Reddit Posts**|Mustofa Ahmed et.al.|[2410.13709v1](http://arxiv.org/abs/2410.13709v1)|null| -|**2024-10-17**|**Towards Satellite Non-IID Imagery: A Spectral Clustering-Assisted Federated Learning Approach**|Luyao Zou et.al.|[2410.13602v2](http://arxiv.org/abs/2410.13602v2)|null| -|**2024-10-17**|**Mathematical modelling to inform outbreak response vaccination**|Manjari Shankar et.al.|[2410.13923v1](http://arxiv.org/abs/2410.13923v1)|null| -|**2024-10-17**|**Breaking the Manual Annotation Bottleneck: Creating a Comprehensive Legal Case Criticality Dataset through Semi-Automated Labeling**|Ronja Stern et.al.|[2410.13460v1](http://arxiv.org/abs/2410.13460v1)|null| -|**2024-10-17**|**Unlocking Legal Knowledge: A Multilingual Dataset for Judicial Summarization in Switzerland**|Luca Rolshoven et.al.|[2410.13456v1](http://arxiv.org/abs/2410.13456v1)|null| -|**2024-10-17**|**Towards Formal Verification of Federated Learning Orchestration Protocols on Satellites**|Miroslav Popovic et.al.|[2410.13429v1](http://arxiv.org/abs/2410.13429v1)|null| -|**2024-10-17**|**FRAG: Toward Federated Vector Database Management for Collaborative and Secure Retrieval-Augmented Generation**|Dongfang Zhao et.al.|[2410.13272v1](http://arxiv.org/abs/2410.13272v1)|null| -|**2024-10-17**|**Cyber Attacks Prevention Towards Prosumer-based EV Charging Stations: An Edge-assisted Federated Prototype Knowledge Distillation Approach**|Luyao Zou et.al.|[2410.13260v2](http://arxiv.org/abs/2410.13260v2)|null| -|**2024-10-17**|**Investigating Effective Speaker Property Privacy Protection in Federated Learning for Speech Emotion Recognition**|Chao Tan et.al.|[2410.13221v1](http://arxiv.org/abs/2410.13221v1)|null| +|**2024-10-21**|**SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree**|Shuangrui Ding et.al.|[2410.16268v1](http://arxiv.org/abs/2410.16268v1)|[link](https://github.com/mark12ding/sam2long)| +|**2024-10-21**|**Deep Radiomics Detection of Clinically Significant Prostate Cancer on Multicenter MRI: Initial Comparison to PI-RADS Assessment**|G. A. Nketiah et.al.|[2410.16238v1](http://arxiv.org/abs/2410.16238v1)|null| +|**2024-10-21**|**Role of obstacle softness in the diffusive behavior of active Particles**|Ankit Gupta et.al.|[2410.16223v1](http://arxiv.org/abs/2410.16223v1)|null| +|**2024-10-21**|**PODTILE: Facilitating Podcast Episode Browsing with Auto-generated Chapters**|Azin Ghazimatin et.al.|[2410.16148v1](http://arxiv.org/abs/2410.16148v1)|null| +|**2024-10-21**|**MNIST-Nd: a set of naturalistic datasets to benchmark clustering across dimensions**|Polina Turishcheva et.al.|[2410.16124v1](http://arxiv.org/abs/2410.16124v1)|null| +|**2024-10-21**|**ExDBN: Exact learning of Dynamic Bayesian Networks**|Pavel Rytíř et.al.|[2410.16100v1](http://arxiv.org/abs/2410.16100v1)|null| +|**2024-10-21**|**Orbifold singularity formation along ancient and immortal Ricci flows**|Alix Deruelle et.al.|[2410.16075v1](http://arxiv.org/abs/2410.16075v1)|null| +|**2024-10-21**|**Integrated Image-Text Based on Semi-supervised Learning for Small Sample Instance Segmentation**|Ruting Chi et.al.|[2410.16063v1](http://arxiv.org/abs/2410.16063v1)|null| +|**2024-10-21**|**Label Filling via Mixed Supervision for Medical Image Segmentation from Noisy Annotations**|Ming Li et.al.|[2410.16057v1](http://arxiv.org/abs/2410.16057v1)|null| +|**2024-10-21**|**Few-shot target-driven instance detection based on open-vocabulary object detection models**|Ben Crulis et.al.|[2410.16028v1](http://arxiv.org/abs/2410.16028v1)|null| +|**2024-10-21**|**CA*: Addressing Evaluation Pitfalls in Computation-Aware Latency for Simultaneous Speech Translation**|Xi Xu et.al.|[2410.16011v1](http://arxiv.org/abs/2410.16011v1)|null| +|**2024-10-21**|**Self-Explained Keywords Empower Large Language Models for Code Generation**|Lishui Fan et.al.|[2410.15966v1](http://arxiv.org/abs/2410.15966v1)|null| +|**2024-10-21**|**Redefining Finance: The Influence of Artificial Intelligence (AI) and Machine Learning (ML)**|Animesh Kumar et.al.|[2410.15951v1](http://arxiv.org/abs/2410.15951v1)|null| +|**2024-10-21**|**Focus on BEV: Self-calibrated Cycle View Transformation for Monocular Birds-Eye-View Segmentation**|Jiawei Zhao et.al.|[2410.15932v1](http://arxiv.org/abs/2410.15932v1)|null| +|**2024-10-21**|**Automatic Differentiation of Optimization Algorithms with Time-Varying Updates**|Sheheryar Mehmood et.al.|[2410.15923v1](http://arxiv.org/abs/2410.15923v1)|null| +|**2024-10-21**|**A Simpler Approach for Monotone Parametric Minimum Cut: Finding the Breakpoints in Order**|Arne Beines et.al.|[2410.15920v1](http://arxiv.org/abs/2410.15920v1)|null| +|**2024-10-21**|**Leveraging CORAL-Correlation Consistency Network for Semi-Supervised Left Atrium MRI Segmentation**|Xinze Li et.al.|[2410.15916v1](http://arxiv.org/abs/2410.15916v1)|[link](https://github.com/Powertony102/corn_official)| +|**2024-10-21**|**IGMaxHS -- An Incremental MaxSAT Solver with Support for XOR Clauses**|Ole Lübke et.al.|[2410.15897v1](http://arxiv.org/abs/2410.15897v1)|null| +|**2024-10-21**|**Foundation Models for Slide-level Cancer Subtyping in Digital Pathology**|Pablo Meseguer et.al.|[2410.15886v1](http://arxiv.org/abs/2410.15886v1)|null| +|**2024-10-21**|**How to Build a Pre-trained Multimodal model for Simultaneously Chatting and Decision-making?**|Zuojin Tang et.al.|[2410.15885v1](http://arxiv.org/abs/2410.15885v1)|null| +|**2024-10-21**|**MI-VisionShot: Few-shot adaptation of vision-language models for slide-level classification of histopathological images**|Pablo Meseguer et.al.|[2410.15881v1](http://arxiv.org/abs/2410.15881v1)|null| +|**2024-10-21**|**MultIHeaTS: A Fast and Stable Thermal Solver for Multilayered Planetary Surfaces**|Cyril Mergny et.al.|[2410.15834v1](http://arxiv.org/abs/2410.15834v1)|[link](https://github.com/cmergny/MultIHeaTS)| +|**2024-10-21**|**LiOn-XA: Unsupervised Domain Adaptation via LiDAR-Only Cross-Modal Adversarial Training**|Thomas Kreutz et.al.|[2410.15833v1](http://arxiv.org/abs/2410.15833v1)|[link](https://github.com/jensle97/lion-xa)| +|**2024-10-21**|**FusionLungNet: Multi-scale Fusion Convolution with Refinement Network for Lung CT Image Segmentation**|Sadjad Rezvani et.al.|[2410.15812v1](http://arxiv.org/abs/2410.15812v1)|null| +|**2024-10-21**|**Deep Learning and Data Augmentation for Detecting Self-Admitted Technical Debt**|Edi Sutoyo et.al.|[2410.15804v1](http://arxiv.org/abs/2410.15804v1)|[link](https://github.com/edisutoyo/satd-augmentation)| +|**2024-10-21**|**Habaek: High-performance water segmentation through dataset expansion and inductive bias optimization**|Hanseon Joo et.al.|[2410.15794v1](http://arxiv.org/abs/2410.15794v1)|[link](https://github.com/HanseonJoo/Habaek)| +|**2024-10-21**|**Improving Instance Optimization in Deformable Image Registration with Gradient Projection**|Yi Zhang et.al.|[2410.15767v1](http://arxiv.org/abs/2410.15767v1)|null| +|**2024-10-21**|**GIG: Graph Data Imputation With Graph Differential Dependencies**|Jiang Hua et.al.|[2410.15747v1](http://arxiv.org/abs/2410.15747v1)|null| +|**2024-10-21**|**Unleashing the Potential of Vision-Language Pre-Training for 3D Zero-Shot Lesion Segmentation via Mask-Attribute Alignment**|Yankai Jiang et.al.|[2410.15744v1](http://arxiv.org/abs/2410.15744v1)|null| +|**2024-10-21**|**A Fair Allocation is Approximately Optimal for Indivisible Chores, or Is It?**|Bo Li et.al.|[2410.15738v1](http://arxiv.org/abs/2410.15738v1)|null| -### Dataset +### Object Tracking |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | -|**2024-10-21**|**DMM: Distributed Matrix Mechanism for Differentially-Private Federated Learning using Packed Secret Sharing**|Alexander Bienstock et.al.|[2410.16161v1](http://arxiv.org/abs/2410.16161v1)|null| -|**2024-10-21**|**Extracting Spatiotemporal Data from Gradients with Large Language Models**|Lele Zheng et.al.|[2410.16121v1](http://arxiv.org/abs/2410.16121v1)|null| -|**2024-10-21**|**Distributed Learning for UAV Swarms**|Chen Hu et.al.|[2410.15882v1](http://arxiv.org/abs/2410.15882v1)|null| -|**2024-10-21**|**Geographical Node Clustering and Grouping to Guarantee Data IIDness in Federated Learning**|Minkwon Lee et.al.|[2410.15693v1](http://arxiv.org/abs/2410.15693v1)|null| -|**2024-10-21**|**Federated Learning with MMD-based Early Stopping for Adaptive GNSS Interference Classification**|Nishant S. Gaikwad et.al.|[2410.15681v1](http://arxiv.org/abs/2410.15681v1)|null| -|**2024-10-20**|**MIRA: A Method of Federated MultI-Task Learning for LaRge LAnguage Models**|Ahmed Elbakary et.al.|[2410.15524v1](http://arxiv.org/abs/2410.15524v1)|null| -|**2024-10-20**|**Bayesian data fusion for distributed learning**|Peng Wu et.al.|[2410.15473v1](http://arxiv.org/abs/2410.15473v1)|null| -|**2024-10-20**|**Tighter Performance Theory of FedExProx**|Wojciech Anyszka et.al.|[2410.15368v1](http://arxiv.org/abs/2410.15368v1)|null| -|**2024-10-19**|**DPVS-Shapley:Faster and Universal Contribution Evaluation Component in Federated Learning**|Ketin Yin et.al.|[2410.15093v1](http://arxiv.org/abs/2410.15093v1)|null| -|**2024-10-19**|**Personalized Federated Learning with Adaptive Feature Aggregation and Knowledge Transfer**|Keting Yin et.al.|[2410.15073v1](http://arxiv.org/abs/2410.15073v1)|null| -|**2024-10-19**|**Mind the Remaining: Mechanism Design for Robust Federated Unlearning**|Jiaqi Shao et.al.|[2410.15045v1](http://arxiv.org/abs/2410.15045v1)|null| -|**2024-10-18**|**FedSpaLLM: Federated Pruning of Large Language Models**|Guangji Bai et.al.|[2410.14852v1](http://arxiv.org/abs/2410.14852v1)|null| -|**2024-10-18**|**Assessing the Impact of AR-Assisted Warnings on Roadway Workers' Stress Under Different Workload Conditions**|Fatemeh Banani Ardecani et.al.|[2410.14537v1](http://arxiv.org/abs/2410.14537v1)|null| -|**2024-10-18**|**Personalizing Low-Rank Bayesian Neural Networks Via Federated Learning**|Boning Zhang et.al.|[2410.14390v1](http://arxiv.org/abs/2410.14390v1)|null| -|**2024-10-18**|**Comparative Evaluation of Clustered Federated Learning Method**|Michael Ben Ali et.al.|[2410.14212v1](http://arxiv.org/abs/2410.14212v1)|[link](https://github.com/leahcimali/Comparative-Evaluation-of-Clustered-Federated-Learning-Methods)| -|**2024-10-18**|**FedMSE: Federated learning for IoT network intrusion detection**|Van Tuan Nguyen et.al.|[2410.14121v1](http://arxiv.org/abs/2410.14121v1)|[link](https://github.com/dino-chiio/fedmse)| -|**2024-10-18**|**A Communication and Computation Efficient Fully First-order Method for Decentralized Bilevel Optimization**|Min Wen et.al.|[2410.14115v1](http://arxiv.org/abs/2410.14115v1)|null| -|**2024-10-17**|**FedPAE: Peer-Adaptive Ensemble Learning for Asynchronous and Model-Heterogeneous Federated Learning**|Brianna Mueller et.al.|[2410.14075v1](http://arxiv.org/abs/2410.14075v1)|null| -|**2024-10-17**|**Optimal Communication and Key Rate Region for Hierarchical Secure Aggregation with User Collusion**|Xiang Zhang et.al.|[2410.14035v2](http://arxiv.org/abs/2410.14035v2)|null| -|**2024-10-17**|**Conformal Prediction for Federated Graph Neural Networks with Missing Neighbor Information**|Ömer Faruk Akgül et.al.|[2410.14010v1](http://arxiv.org/abs/2410.14010v1)|null| -|**2024-10-17**|**DPFedBank: Crafting a Privacy-Preserving Federated Learning Framework for Financial Institutions with Policy Pillars**|Peilin He et.al.|[2410.13753v1](http://arxiv.org/abs/2410.13753v1)|null| -|**2024-10-17**|**On-device Federated Learning in Smartphones for Detecting Depression from Reddit Posts**|Mustofa Ahmed et.al.|[2410.13709v1](http://arxiv.org/abs/2410.13709v1)|null| -|**2024-10-17**|**Towards Satellite Non-IID Imagery: A Spectral Clustering-Assisted Federated Learning Approach**|Luyao Zou et.al.|[2410.13602v2](http://arxiv.org/abs/2410.13602v2)|null| -|**2024-10-17**|**Mathematical modelling to inform outbreak response vaccination**|Manjari Shankar et.al.|[2410.13923v1](http://arxiv.org/abs/2410.13923v1)|null| -|**2024-10-17**|**Breaking the Manual Annotation Bottleneck: Creating a Comprehensive Legal Case Criticality Dataset through Semi-Automated Labeling**|Ronja Stern et.al.|[2410.13460v1](http://arxiv.org/abs/2410.13460v1)|null| -|**2024-10-17**|**Unlocking Legal Knowledge: A Multilingual Dataset for Judicial Summarization in Switzerland**|Luca Rolshoven et.al.|[2410.13456v1](http://arxiv.org/abs/2410.13456v1)|null| -|**2024-10-17**|**Towards Formal Verification of Federated Learning Orchestration Protocols on Satellites**|Miroslav Popovic et.al.|[2410.13429v1](http://arxiv.org/abs/2410.13429v1)|null| -|**2024-10-17**|**FRAG: Toward Federated Vector Database Management for Collaborative and Secure Retrieval-Augmented Generation**|Dongfang Zhao et.al.|[2410.13272v1](http://arxiv.org/abs/2410.13272v1)|null| -|**2024-10-17**|**Cyber Attacks Prevention Towards Prosumer-based EV Charging Stations: An Edge-assisted Federated Prototype Knowledge Distillation Approach**|Luyao Zou et.al.|[2410.13260v2](http://arxiv.org/abs/2410.13260v2)|null| -|**2024-10-17**|**Investigating Effective Speaker Property Privacy Protection in Federated Learning for Speech Emotion Recognition**|Chao Tan et.al.|[2410.13221v1](http://arxiv.org/abs/2410.13221v1)|null| +|**2024-10-21**|**Cosmic Shimmering: the Gravitational Wave Signal of Time-Resolved Cosmic Shear Observations**|Giorgio Mentasti et.al.|[2410.16274v1](http://arxiv.org/abs/2410.16274v1)|null| +|**2024-10-21**|**MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors**|Honghua Chen et.al.|[2410.16272v1](http://arxiv.org/abs/2410.16272v1)|null| +|**2024-10-21**|**SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree**|Shuangrui Ding et.al.|[2410.16268v1](http://arxiv.org/abs/2410.16268v1)|[link](https://github.com/mark12ding/sam2long)| +|**2024-10-21**|**Hyperbolicity in scalar-Gauss-Bonnet gravity: a gauge invariant study for spherical evolution**|Farid Thaalba et.al.|[2410.16264v1](http://arxiv.org/abs/2410.16264v1)|null| +|**2024-10-21**|**Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos**|Gengshan Yang et.al.|[2410.16259v1](http://arxiv.org/abs/2410.16259v1)|null| +|**2024-10-21**|**Revisiting Deep Feature Reconstruction for Logical and Structural Industrial Anomaly Detection**|Sukanya Patra et.al.|[2410.16255v1](http://arxiv.org/abs/2410.16255v1)|[link](https://github.com/sukanyapatra1997/ulsad-2024)| +|**2024-10-21**|**Composing Optimized Stepsize Schedules for Gradient Descent**|Benjamin Grimmer et.al.|[2410.16249v1](http://arxiv.org/abs/2410.16249v1)|null| +|**2024-10-21**|**Hyperparameter Optimisation in Deep Learning from Ensemble Methods: Applications to Proton Structure**|Juan Cruz-Martinez et.al.|[2410.16248v1](http://arxiv.org/abs/2410.16248v1)|null| +|**2024-10-21**|**About maximal antichains in a product of two chains:A catch-all note**|Denis Bouyssou et.al.|[2410.16243v1](http://arxiv.org/abs/2410.16243v1)|null| +|**2024-10-21**|**Deep Radiomics Detection of Clinically Significant Prostate Cancer on Multicenter MRI: Initial Comparison to PI-RADS Assessment**|G. A. Nketiah et.al.|[2410.16238v1](http://arxiv.org/abs/2410.16238v1)|null| +|**2024-10-21**|**IBGP: Imperfect Byzantine Generals Problem for Zero-Shot Robustness in Communicative Multi-Agent Systems**|Yihuan Mao et.al.|[2410.16237v1](http://arxiv.org/abs/2410.16237v1)|null| +|**2024-10-21**|**CiteClick: A Browser Extension for Real-Time Scholar Citation Tracking**|Nishat Raihan et.al.|[2410.16211v1](http://arxiv.org/abs/2410.16211v1)|null| +|**2024-10-21**|**Quantum Algorithms for Non-smooth Non-convex Optimization**|Chengchang Liu et.al.|[2410.16189v1](http://arxiv.org/abs/2410.16189v1)|null| +|**2024-10-21**|**Computing Inverses of Stieltjes Transforms of Probability Measures**|James Chen et.al.|[2410.16178v1](http://arxiv.org/abs/2410.16178v1)|null| +|**2024-10-21**|**The Interplay Between Physical Activity, Protein Consumption, and Sleep Quality in Muscle Protein Synthesis**|Ayush Devkota et.al.|[2410.16169v1](http://arxiv.org/abs/2410.16169v1)|null| +|**2024-10-21**|**Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models**|Yufei Zhan et.al.|[2410.16163v1](http://arxiv.org/abs/2410.16163v1)|[link](https://github.com/jefferyzhan/griffon)| +|**2024-10-21**|**PODTILE: Facilitating Podcast Episode Browsing with Auto-generated Chapters**|Azin Ghazimatin et.al.|[2410.16148v1](http://arxiv.org/abs/2410.16148v1)|null| +|**2024-10-21**|**Can Large Audio-Language Models Truly Hear? Tackling Hallucinations with Multi-Task Assessment and Stepwise Audio Reasoning**|Chun-Yi Kuan et.al.|[2410.16130v1](http://arxiv.org/abs/2410.16130v1)|null| +|**2024-10-21**|**DUMBO: Making durable read-only transactions fly on hardware transactional memory**|João Barreto et.al.|[2410.16110v1](http://arxiv.org/abs/2410.16110v1)|null| +|**2024-10-21**|**Semantics of Sets of Programs**|Jinwoo Kim et.al.|[2410.16102v1](http://arxiv.org/abs/2410.16102v1)|null| +|**2024-10-21**|**Defending Against Attack on the Cloned: In-Band Active Man-in-the-Middle Detection for the Signal Protocol**|Wil Liam Teng et.al.|[2410.16098v1](http://arxiv.org/abs/2410.16098v1)|null| +|**2024-10-21**|**Multi-Sensor Fusion for UAV Classification Based on Feature Maps of Image and Radar Data**|Nikos Sakellariou et.al.|[2410.16089v1](http://arxiv.org/abs/2410.16089v1)|null| +|**2024-10-21**|**Few-shot target-driven instance detection based on open-vocabulary object detection models**|Ben Crulis et.al.|[2410.16028v1](http://arxiv.org/abs/2410.16028v1)|null| +|**2024-10-21**|**Size and kinematics of the low-ionization broad emission line region from microlensing-induced line profile distortions in gravitationally lensed quasars**|Damien Hutsemékers et.al.|[2410.16018v1](http://arxiv.org/abs/2410.16018v1)|null| +|**2024-10-21**|**Differential Evolution-Based End-Fire Realized Gain Optimization of Active and Parasitic Arrays**|Rozita Konstantinou et.al.|[2410.16014v1](http://arxiv.org/abs/2410.16014v1)|null| +|**2024-10-21**|**Surprising Patterns in Musical Influence Networks**|Flavio Figueiredo et.al.|[2410.15996v1](http://arxiv.org/abs/2410.15996v1)|null| +|**2024-10-21**|**A Critical Drift-Diffusion Equation: Connections to the Diffusion on $\textbf{SL}(2)$**|Peter Morfe et.al.|[2410.15983v1](http://arxiv.org/abs/2410.15983v1)|null| +|**2024-10-21**|**The Yang-Baxter equation, Leibniz algebras, racks and related algebraic structures**|Nanyan Xu et.al.|[2410.15972v1](http://arxiv.org/abs/2410.15972v1)|null| +|**2024-10-21**|**An Efficient Local Optimizer-Tracking Solver for Differential-Algebriac Equations with Optimization Criteria**|Alexander Fleming et.al.|[2410.15963v1](http://arxiv.org/abs/2410.15963v1)|null| +|**2024-10-21**|**Systematic Exploration of Dialogue Summarization Approaches for Reproducibility, Comparative Assessment, and Methodological Innovations for Advancing Natural Language Processing in Abstractive Summarization**|Yugandhar Reddy Gogireddy et.al.|[2410.15962v1](http://arxiv.org/abs/2410.15962v1)|null| -### Asynchronous +### Object Detection |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | -|**2024-10-21**|**DMM: Distributed Matrix Mechanism for Differentially-Private Federated Learning using Packed Secret Sharing**|Alexander Bienstock et.al.|[2410.16161v1](http://arxiv.org/abs/2410.16161v1)|null| -|**2024-10-21**|**Extracting Spatiotemporal Data from Gradients with Large Language Models**|Lele Zheng et.al.|[2410.16121v1](http://arxiv.org/abs/2410.16121v1)|null| -|**2024-10-21**|**Distributed Learning for UAV Swarms**|Chen Hu et.al.|[2410.15882v1](http://arxiv.org/abs/2410.15882v1)|null| -|**2024-10-21**|**Geographical Node Clustering and Grouping to Guarantee Data IIDness in Federated Learning**|Minkwon Lee et.al.|[2410.15693v1](http://arxiv.org/abs/2410.15693v1)|null| -|**2024-10-21**|**Federated Learning with MMD-based Early Stopping for Adaptive GNSS Interference Classification**|Nishant S. Gaikwad et.al.|[2410.15681v1](http://arxiv.org/abs/2410.15681v1)|null| -|**2024-10-20**|**MIRA: A Method of Federated MultI-Task Learning for LaRge LAnguage Models**|Ahmed Elbakary et.al.|[2410.15524v1](http://arxiv.org/abs/2410.15524v1)|null| -|**2024-10-20**|**Bayesian data fusion for distributed learning**|Peng Wu et.al.|[2410.15473v1](http://arxiv.org/abs/2410.15473v1)|null| -|**2024-10-20**|**Tighter Performance Theory of FedExProx**|Wojciech Anyszka et.al.|[2410.15368v1](http://arxiv.org/abs/2410.15368v1)|null| -|**2024-10-19**|**DPVS-Shapley:Faster and Universal Contribution Evaluation Component in Federated Learning**|Ketin Yin et.al.|[2410.15093v1](http://arxiv.org/abs/2410.15093v1)|null| -|**2024-10-19**|**Personalized Federated Learning with Adaptive Feature Aggregation and Knowledge Transfer**|Keting Yin et.al.|[2410.15073v1](http://arxiv.org/abs/2410.15073v1)|null| -|**2024-10-19**|**Mind the Remaining: Mechanism Design for Robust Federated Unlearning**|Jiaqi Shao et.al.|[2410.15045v1](http://arxiv.org/abs/2410.15045v1)|null| -|**2024-10-18**|**FedSpaLLM: Federated Pruning of Large Language Models**|Guangji Bai et.al.|[2410.14852v1](http://arxiv.org/abs/2410.14852v1)|null| -|**2024-10-18**|**Assessing the Impact of AR-Assisted Warnings on Roadway Workers' Stress Under Different Workload Conditions**|Fatemeh Banani Ardecani et.al.|[2410.14537v1](http://arxiv.org/abs/2410.14537v1)|null| -|**2024-10-18**|**Personalizing Low-Rank Bayesian Neural Networks Via Federated Learning**|Boning Zhang et.al.|[2410.14390v1](http://arxiv.org/abs/2410.14390v1)|null| -|**2024-10-18**|**Comparative Evaluation of Clustered Federated Learning Method**|Michael Ben Ali et.al.|[2410.14212v1](http://arxiv.org/abs/2410.14212v1)|[link](https://github.com/leahcimali/Comparative-Evaluation-of-Clustered-Federated-Learning-Methods)| -|**2024-10-18**|**FedMSE: Federated learning for IoT network intrusion detection**|Van Tuan Nguyen et.al.|[2410.14121v1](http://arxiv.org/abs/2410.14121v1)|[link](https://github.com/dino-chiio/fedmse)| -|**2024-10-18**|**A Communication and Computation Efficient Fully First-order Method for Decentralized Bilevel Optimization**|Min Wen et.al.|[2410.14115v1](http://arxiv.org/abs/2410.14115v1)|null| -|**2024-10-17**|**FedPAE: Peer-Adaptive Ensemble Learning for Asynchronous and Model-Heterogeneous Federated Learning**|Brianna Mueller et.al.|[2410.14075v1](http://arxiv.org/abs/2410.14075v1)|null| -|**2024-10-17**|**Optimal Communication and Key Rate Region for Hierarchical Secure Aggregation with User Collusion**|Xiang Zhang et.al.|[2410.14035v2](http://arxiv.org/abs/2410.14035v2)|null| -|**2024-10-17**|**Conformal Prediction for Federated Graph Neural Networks with Missing Neighbor Information**|Ömer Faruk Akgül et.al.|[2410.14010v1](http://arxiv.org/abs/2410.14010v1)|null| -|**2024-10-17**|**DPFedBank: Crafting a Privacy-Preserving Federated Learning Framework for Financial Institutions with Policy Pillars**|Peilin He et.al.|[2410.13753v1](http://arxiv.org/abs/2410.13753v1)|null| -|**2024-10-17**|**On-device Federated Learning in Smartphones for Detecting Depression from Reddit Posts**|Mustofa Ahmed et.al.|[2410.13709v1](http://arxiv.org/abs/2410.13709v1)|null| -|**2024-10-17**|**Towards Satellite Non-IID Imagery: A Spectral Clustering-Assisted Federated Learning Approach**|Luyao Zou et.al.|[2410.13602v2](http://arxiv.org/abs/2410.13602v2)|null| -|**2024-10-17**|**Mathematical modelling to inform outbreak response vaccination**|Manjari Shankar et.al.|[2410.13923v1](http://arxiv.org/abs/2410.13923v1)|null| -|**2024-10-17**|**Breaking the Manual Annotation Bottleneck: Creating a Comprehensive Legal Case Criticality Dataset through Semi-Automated Labeling**|Ronja Stern et.al.|[2410.13460v1](http://arxiv.org/abs/2410.13460v1)|null| -|**2024-10-17**|**Unlocking Legal Knowledge: A Multilingual Dataset for Judicial Summarization in Switzerland**|Luca Rolshoven et.al.|[2410.13456v1](http://arxiv.org/abs/2410.13456v1)|null| -|**2024-10-17**|**Towards Formal Verification of Federated Learning Orchestration Protocols on Satellites**|Miroslav Popovic et.al.|[2410.13429v1](http://arxiv.org/abs/2410.13429v1)|null| -|**2024-10-17**|**FRAG: Toward Federated Vector Database Management for Collaborative and Secure Retrieval-Augmented Generation**|Dongfang Zhao et.al.|[2410.13272v1](http://arxiv.org/abs/2410.13272v1)|null| -|**2024-10-17**|**Cyber Attacks Prevention Towards Prosumer-based EV Charging Stations: An Edge-assisted Federated Prototype Knowledge Distillation Approach**|Luyao Zou et.al.|[2410.13260v2](http://arxiv.org/abs/2410.13260v2)|null| -|**2024-10-17**|**Investigating Effective Speaker Property Privacy Protection in Federated Learning for Speech Emotion Recognition**|Chao Tan et.al.|[2410.13221v1](http://arxiv.org/abs/2410.13221v1)|null| +|**2024-10-21**|**Cosmic Shimmering: the Gravitational Wave Signal of Time-Resolved Cosmic Shear Observations**|Giorgio Mentasti et.al.|[2410.16274v1](http://arxiv.org/abs/2410.16274v1)|null| +|**2024-10-21**|**MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors**|Honghua Chen et.al.|[2410.16272v1](http://arxiv.org/abs/2410.16272v1)|null| +|**2024-10-21**|**SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree**|Shuangrui Ding et.al.|[2410.16268v1](http://arxiv.org/abs/2410.16268v1)|[link](https://github.com/mark12ding/sam2long)| +|**2024-10-21**|**Revisiting Deep Feature Reconstruction for Logical and Structural Industrial Anomaly Detection**|Sukanya Patra et.al.|[2410.16255v1](http://arxiv.org/abs/2410.16255v1)|[link](https://github.com/sukanyapatra1997/ulsad-2024)| +|**2024-10-21**|**Composing Optimized Stepsize Schedules for Gradient Descent**|Benjamin Grimmer et.al.|[2410.16249v1](http://arxiv.org/abs/2410.16249v1)|null| +|**2024-10-21**|**Hyperparameter Optimisation in Deep Learning from Ensemble Methods: Applications to Proton Structure**|Juan Cruz-Martinez et.al.|[2410.16248v1](http://arxiv.org/abs/2410.16248v1)|null| +|**2024-10-21**|**About maximal antichains in a product of two chains:A catch-all note**|Denis Bouyssou et.al.|[2410.16243v1](http://arxiv.org/abs/2410.16243v1)|null| +|**2024-10-21**|**Deep Radiomics Detection of Clinically Significant Prostate Cancer on Multicenter MRI: Initial Comparison to PI-RADS Assessment**|G. A. Nketiah et.al.|[2410.16238v1](http://arxiv.org/abs/2410.16238v1)|null| +|**2024-10-21**|**IBGP: Imperfect Byzantine Generals Problem for Zero-Shot Robustness in Communicative Multi-Agent Systems**|Yihuan Mao et.al.|[2410.16237v1](http://arxiv.org/abs/2410.16237v1)|null| +|**2024-10-21**|**Quantum Algorithms for Non-smooth Non-convex Optimization**|Chengchang Liu et.al.|[2410.16189v1](http://arxiv.org/abs/2410.16189v1)|null| +|**2024-10-21**|**Search for $h_b(2P)\toγχ_{bJ}(1P)$ at $\sqrt{s} = 10.860$ GeV**|Belle Collaboration et.al.|[2410.16181v1](http://arxiv.org/abs/2410.16181v1)|null| +|**2024-10-21**|**Computing Inverses of Stieltjes Transforms of Probability Measures**|James Chen et.al.|[2410.16178v1](http://arxiv.org/abs/2410.16178v1)|null| +|**2024-10-21**|**The Interplay Between Physical Activity, Protein Consumption, and Sleep Quality in Muscle Protein Synthesis**|Ayush Devkota et.al.|[2410.16169v1](http://arxiv.org/abs/2410.16169v1)|null| +|**2024-10-21**|**Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models**|Yufei Zhan et.al.|[2410.16163v1](http://arxiv.org/abs/2410.16163v1)|[link](https://github.com/jefferyzhan/griffon)| +|**2024-10-21**|**Can Large Audio-Language Models Truly Hear? Tackling Hallucinations with Multi-Task Assessment and Stepwise Audio Reasoning**|Chun-Yi Kuan et.al.|[2410.16130v1](http://arxiv.org/abs/2410.16130v1)|null| +|**2024-10-21**|**Multi-Sensor Fusion for UAV Classification Based on Feature Maps of Image and Radar Data**|Nikos Sakellariou et.al.|[2410.16089v1](http://arxiv.org/abs/2410.16089v1)|null| +|**2024-10-21**|**Searching for Axion-Like Particles with X-ray Observations of Alpha Centauri**|Yu-Xuan Chen et.al.|[2410.16065v1](http://arxiv.org/abs/2410.16065v1)|null| +|**2024-10-21**|**Few-shot target-driven instance detection based on open-vocabulary object detection models**|Ben Crulis et.al.|[2410.16028v1](http://arxiv.org/abs/2410.16028v1)|null| +|**2024-10-21**|**Size and kinematics of the low-ionization broad emission line region from microlensing-induced line profile distortions in gravitationally lensed quasars**|Damien Hutsemékers et.al.|[2410.16018v1](http://arxiv.org/abs/2410.16018v1)|null| +|**2024-10-21**|**Differential Evolution-Based End-Fire Realized Gain Optimization of Active and Parasitic Arrays**|Rozita Konstantinou et.al.|[2410.16014v1](http://arxiv.org/abs/2410.16014v1)|null| +|**2024-10-21**|**A Critical Drift-Diffusion Equation: Connections to the Diffusion on $\textbf{SL}(2)$**|Peter Morfe et.al.|[2410.15983v1](http://arxiv.org/abs/2410.15983v1)|null| +|**2024-10-21**|**The Yang-Baxter equation, Leibniz algebras, racks and related algebraic structures**|Nanyan Xu et.al.|[2410.15972v1](http://arxiv.org/abs/2410.15972v1)|null| +|**2024-10-21**|**Partial Orientation Retrieval of Proteins From Coulomb Explosions**|Tomas André et.al.|[2410.15965v1](http://arxiv.org/abs/2410.15965v1)|null| +|**2024-10-21**|**Systematic Exploration of Dialogue Summarization Approaches for Reproducibility, Comparative Assessment, and Methodological Innovations for Advancing Natural Language Processing in Abstractive Summarization**|Yugandhar Reddy Gogireddy et.al.|[2410.15962v1](http://arxiv.org/abs/2410.15962v1)|null| +|**2024-10-21**|**CamI2V: Camera-Controlled Image-to-Video Diffusion Model**|Guangcong Zheng et.al.|[2410.15957v1](http://arxiv.org/abs/2410.15957v1)|[link](https://github.com/ZGCTroy/CamI2V)| +|**2024-10-21**|**Protein structure classification based on X-ray laser induced Coulomb explosion**|Tomas André et.al.|[2410.15934v1](http://arxiv.org/abs/2410.15934v1)|null| +|**2024-10-21**|**Mitigating Object Hallucination via Concentric Causal Attention**|Yun Xing et.al.|[2410.15926v1](http://arxiv.org/abs/2410.15926v1)|[link](https://github.com/xing0047/cca-llava)| +|**2024-10-21**|**Conceptual and practical approaches for investigating irreversible processes**|Dario Lucente et.al.|[2410.15925v1](http://arxiv.org/abs/2410.15925v1)|null| +|**2024-10-21**|**A Simpler Approach for Monotone Parametric Minimum Cut: Finding the Breakpoints in Order**|Arne Beines et.al.|[2410.15920v1](http://arxiv.org/abs/2410.15920v1)|null| +|**2024-10-21**|**On Poisson Distribution**|Dharmendra Kumar Singh et.al.|[2410.15914v1](http://arxiv.org/abs/2410.15914v1)|null| -### Privacy +### Multi-Object Tracking |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | -|**2024-10-21**|**DMM: Distributed Matrix Mechanism for Differentially-Private Federated Learning using Packed Secret Sharing**|Alexander Bienstock et.al.|[2410.16161v1](http://arxiv.org/abs/2410.16161v1)|null| -|**2024-10-21**|**Extracting Spatiotemporal Data from Gradients with Large Language Models**|Lele Zheng et.al.|[2410.16121v1](http://arxiv.org/abs/2410.16121v1)|null| -|**2024-10-21**|**Distributed Learning for UAV Swarms**|Chen Hu et.al.|[2410.15882v1](http://arxiv.org/abs/2410.15882v1)|null| -|**2024-10-21**|**Geographical Node Clustering and Grouping to Guarantee Data IIDness in Federated Learning**|Minkwon Lee et.al.|[2410.15693v1](http://arxiv.org/abs/2410.15693v1)|null| -|**2024-10-21**|**Federated Learning with MMD-based Early Stopping for Adaptive GNSS Interference Classification**|Nishant S. Gaikwad et.al.|[2410.15681v1](http://arxiv.org/abs/2410.15681v1)|null| -|**2024-10-20**|**MIRA: A Method of Federated MultI-Task Learning for LaRge LAnguage Models**|Ahmed Elbakary et.al.|[2410.15524v1](http://arxiv.org/abs/2410.15524v1)|null| -|**2024-10-20**|**Bayesian data fusion for distributed learning**|Peng Wu et.al.|[2410.15473v1](http://arxiv.org/abs/2410.15473v1)|null| -|**2024-10-20**|**Tighter Performance Theory of FedExProx**|Wojciech Anyszka et.al.|[2410.15368v1](http://arxiv.org/abs/2410.15368v1)|null| -|**2024-10-19**|**DPVS-Shapley:Faster and Universal Contribution Evaluation Component in Federated Learning**|Ketin Yin et.al.|[2410.15093v1](http://arxiv.org/abs/2410.15093v1)|null| -|**2024-10-19**|**Personalized Federated Learning with Adaptive Feature Aggregation and Knowledge Transfer**|Keting Yin et.al.|[2410.15073v1](http://arxiv.org/abs/2410.15073v1)|null| -|**2024-10-19**|**Mind the Remaining: Mechanism Design for Robust Federated Unlearning**|Jiaqi Shao et.al.|[2410.15045v1](http://arxiv.org/abs/2410.15045v1)|null| -|**2024-10-18**|**FedSpaLLM: Federated Pruning of Large Language Models**|Guangji Bai et.al.|[2410.14852v1](http://arxiv.org/abs/2410.14852v1)|null| -|**2024-10-18**|**Assessing the Impact of AR-Assisted Warnings on Roadway Workers' Stress Under Different Workload Conditions**|Fatemeh Banani Ardecani et.al.|[2410.14537v1](http://arxiv.org/abs/2410.14537v1)|null| -|**2024-10-18**|**Personalizing Low-Rank Bayesian Neural Networks Via Federated Learning**|Boning Zhang et.al.|[2410.14390v1](http://arxiv.org/abs/2410.14390v1)|null| -|**2024-10-18**|**Comparative Evaluation of Clustered Federated Learning Method**|Michael Ben Ali et.al.|[2410.14212v1](http://arxiv.org/abs/2410.14212v1)|[link](https://github.com/leahcimali/Comparative-Evaluation-of-Clustered-Federated-Learning-Methods)| -|**2024-10-18**|**FedMSE: Federated learning for IoT network intrusion detection**|Van Tuan Nguyen et.al.|[2410.14121v1](http://arxiv.org/abs/2410.14121v1)|[link](https://github.com/dino-chiio/fedmse)| -|**2024-10-18**|**A Communication and Computation Efficient Fully First-order Method for Decentralized Bilevel Optimization**|Min Wen et.al.|[2410.14115v1](http://arxiv.org/abs/2410.14115v1)|null| -|**2024-10-17**|**FedPAE: Peer-Adaptive Ensemble Learning for Asynchronous and Model-Heterogeneous Federated Learning**|Brianna Mueller et.al.|[2410.14075v1](http://arxiv.org/abs/2410.14075v1)|null| -|**2024-10-17**|**Optimal Communication and Key Rate Region for Hierarchical Secure Aggregation with User Collusion**|Xiang Zhang et.al.|[2410.14035v2](http://arxiv.org/abs/2410.14035v2)|null| -|**2024-10-17**|**Conformal Prediction for Federated Graph Neural Networks with Missing Neighbor Information**|Ömer Faruk Akgül et.al.|[2410.14010v1](http://arxiv.org/abs/2410.14010v1)|null| -|**2024-10-17**|**DPFedBank: Crafting a Privacy-Preserving Federated Learning Framework for Financial Institutions with Policy Pillars**|Peilin He et.al.|[2410.13753v1](http://arxiv.org/abs/2410.13753v1)|null| -|**2024-10-17**|**On-device Federated Learning in Smartphones for Detecting Depression from Reddit Posts**|Mustofa Ahmed et.al.|[2410.13709v1](http://arxiv.org/abs/2410.13709v1)|null| -|**2024-10-17**|**Towards Satellite Non-IID Imagery: A Spectral Clustering-Assisted Federated Learning Approach**|Luyao Zou et.al.|[2410.13602v2](http://arxiv.org/abs/2410.13602v2)|null| -|**2024-10-17**|**Mathematical modelling to inform outbreak response vaccination**|Manjari Shankar et.al.|[2410.13923v1](http://arxiv.org/abs/2410.13923v1)|null| -|**2024-10-17**|**Breaking the Manual Annotation Bottleneck: Creating a Comprehensive Legal Case Criticality Dataset through Semi-Automated Labeling**|Ronja Stern et.al.|[2410.13460v1](http://arxiv.org/abs/2410.13460v1)|null| -|**2024-10-17**|**Unlocking Legal Knowledge: A Multilingual Dataset for Judicial Summarization in Switzerland**|Luca Rolshoven et.al.|[2410.13456v1](http://arxiv.org/abs/2410.13456v1)|null| -|**2024-10-17**|**Towards Formal Verification of Federated Learning Orchestration Protocols on Satellites**|Miroslav Popovic et.al.|[2410.13429v1](http://arxiv.org/abs/2410.13429v1)|null| -|**2024-10-17**|**FRAG: Toward Federated Vector Database Management for Collaborative and Secure Retrieval-Augmented Generation**|Dongfang Zhao et.al.|[2410.13272v1](http://arxiv.org/abs/2410.13272v1)|null| -|**2024-10-17**|**Cyber Attacks Prevention Towards Prosumer-based EV Charging Stations: An Edge-assisted Federated Prototype Knowledge Distillation Approach**|Luyao Zou et.al.|[2410.13260v2](http://arxiv.org/abs/2410.13260v2)|null| -|**2024-10-17**|**Investigating Effective Speaker Property Privacy Protection in Federated Learning for Speech Emotion Recognition**|Chao Tan et.al.|[2410.13221v1](http://arxiv.org/abs/2410.13221v1)|null| +|**2024-10-21**|**Cosmic Shimmering: the Gravitational Wave Signal of Time-Resolved Cosmic Shear Observations**|Giorgio Mentasti et.al.|[2410.16274v1](http://arxiv.org/abs/2410.16274v1)|null| +|**2024-10-21**|**MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors**|Honghua Chen et.al.|[2410.16272v1](http://arxiv.org/abs/2410.16272v1)|null| +|**2024-10-21**|**SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree**|Shuangrui Ding et.al.|[2410.16268v1](http://arxiv.org/abs/2410.16268v1)|[link](https://github.com/mark12ding/sam2long)| +|**2024-10-21**|**xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs**|Michael S. Ryoo et.al.|[2410.16267v1](http://arxiv.org/abs/2410.16267v1)|null| +|**2024-10-21**|**Hyperbolicity in scalar-Gauss-Bonnet gravity: a gauge invariant study for spherical evolution**|Farid Thaalba et.al.|[2410.16264v1](http://arxiv.org/abs/2410.16264v1)|null| +|**2024-10-21**|**Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos**|Gengshan Yang et.al.|[2410.16259v1](http://arxiv.org/abs/2410.16259v1)|null| +|**2024-10-21**|**Revisiting Deep Feature Reconstruction for Logical and Structural Industrial Anomaly Detection**|Sukanya Patra et.al.|[2410.16255v1](http://arxiv.org/abs/2410.16255v1)|[link](https://github.com/sukanyapatra1997/ulsad-2024)| +|**2024-10-21**|**Composing Optimized Stepsize Schedules for Gradient Descent**|Benjamin Grimmer et.al.|[2410.16249v1](http://arxiv.org/abs/2410.16249v1)|null| +|**2024-10-21**|**Hyperparameter Optimisation in Deep Learning from Ensemble Methods: Applications to Proton Structure**|Juan Cruz-Martinez et.al.|[2410.16248v1](http://arxiv.org/abs/2410.16248v1)|null| +|**2024-10-21**|**About maximal antichains in a product of two chains:A catch-all note**|Denis Bouyssou et.al.|[2410.16243v1](http://arxiv.org/abs/2410.16243v1)|null| +|**2024-10-21**|**Deep Radiomics Detection of Clinically Significant Prostate Cancer on Multicenter MRI: Initial Comparison to PI-RADS Assessment**|G. A. Nketiah et.al.|[2410.16238v1](http://arxiv.org/abs/2410.16238v1)|null| +|**2024-10-21**|**IBGP: Imperfect Byzantine Generals Problem for Zero-Shot Robustness in Communicative Multi-Agent Systems**|Yihuan Mao et.al.|[2410.16237v1](http://arxiv.org/abs/2410.16237v1)|null| +|**2024-10-21**|**LLaVA-KD: A Framework of Distilling Multimodal Large Language Models**|Yuxuan Cai et.al.|[2410.16236v1](http://arxiv.org/abs/2410.16236v1)|null| +|**2024-10-21**|**CiteClick: A Browser Extension for Real-Time Scholar Citation Tracking**|Nishat Raihan et.al.|[2410.16211v1](http://arxiv.org/abs/2410.16211v1)|null| +|**2024-10-21**|**Training Better Deep Learning Models Using Human Saliency**|Aidan Boyd et.al.|[2410.16190v1](http://arxiv.org/abs/2410.16190v1)|null| +|**2024-10-21**|**Quantum Algorithms for Non-smooth Non-convex Optimization**|Chengchang Liu et.al.|[2410.16189v1](http://arxiv.org/abs/2410.16189v1)|null| +|**2024-10-21**|**Computing Inverses of Stieltjes Transforms of Probability Measures**|James Chen et.al.|[2410.16178v1](http://arxiv.org/abs/2410.16178v1)|null| +|**2024-10-21**|**A Framework for Evaluating Predictive Models Using Synthetic Image Covariates and Longitudinal Data**|Simon Deltadahl et.al.|[2410.16177v1](http://arxiv.org/abs/2410.16177v1)|null| +|**2024-10-21**|**The Interplay Between Physical Activity, Protein Consumption, and Sleep Quality in Muscle Protein Synthesis**|Ayush Devkota et.al.|[2410.16169v1](http://arxiv.org/abs/2410.16169v1)|null| +|**2024-10-21**|**Beyond Filtering: Adaptive Image-Text Quality Enhancement for MLLM Pretraining**|Han Huang et.al.|[2410.16166v1](http://arxiv.org/abs/2410.16166v1)|[link](https://github.com/hanhuang22/aitqe)| +|**2024-10-21**|**Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models**|Yufei Zhan et.al.|[2410.16163v1](http://arxiv.org/abs/2410.16163v1)|[link](https://github.com/jefferyzhan/griffon)| +|**2024-10-21**|**Networks: The Visual Language of Complexity**|Blai Vidiella et.al.|[2410.16158v1](http://arxiv.org/abs/2410.16158v1)|null| +|**2024-10-21**|**PODTILE: Facilitating Podcast Episode Browsing with Auto-generated Chapters**|Azin Ghazimatin et.al.|[2410.16148v1](http://arxiv.org/abs/2410.16148v1)|null| +|**2024-10-21**|**An Explainable Contrastive-based Dilated Convolutional Network with Transformer for Pediatric Pneumonia Detection**|Chandravardhan Singh Raghaw et.al.|[2410.16143v1](http://arxiv.org/abs/2410.16143v1)|null| +|**2024-10-21**|**Modeling dynamic neural activity by combining naturalistic video stimuli and stimulus-independent latent factors**|Finn Schmidt et.al.|[2410.16136v1](http://arxiv.org/abs/2410.16136v1)|null| +|**2024-10-21**|**Can Large Audio-Language Models Truly Hear? Tackling Hallucinations with Multi-Task Assessment and Stepwise Audio Reasoning**|Chun-Yi Kuan et.al.|[2410.16130v1](http://arxiv.org/abs/2410.16130v1)|null| +|**2024-10-21**|**Increasing Interpretability of Neural Networks By Approximating Human Visual Saliency**|Aidan Boyd et.al.|[2410.16115v1](http://arxiv.org/abs/2410.16115v1)|null| +|**2024-10-21**|**DUMBO: Making durable read-only transactions fly on hardware transactional memory**|João Barreto et.al.|[2410.16110v1](http://arxiv.org/abs/2410.16110v1)|null| +|**2024-10-21**|**Semantics of Sets of Programs**|Jinwoo Kim et.al.|[2410.16102v1](http://arxiv.org/abs/2410.16102v1)|null| +|**2024-10-21**|**Defending Against Attack on the Cloned: In-Band Active Man-in-the-Middle Detection for the Signal Protocol**|Wil Liam Teng et.al.|[2410.16098v1](http://arxiv.org/abs/2410.16098v1)|null| ## Few-shot Learning -### Few-shot Learning +### One-shot Learning |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | |**2024-10-21**|**FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors**|Chin-Yang Lin et.al.|[2410.16271v1](http://arxiv.org/abs/2410.16271v1)|null| @@ -929,7 +929,7 @@ |**2024-10-21**|**MagicPIG: LSH Sampling for Efficient LLM Generation**|Zhuoming Chen et.al.|[2410.16179v1](http://arxiv.org/abs/2410.16179v1)|null| |**2024-10-21**|**Learning How to Vote With Principles: Axiomatic Insights Into the Collective Decisions of Neural Networks**|Levin Hornischer et.al.|[2410.16170v1](http://arxiv.org/abs/2410.16170v1)|null| -### One-shot Learning +### Few-shot Learning |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | |**2024-10-21**|**FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors**|Chin-Yang Lin et.al.|[2410.16271v1](http://arxiv.org/abs/2410.16271v1)|null| @@ -997,14 +997,47 @@ |**2024-10-16**|**Systems with Switching Causal Relations: A Meta-Causal Perspective**|Moritz Willig et.al.|[2410.13054v1](http://arxiv.org/abs/2410.13054v1)|null| |**2024-10-16**|**Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception**|Jihao Zhao et.al.|[2410.12788v1](http://arxiv.org/abs/2410.12788v1)|[link](https://github.com/IAAR-Shanghai/Meta-Chunking)| -## Transfer Learning +## Unsupervised Learning -### Transfer Learning +### GAN +|Publish Date|Title|Authors|PDF|Code| +| :---: | :---: | :---: | :---: | :---: | +|**2024-10-21**|**MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors**|Honghua Chen et.al.|[2410.16272v1](http://arxiv.org/abs/2410.16272v1)|null| +|**2024-10-21**|**Imaging supermoire relaxation and conductive domain walls in helical trilayer graphene**|Jesse C. Hoke et.al.|[2410.16269v1](http://arxiv.org/abs/2410.16269v1)|null| +|**2024-10-21**|**The microscale organization of directed hypergraphs**|Quintino Francesco Lotito et.al.|[2410.16258v1](http://arxiv.org/abs/2410.16258v1)|null| +|**2024-10-21**|**Revisiting Deep Feature Reconstruction for Logical and Structural Industrial Anomaly Detection**|Sukanya Patra et.al.|[2410.16255v1](http://arxiv.org/abs/2410.16255v1)|[link](https://github.com/sukanyapatra1997/ulsad-2024)| +|**2024-10-21**|**Hyperparameter Optimisation in Deep Learning from Ensemble Methods: Applications to Proton Structure**|Juan Cruz-Martinez et.al.|[2410.16248v1](http://arxiv.org/abs/2410.16248v1)|null| +|**2024-10-21**|**Implicit Regularization for Tubal Tensor Factorizations via Gradient Descent**|Santhosh Karnik et.al.|[2410.16247v1](http://arxiv.org/abs/2410.16247v1)|null| +|**2024-10-21**|**Separations in query complexity for total search problems**|Shalev Ben-David et.al.|[2410.16245v1](http://arxiv.org/abs/2410.16245v1)|null| +|**2024-10-21**|**IBGP: Imperfect Byzantine Generals Problem for Zero-Shot Robustness in Communicative Multi-Agent Systems**|Yihuan Mao et.al.|[2410.16237v1](http://arxiv.org/abs/2410.16237v1)|null| +|**2024-10-21**|**A Quantum Optimization Algorithm for Optimal Electric Vehicle Charging Station Placement for Intercity Trips**|Tina Radvand et.al.|[2410.16231v1](http://arxiv.org/abs/2410.16231v1)|null| +|**2024-10-21**|**Managing Bandwidth: The Key to Cloud-Assisted Autonomous Driving**|Alexander Krentsel et.al.|[2410.16227v1](http://arxiv.org/abs/2410.16227v1)|null| +|**2024-10-21**|**Theoretical Limitations of Ensembles in the Age of Overparameterization**|Niclas Dern et.al.|[2410.16201v1](http://arxiv.org/abs/2410.16201v1)|null| +|**2024-10-21**|**Training Better Deep Learning Models Using Human Saliency**|Aidan Boyd et.al.|[2410.16190v1](http://arxiv.org/abs/2410.16190v1)|null| +|**2024-10-21**|**Spiking Neural Networks as a Controller for Emergent Swarm Agents**|Kevin Zhu et.al.|[2410.16175v1](http://arxiv.org/abs/2410.16175v1)|null| +|**2024-10-21**|**Learning How to Vote With Principles: Axiomatic Insights Into the Collective Decisions of Neural Networks**|Levin Hornischer et.al.|[2410.16170v1](http://arxiv.org/abs/2410.16170v1)|null| +|**2024-10-21**|**Metric as Transform: Exploring beyond Affine Transform for Interpretable Neural Network**|Suman Sapkota et.al.|[2410.16159v1](http://arxiv.org/abs/2410.16159v1)|null| +|**2024-10-21**|**Networks: The Visual Language of Complexity**|Blai Vidiella et.al.|[2410.16158v1](http://arxiv.org/abs/2410.16158v1)|null| +|**2024-10-21**|**A Troublemaker with Contagious Jailbreak Makes Chaos in Honest Towns**|Tianyi Men et.al.|[2410.16155v1](http://arxiv.org/abs/2410.16155v1)|null| +|**2024-10-21**|**Unsupervised Replay Strategies for Continual Learning with Limited Data**|Anthony Bazhenov et.al.|[2410.16154v1](http://arxiv.org/abs/2410.16154v1)|null| +|**2024-10-21**|**Small Contributions, Small Networks: Efficient Neural Network Pruning Based on Relative Importance**|Mostafa Hussien et.al.|[2410.16151v1](http://arxiv.org/abs/2410.16151v1)|null| +|**2024-10-21**|**Modelling Structured Data Learning with Restricted Boltzmann Machines in the Teacher-Student Setting**|Robin Thériault et.al.|[2410.16150v1](http://arxiv.org/abs/2410.16150v1)|null| +|**2024-10-21**|**Towards Combating Frequency Simplicity-biased Learning for Domain Generalization**|Xilin He et.al.|[2410.16146v1](http://arxiv.org/abs/2410.16146v1)|[link](https://github.com/c0notsilly/advfrequency)| +|**2024-10-21**|**An Explainable Contrastive-based Dilated Convolutional Network with Transformer for Pediatric Pneumonia Detection**|Chandravardhan Singh Raghaw et.al.|[2410.16143v1](http://arxiv.org/abs/2410.16143v1)|null| +|**2024-10-21**|**AdChain: Decentralized Header Bidding**|Behkish Nassirzadeh et.al.|[2410.16141v1](http://arxiv.org/abs/2410.16141v1)|null| +|**2024-10-21**|**Cooperative Multistatic Target Detection in Cell-Free Communication Networks**|Tianyu Yang et.al.|[2410.16140v1](http://arxiv.org/abs/2410.16140v1)|null| +|**2024-10-21**|**Theoretical Insights into Line Graph Transformation on Graph Learning**|Fan Yang et.al.|[2410.16138v1](http://arxiv.org/abs/2410.16138v1)|[link](https://github.com/lukeyf/graphs-and-lines)| +|**2024-10-21**|**A Data-driven Crowd Simulation Framework Integrating Physics-informed Machine Learning with Navigation Potential Fields**|Runkang Guo et.al.|[2410.16132v1](http://arxiv.org/abs/2410.16132v1)|null| +|**2024-10-21**|**Multimodal Flare Forecasting with Deep Learning**|Grégoire Francisco et.al.|[2410.16116v1](http://arxiv.org/abs/2410.16116v1)|null| +|**2024-10-21**|**Increasing Interpretability of Neural Networks By Approximating Human Visual Saliency**|Aidan Boyd et.al.|[2410.16115v1](http://arxiv.org/abs/2410.16115v1)|null| +|**2024-10-21**|**Addressing Spectral Bias of Deep Neural Networks by Multi-Grade Deep Learning**|Ronglong Fang et.al.|[2410.16105v1](http://arxiv.org/abs/2410.16105v1)|[link](https://github.com/ronglong-fang/addressingspectralbiasviamgdl)| +|**2024-10-21**|**A Deep Unfolding-Based Scalarization Approach for Power Control in D2D Networks**|Jan Christian Hauffen et.al.|[2410.16104v1](http://arxiv.org/abs/2410.16104v1)|null| + +### Unsupervised Learning |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | |**2024-10-21**|**FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors**|Chin-Yang Lin et.al.|[2410.16271v1](http://arxiv.org/abs/2410.16271v1)|null| |**2024-10-21**|**xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs**|Michael S. Ryoo et.al.|[2410.16267v1](http://arxiv.org/abs/2410.16267v1)|null| -|**2024-10-21**|**Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance**|Zhangwei Gao et.al.|[2410.16261v1](http://arxiv.org/abs/2410.16261v1)|[link](https://github.com/opengvlab/internvl)| |**2024-10-21**|**Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos**|Gengshan Yang et.al.|[2410.16259v1](http://arxiv.org/abs/2410.16259v1)|null| |**2024-10-21**|**Elucidating the design space of language models for image generation**|Xuantong Liu et.al.|[2410.16257v1](http://arxiv.org/abs/2410.16257v1)|null| |**2024-10-21**|**Revisiting Deep Feature Reconstruction for Logical and Structural Industrial Anomaly Detection**|Sukanya Patra et.al.|[2410.16255v1](http://arxiv.org/abs/2410.16255v1)|[link](https://github.com/sukanyapatra1997/ulsad-2024)| @@ -1013,14 +1046,11 @@ |**2024-10-21**|**Hyperparameter Optimisation in Deep Learning from Ensemble Methods: Applications to Proton Structure**|Juan Cruz-Martinez et.al.|[2410.16248v1](http://arxiv.org/abs/2410.16248v1)|null| |**2024-10-21**|**Implicit Regularization for Tubal Tensor Factorizations via Gradient Descent**|Santhosh Karnik et.al.|[2410.16247v1](http://arxiv.org/abs/2410.16247v1)|null| |**2024-10-21**|**Analyzing Context Contributions in LLM-based Machine Translation**|Emmanouil Zaranis et.al.|[2410.16246v1](http://arxiv.org/abs/2410.16246v1)|null| -|**2024-10-21**|**Nonlinear Magnetics Model for Permanent Magnet Synchronous Machines Capturing Saturation and Temperature Effects**|Kishan Srinivasan et.al.|[2410.16240v1](http://arxiv.org/abs/2410.16240v1)|null| |**2024-10-21**|**MoRE: Multi-Modal Contrastive Pre-training with Transformers on X-Rays, ECGs, and Diagnostic Report**|Samrajya Thapa et.al.|[2410.16239v1](http://arxiv.org/abs/2410.16239v1)|null| |**2024-10-21**|**Deep Radiomics Detection of Clinically Significant Prostate Cancer on Multicenter MRI: Initial Comparison to PI-RADS Assessment**|G. A. Nketiah et.al.|[2410.16238v1](http://arxiv.org/abs/2410.16238v1)|null| -|**2024-10-21**|**LLaVA-KD: A Framework of Distilling Multimodal Large Language Models**|Yuxuan Cai et.al.|[2410.16236v1](http://arxiv.org/abs/2410.16236v1)|null| |**2024-10-21**|**ToW: Thoughts of Words Improve Reasoning in Large Language Models**|Zhikun Xu et.al.|[2410.16235v1](http://arxiv.org/abs/2410.16235v1)|null| |**2024-10-21**|**Building A Coding Assistant via the Retrieval-Augmented Language Model**|Xinze Li et.al.|[2410.16229v1](http://arxiv.org/abs/2410.16229v1)|null| |**2024-10-21**|**A Realistic Threat Model for Large Language Model Jailbreaks**|Valentyn Boreiko et.al.|[2410.16222v1](http://arxiv.org/abs/2410.16222v1)|null| -|**2024-10-21**|**On Creating an English-Thai Code-switched Machine Translation in Medical Domain**|Parinthapat Pengpun et.al.|[2410.16221v1](http://arxiv.org/abs/2410.16221v1)|null| |**2024-10-21**|**Sample Optimal and Memory Efficient Quantum State Tomography**|Yanglin Hu et.al.|[2410.16220v1](http://arxiv.org/abs/2410.16220v1)|null| |**2024-10-21**|**Pre-training Distillation for Large Language Models: A Design Space Exploration**|Hao Peng et.al.|[2410.16215v1](http://arxiv.org/abs/2410.16215v1)|null| |**2024-10-21**|**Asymmetries in Financial Spillovers**|Florian Huber et.al.|[2410.16214v1](http://arxiv.org/abs/2410.16214v1)|null| @@ -1032,14 +1062,19 @@ |**2024-10-21**|**Improve Vision Language Model Chain-of-thought Reasoning**|Ruohong Zhang et.al.|[2410.16198v1](http://arxiv.org/abs/2410.16198v1)|null| |**2024-10-21**|**A Trust-Region Method for Graphical Stein Variational Inference**|Liam Pavlovic et.al.|[2410.16195v1](http://arxiv.org/abs/2410.16195v1)|null| |**2024-10-21**|**Training Better Deep Learning Models Using Human Saliency**|Aidan Boyd et.al.|[2410.16190v1](http://arxiv.org/abs/2410.16190v1)|null| +|**2024-10-21**|**Interpretable Prediction Rule Ensembles in the Presence of Missing Data**|Vincent Schroeder et.al.|[2410.16187v1](http://arxiv.org/abs/2410.16187v1)|null| +|**2024-10-21**|**RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style**|Yantao Liu et.al.|[2410.16184v1](http://arxiv.org/abs/2410.16184v1)|[link](https://github.com/thu-keg/rm-bench)| +|**2024-10-21**|**MagicPIG: LSH Sampling for Efficient LLM Generation**|Zhuoming Chen et.al.|[2410.16179v1](http://arxiv.org/abs/2410.16179v1)|null| +|**2024-10-21**|**Learning How to Vote With Principles: Axiomatic Insights Into the Collective Decisions of Neural Networks**|Levin Hornischer et.al.|[2410.16170v1](http://arxiv.org/abs/2410.16170v1)|null| -## Unsupervised Learning +## Transfer Learning -### Unsupervised Learning +### Transfer Learning |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | |**2024-10-21**|**FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors**|Chin-Yang Lin et.al.|[2410.16271v1](http://arxiv.org/abs/2410.16271v1)|null| |**2024-10-21**|**xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs**|Michael S. Ryoo et.al.|[2410.16267v1](http://arxiv.org/abs/2410.16267v1)|null| +|**2024-10-21**|**Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance**|Zhangwei Gao et.al.|[2410.16261v1](http://arxiv.org/abs/2410.16261v1)|[link](https://github.com/opengvlab/internvl)| |**2024-10-21**|**Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos**|Gengshan Yang et.al.|[2410.16259v1](http://arxiv.org/abs/2410.16259v1)|null| |**2024-10-21**|**Elucidating the design space of language models for image generation**|Xuantong Liu et.al.|[2410.16257v1](http://arxiv.org/abs/2410.16257v1)|null| |**2024-10-21**|**Revisiting Deep Feature Reconstruction for Logical and Structural Industrial Anomaly Detection**|Sukanya Patra et.al.|[2410.16255v1](http://arxiv.org/abs/2410.16255v1)|[link](https://github.com/sukanyapatra1997/ulsad-2024)| @@ -1048,11 +1083,14 @@ |**2024-10-21**|**Hyperparameter Optimisation in Deep Learning from Ensemble Methods: Applications to Proton Structure**|Juan Cruz-Martinez et.al.|[2410.16248v1](http://arxiv.org/abs/2410.16248v1)|null| |**2024-10-21**|**Implicit Regularization for Tubal Tensor Factorizations via Gradient Descent**|Santhosh Karnik et.al.|[2410.16247v1](http://arxiv.org/abs/2410.16247v1)|null| |**2024-10-21**|**Analyzing Context Contributions in LLM-based Machine Translation**|Emmanouil Zaranis et.al.|[2410.16246v1](http://arxiv.org/abs/2410.16246v1)|null| +|**2024-10-21**|**Nonlinear Magnetics Model for Permanent Magnet Synchronous Machines Capturing Saturation and Temperature Effects**|Kishan Srinivasan et.al.|[2410.16240v1](http://arxiv.org/abs/2410.16240v1)|null| |**2024-10-21**|**MoRE: Multi-Modal Contrastive Pre-training with Transformers on X-Rays, ECGs, and Diagnostic Report**|Samrajya Thapa et.al.|[2410.16239v1](http://arxiv.org/abs/2410.16239v1)|null| |**2024-10-21**|**Deep Radiomics Detection of Clinically Significant Prostate Cancer on Multicenter MRI: Initial Comparison to PI-RADS Assessment**|G. A. Nketiah et.al.|[2410.16238v1](http://arxiv.org/abs/2410.16238v1)|null| +|**2024-10-21**|**LLaVA-KD: A Framework of Distilling Multimodal Large Language Models**|Yuxuan Cai et.al.|[2410.16236v1](http://arxiv.org/abs/2410.16236v1)|null| |**2024-10-21**|**ToW: Thoughts of Words Improve Reasoning in Large Language Models**|Zhikun Xu et.al.|[2410.16235v1](http://arxiv.org/abs/2410.16235v1)|null| |**2024-10-21**|**Building A Coding Assistant via the Retrieval-Augmented Language Model**|Xinze Li et.al.|[2410.16229v1](http://arxiv.org/abs/2410.16229v1)|null| |**2024-10-21**|**A Realistic Threat Model for Large Language Model Jailbreaks**|Valentyn Boreiko et.al.|[2410.16222v1](http://arxiv.org/abs/2410.16222v1)|null| +|**2024-10-21**|**On Creating an English-Thai Code-switched Machine Translation in Medical Domain**|Parinthapat Pengpun et.al.|[2410.16221v1](http://arxiv.org/abs/2410.16221v1)|null| |**2024-10-21**|**Sample Optimal and Memory Efficient Quantum State Tomography**|Yanglin Hu et.al.|[2410.16220v1](http://arxiv.org/abs/2410.16220v1)|null| |**2024-10-21**|**Pre-training Distillation for Large Language Models: A Design Space Exploration**|Hao Peng et.al.|[2410.16215v1](http://arxiv.org/abs/2410.16215v1)|null| |**2024-10-21**|**Asymmetries in Financial Spillovers**|Florian Huber et.al.|[2410.16214v1](http://arxiv.org/abs/2410.16214v1)|null| @@ -1064,50 +1102,114 @@ |**2024-10-21**|**Improve Vision Language Model Chain-of-thought Reasoning**|Ruohong Zhang et.al.|[2410.16198v1](http://arxiv.org/abs/2410.16198v1)|null| |**2024-10-21**|**A Trust-Region Method for Graphical Stein Variational Inference**|Liam Pavlovic et.al.|[2410.16195v1](http://arxiv.org/abs/2410.16195v1)|null| |**2024-10-21**|**Training Better Deep Learning Models Using Human Saliency**|Aidan Boyd et.al.|[2410.16190v1](http://arxiv.org/abs/2410.16190v1)|null| -|**2024-10-21**|**Interpretable Prediction Rule Ensembles in the Presence of Missing Data**|Vincent Schroeder et.al.|[2410.16187v1](http://arxiv.org/abs/2410.16187v1)|null| -|**2024-10-21**|**RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style**|Yantao Liu et.al.|[2410.16184v1](http://arxiv.org/abs/2410.16184v1)|[link](https://github.com/thu-keg/rm-bench)| -|**2024-10-21**|**MagicPIG: LSH Sampling for Efficient LLM Generation**|Zhuoming Chen et.al.|[2410.16179v1](http://arxiv.org/abs/2410.16179v1)|null| -|**2024-10-21**|**Learning How to Vote With Principles: Axiomatic Insights Into the Collective Decisions of Neural Networks**|Levin Hornischer et.al.|[2410.16170v1](http://arxiv.org/abs/2410.16170v1)|null| -### GAN +## Multi-modal + +### Vision-Language +|Publish Date|Title|Authors|PDF|Code| +| :---: | :---: | :---: | :---: | :---: | + +### Text and Image Generation |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | +|**2024-10-21**|**Origin of the metal-rich vs. metal-poor globular clusters dichotomies in the Milky Way: A sign of low black hole natal kicks**|Ali Rostami-Shirazi et.al.|[2410.16276v1](http://arxiv.org/abs/2410.16276v1)|null| +|**2024-10-21**|**The period-luminosity relation of long-period variables in the Large Magellanic Cloud observed with ATLAS**|Daniel Hey et.al.|[2410.16273v1](http://arxiv.org/abs/2410.16273v1)|null| +|**2024-10-21**|**Cosmic Shimmering: the Gravitational Wave Signal of Time-Resolved Cosmic Shear Observations**|Giorgio Mentasti et.al.|[2410.16274v1](http://arxiv.org/abs/2410.16274v1)|null| +|**2024-10-21**|**Multiparticle scalar dark matter with $\mathbb{Z}_N$ symmetry**|Subhaditya Bhattacharya et.al.|[2410.16275v1](http://arxiv.org/abs/2410.16275v1)|null| +|**2024-10-21**|**FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors**|Chin-Yang Lin et.al.|[2410.16271v1](http://arxiv.org/abs/2410.16271v1)|null| |**2024-10-21**|**MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors**|Honghua Chen et.al.|[2410.16272v1](http://arxiv.org/abs/2410.16272v1)|null| +|**2024-10-21**|**Reflection-Bench: probing AI intelligence with reflection**|Lingyu Li et.al.|[2410.16270v1](http://arxiv.org/abs/2410.16270v1)|[link](https://github.com/yabyum/reflectionbench)| |**2024-10-21**|**Imaging supermoire relaxation and conductive domain walls in helical trilayer graphene**|Jesse C. Hoke et.al.|[2410.16269v1](http://arxiv.org/abs/2410.16269v1)|null| +|**2024-10-21**|**SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree**|Shuangrui Ding et.al.|[2410.16268v1](http://arxiv.org/abs/2410.16268v1)|[link](https://github.com/mark12ding/sam2long)| +|**2024-10-21**|**xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs**|Michael S. Ryoo et.al.|[2410.16267v1](http://arxiv.org/abs/2410.16267v1)|null| +|**2024-10-21**|**3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors**|Xi Liu et.al.|[2410.16266v1](http://arxiv.org/abs/2410.16266v1)|null| +|**2024-10-21**|**Quantifying the advantages of applying quantum approximate algorithms to portfolio optimisation**|Haomu Yuan et.al.|[2410.16265v1](http://arxiv.org/abs/2410.16265v1)|null| +|**2024-10-21**|**Hyperbolicity in scalar-Gauss-Bonnet gravity: a gauge invariant study for spherical evolution**|Farid Thaalba et.al.|[2410.16264v1](http://arxiv.org/abs/2410.16264v1)|null| +|**2024-10-21**|**Surface acoustic waves Brillouin photonics on a silicon nitride chip**|Yvan Klaver et.al.|[2410.16263v1](http://arxiv.org/abs/2410.16263v1)|null| +|**2024-10-21**|**Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance**|Zhangwei Gao et.al.|[2410.16261v1](http://arxiv.org/abs/2410.16261v1)|[link](https://github.com/opengvlab/internvl)| +|**2024-10-21**|**Characterizing the Effect of Electrode Shift & Sensor Reapplication on Common sEMG Features in Lower Limb Muscles**|Fraser Douglas et.al.|[2410.16262v1](http://arxiv.org/abs/2410.16262v1)|null| +|**2024-10-21**|**Multi-product Zeno effect with higher order convergence rates**|Tim Möbus et.al.|[2410.16260v1](http://arxiv.org/abs/2410.16260v1)|null| +|**2024-10-21**|**Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos**|Gengshan Yang et.al.|[2410.16259v1](http://arxiv.org/abs/2410.16259v1)|null| |**2024-10-21**|**The microscale organization of directed hypergraphs**|Quintino Francesco Lotito et.al.|[2410.16258v1](http://arxiv.org/abs/2410.16258v1)|null| +|**2024-10-21**|**Elucidating the design space of language models for image generation**|Xuantong Liu et.al.|[2410.16257v1](http://arxiv.org/abs/2410.16257v1)|null| +|**2024-10-21**|**CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution**|Maosong Cao et.al.|[2410.16256v1](http://arxiv.org/abs/2410.16256v1)|[link](https://github.com/open-compass/compassjudger)| |**2024-10-21**|**Revisiting Deep Feature Reconstruction for Logical and Structural Industrial Anomaly Detection**|Sukanya Patra et.al.|[2410.16255v1](http://arxiv.org/abs/2410.16255v1)|[link](https://github.com/sukanyapatra1997/ulsad-2024)| +|**2024-10-21**|**Loss of 12 Starlink Satellites Due to Pre-conditioning of Intense Space Weather Activity Surrounding the Extreme Geomagnetic Storm of 10 May 2024**|Ayisha M Ashruf et.al.|[2410.16254v1](http://arxiv.org/abs/2410.16254v1)|null| +|**2024-10-21**|**Distribution Learning with Valid Outputs Beyond the Worst-Case**|Nick Rittler et.al.|[2410.16253v1](http://arxiv.org/abs/2410.16253v1)|null| +|**2024-10-21**|**Random Spin Committee Approach For Smooth Interatomic Potentials**|Vlad Cărare et.al.|[2410.16252v1](http://arxiv.org/abs/2410.16252v1)|null| +|**2024-10-21**|**Can Knowledge Editing Really Correct Hallucinations?**|Baixiang Huang et.al.|[2410.16251v1](http://arxiv.org/abs/2410.16251v1)|[link](https://github.com/llm-editing/HalluEditBench)| +|**2024-10-21**|**Cups and Gates I: Cohomology invariants and logical quantum operations**|Nikolas P. Breuckmann et.al.|[2410.16250v1](http://arxiv.org/abs/2410.16250v1)|null| +|**2024-10-21**|**Composing Optimized Stepsize Schedules for Gradient Descent**|Benjamin Grimmer et.al.|[2410.16249v1](http://arxiv.org/abs/2410.16249v1)|null| |**2024-10-21**|**Hyperparameter Optimisation in Deep Learning from Ensemble Methods: Applications to Proton Structure**|Juan Cruz-Martinez et.al.|[2410.16248v1](http://arxiv.org/abs/2410.16248v1)|null| |**2024-10-21**|**Implicit Regularization for Tubal Tensor Factorizations via Gradient Descent**|Santhosh Karnik et.al.|[2410.16247v1](http://arxiv.org/abs/2410.16247v1)|null| -|**2024-10-21**|**Separations in query complexity for total search problems**|Shalev Ben-David et.al.|[2410.16245v1](http://arxiv.org/abs/2410.16245v1)|null| -|**2024-10-21**|**IBGP: Imperfect Byzantine Generals Problem for Zero-Shot Robustness in Communicative Multi-Agent Systems**|Yihuan Mao et.al.|[2410.16237v1](http://arxiv.org/abs/2410.16237v1)|null| -|**2024-10-21**|**A Quantum Optimization Algorithm for Optimal Electric Vehicle Charging Station Placement for Intercity Trips**|Tina Radvand et.al.|[2410.16231v1](http://arxiv.org/abs/2410.16231v1)|null| -|**2024-10-21**|**Managing Bandwidth: The Key to Cloud-Assisted Autonomous Driving**|Alexander Krentsel et.al.|[2410.16227v1](http://arxiv.org/abs/2410.16227v1)|null| -|**2024-10-21**|**Theoretical Limitations of Ensembles in the Age of Overparameterization**|Niclas Dern et.al.|[2410.16201v1](http://arxiv.org/abs/2410.16201v1)|null| -|**2024-10-21**|**Training Better Deep Learning Models Using Human Saliency**|Aidan Boyd et.al.|[2410.16190v1](http://arxiv.org/abs/2410.16190v1)|null| -|**2024-10-21**|**Spiking Neural Networks as a Controller for Emergent Swarm Agents**|Kevin Zhu et.al.|[2410.16175v1](http://arxiv.org/abs/2410.16175v1)|null| -|**2024-10-21**|**Learning How to Vote With Principles: Axiomatic Insights Into the Collective Decisions of Neural Networks**|Levin Hornischer et.al.|[2410.16170v1](http://arxiv.org/abs/2410.16170v1)|null| -|**2024-10-21**|**Metric as Transform: Exploring beyond Affine Transform for Interpretable Neural Network**|Suman Sapkota et.al.|[2410.16159v1](http://arxiv.org/abs/2410.16159v1)|null| -|**2024-10-21**|**Networks: The Visual Language of Complexity**|Blai Vidiella et.al.|[2410.16158v1](http://arxiv.org/abs/2410.16158v1)|null| -|**2024-10-21**|**A Troublemaker with Contagious Jailbreak Makes Chaos in Honest Towns**|Tianyi Men et.al.|[2410.16155v1](http://arxiv.org/abs/2410.16155v1)|null| -|**2024-10-21**|**Unsupervised Replay Strategies for Continual Learning with Limited Data**|Anthony Bazhenov et.al.|[2410.16154v1](http://arxiv.org/abs/2410.16154v1)|null| -|**2024-10-21**|**Small Contributions, Small Networks: Efficient Neural Network Pruning Based on Relative Importance**|Mostafa Hussien et.al.|[2410.16151v1](http://arxiv.org/abs/2410.16151v1)|null| -|**2024-10-21**|**Modelling Structured Data Learning with Restricted Boltzmann Machines in the Teacher-Student Setting**|Robin Thériault et.al.|[2410.16150v1](http://arxiv.org/abs/2410.16150v1)|null| -|**2024-10-21**|**Towards Combating Frequency Simplicity-biased Learning for Domain Generalization**|Xilin He et.al.|[2410.16146v1](http://arxiv.org/abs/2410.16146v1)|[link](https://github.com/c0notsilly/advfrequency)| -|**2024-10-21**|**An Explainable Contrastive-based Dilated Convolutional Network with Transformer for Pediatric Pneumonia Detection**|Chandravardhan Singh Raghaw et.al.|[2410.16143v1](http://arxiv.org/abs/2410.16143v1)|null| -|**2024-10-21**|**AdChain: Decentralized Header Bidding**|Behkish Nassirzadeh et.al.|[2410.16141v1](http://arxiv.org/abs/2410.16141v1)|null| -|**2024-10-21**|**Cooperative Multistatic Target Detection in Cell-Free Communication Networks**|Tianyu Yang et.al.|[2410.16140v1](http://arxiv.org/abs/2410.16140v1)|null| -|**2024-10-21**|**Theoretical Insights into Line Graph Transformation on Graph Learning**|Fan Yang et.al.|[2410.16138v1](http://arxiv.org/abs/2410.16138v1)|[link](https://github.com/lukeyf/graphs-and-lines)| -|**2024-10-21**|**A Data-driven Crowd Simulation Framework Integrating Physics-informed Machine Learning with Navigation Potential Fields**|Runkang Guo et.al.|[2410.16132v1](http://arxiv.org/abs/2410.16132v1)|null| -|**2024-10-21**|**Multimodal Flare Forecasting with Deep Learning**|Grégoire Francisco et.al.|[2410.16116v1](http://arxiv.org/abs/2410.16116v1)|null| -|**2024-10-21**|**Increasing Interpretability of Neural Networks By Approximating Human Visual Saliency**|Aidan Boyd et.al.|[2410.16115v1](http://arxiv.org/abs/2410.16115v1)|null| -|**2024-10-21**|**Addressing Spectral Bias of Deep Neural Networks by Multi-Grade Deep Learning**|Ronglong Fang et.al.|[2410.16105v1](http://arxiv.org/abs/2410.16105v1)|[link](https://github.com/ronglong-fang/addressingspectralbiasviamgdl)| -|**2024-10-21**|**A Deep Unfolding-Based Scalarization Approach for Power Control in D2D Networks**|Jan Christian Hauffen et.al.|[2410.16104v1](http://arxiv.org/abs/2410.16104v1)|null| -## Multi-modal +### VQA +|Publish Date|Title|Authors|PDF|Code| +| :---: | :---: | :---: | :---: | :---: | +|**2024-10-21**|**xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs**|Michael S. Ryoo et.al.|[2410.16267v1](http://arxiv.org/abs/2410.16267v1)|null| +|**2024-10-21**|**Can Knowledge Editing Really Correct Hallucinations?**|Baixiang Huang et.al.|[2410.16251v1](http://arxiv.org/abs/2410.16251v1)|[link](https://github.com/llm-editing/HalluEditBench)| +|**2024-10-21**|**Sketch2Code: Evaluating Vision-Language Models for Interactive Web Design Prototyping**|Ryan Li et.al.|[2410.16232v1](http://arxiv.org/abs/2410.16232v1)|null| +|**2024-10-21**|**A Trust-Region Method for Graphical Stein Variational Inference**|Liam Pavlovic et.al.|[2410.16195v1](http://arxiv.org/abs/2410.16195v1)|null| +|**2024-10-21**|**RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style**|Yantao Liu et.al.|[2410.16184v1](http://arxiv.org/abs/2410.16184v1)|[link](https://github.com/thu-keg/rm-bench)| +|**2024-10-21**|**Correcting for Selection Biases in the Determination of the Hubble Constant from Time-Delay Cosmography**|ian Li et.al.|[2410.16171v1](http://arxiv.org/abs/2410.16171v1)|null| +|**2024-10-21**|**Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models**|Yufei Zhan et.al.|[2410.16163v1](http://arxiv.org/abs/2410.16163v1)|[link](https://github.com/jefferyzhan/griffon)| +|**2024-10-21**|**Modelling Structured Data Learning with Restricted Boltzmann Machines in the Teacher-Student Setting**|Robin Thériault et.al.|[2410.16150v1](http://arxiv.org/abs/2410.16150v1)|null| +|**2024-10-21**|**1-bit AI Infra: Part 1.1, Fast and Lossless BitNet b1.58 Inference on CPUs**|Jinheng Wang et.al.|[2410.16144v1](http://arxiv.org/abs/2410.16144v1)|[link](https://github.com/microsoft/bitnet)| +|**2024-10-21**|**Beyond 2:4: exploring V:N:M sparsity for efficient transformer inference on GPUs**|Kang Zhao et.al.|[2410.16135v1](http://arxiv.org/abs/2410.16135v1)|null| +|**2024-10-21**|**SMART: Self-learning Meta-strategy Agent for Reasoning Tasks**|Rongxing Liu et.al.|[2410.16128v1](http://arxiv.org/abs/2410.16128v1)|[link](https://github.com/kumar-shridhar/smart)| +|**2024-10-21**|**Do LLMs write like humans? Variation in grammatical and rhetorical styles**|Alex Reinhart et.al.|[2410.16107v1](http://arxiv.org/abs/2410.16107v1)|null| +|**2024-10-21**|**Statistical Inference for Temporal Difference Learning with Linear Function Approximation**|Weichen Wu et.al.|[2410.16106v1](http://arxiv.org/abs/2410.16106v1)|null| +|**2024-10-21**|**Streaming and Communication Complexity of Load-Balancing via Matching Contractors**|Sepehr Assadi et.al.|[2410.16094v1](http://arxiv.org/abs/2410.16094v1)|null| +|**2024-10-21**|**Fine-Tuning LLMs for Reliable Medical Question-Answering Services**|Ali Anaissi et.al.|[2410.16088v1](http://arxiv.org/abs/2410.16088v1)|null| +|**2024-10-21**|**Enhanced $S$-factor for the $^{14}$N$(p,γ)^{15}$O reaction and its impact on the solar composition problem**|X. Chen et.al.|[2410.16086v1](http://arxiv.org/abs/2410.16086v1)|null| +|**2024-10-21**|**Unveiling Ferroelectric-Like Behavior in Leaky Dielectrics: A Microscopic Model for Polarization Dynamics and Hysteresis Inversion**|Igor Ricardo Filgueira e Silva et.al.|[2410.16084v1](http://arxiv.org/abs/2410.16084v1)|null| +|**2024-10-21**|**CartesianMoE: Boosting Knowledge Sharing among Experts via Cartesian Product Routing in Mixture-of-Experts**|Zhenpeng Su et.al.|[2410.16077v1](http://arxiv.org/abs/2410.16077v1)|null| +|**2024-10-21**|**On the Geometry of Regularization in Adversarial Training: High-Dimensional Asymptotics and Generalization Bounds**|Matteo Vilucchio et.al.|[2410.16073v1](http://arxiv.org/abs/2410.16073v1)|null| +|**2024-10-21**|**ComPO: Community Preferences for Language Model Personalization**|Sachin Kumar et.al.|[2410.16027v1](http://arxiv.org/abs/2410.16027v1)|null| +|**2024-10-21**|**START: A Generalized State Space Model with Saliency-Driven Token-Aware Transformation**|Jintao Guo et.al.|[2410.16020v1](http://arxiv.org/abs/2410.16020v1)|null| +|**2024-10-21**|**Semiparametric Bayesian Inference for a Conditional Moment Equality Model**|Christopher D. Walker et.al.|[2410.16017v1](http://arxiv.org/abs/2410.16017v1)|null| +|**2024-10-21**|**Identifying new high-confidence polluted white dwarf candidates using Gaia XP spectra and Self-Organizing Maps**|Xabier Pérez-Couto et.al.|[2410.16015v1](http://arxiv.org/abs/2410.16015v1)|null| +|**2024-10-21**|**Are Language Model Logits Calibrated?**|Charles Lovering et.al.|[2410.16007v1](http://arxiv.org/abs/2410.16007v1)|null| +|**2024-10-21**|**Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering**|Yu Zhao et.al.|[2410.15999v1](http://arxiv.org/abs/2410.15999v1)|null| +|**2024-10-21**|**Augmenting Legal Decision Support Systems with LLM-based NLI for Analyzing Social Media Evidence**|Ram Mohan Rao Kadiyala et.al.|[2410.15990v1](http://arxiv.org/abs/2410.15990v1)|null| +|**2024-10-21**|**Granularity Matters in Long-Tail Learning**|Shizhen Zhao et.al.|[2410.15980v1](http://arxiv.org/abs/2410.15980v1)|null| +|**2024-10-21**|**Partial Orientation Retrieval of Proteins From Coulomb Explosions**|Tomas André et.al.|[2410.15965v1](http://arxiv.org/abs/2410.15965v1)|null| +|**2024-10-21**|**CamI2V: Camera-Controlled Image-to-Video Diffusion Model**|Guangcong Zheng et.al.|[2410.15957v1](http://arxiv.org/abs/2410.15957v1)|[link](https://github.com/ZGCTroy/CamI2V)| +|**2024-10-21**|**The mutual arrangement of Wright-Fisher diffusion path measures and its impact on parameter estimation**|Paul A. Jenkins et.al.|[2410.15955v1](http://arxiv.org/abs/2410.15955v1)|null| -### Vision-Language +### Multi-modal |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | +|**2024-10-21**|**Elucidating the design space of language models for image generation**|Xuantong Liu et.al.|[2410.16257v1](http://arxiv.org/abs/2410.16257v1)|null| +|**2024-10-21**|**MoRE: Multi-Modal Contrastive Pre-training with Transformers on X-Rays, ECGs, and Diagnostic Report**|Samrajya Thapa et.al.|[2410.16239v1](http://arxiv.org/abs/2410.16239v1)|null| +|**2024-10-21**|**A Framework for Evaluating Predictive Models Using Synthetic Image Covariates and Longitudinal Data**|Simon Deltadahl et.al.|[2410.16177v1](http://arxiv.org/abs/2410.16177v1)|null| +|**2024-10-21**|**Beyond Filtering: Adaptive Image-Text Quality Enhancement for MLLM Pretraining**|Han Huang et.al.|[2410.16166v1](http://arxiv.org/abs/2410.16166v1)|[link](https://github.com/hanhuang22/aitqe)| +|**2024-10-21**|**Multi-Sensor Fusion for UAV Classification Based on Feature Maps of Image and Radar Data**|Nikos Sakellariou et.al.|[2410.16089v1](http://arxiv.org/abs/2410.16089v1)|null| +|**2024-10-21**|**Deep Multimodal Representation Learning for Stellar Spectra**|Tobias Buck et.al.|[2410.16081v1](http://arxiv.org/abs/2410.16081v1)|null| +|**2024-10-21**|**Label Filling via Mixed Supervision for Medical Image Segmentation from Noisy Annotations**|Ming Li et.al.|[2410.16057v1](http://arxiv.org/abs/2410.16057v1)|null| +|**2024-10-21**|**Continuous Speech Synthesis using per-token Latent Diffusion**|Arnon Turetzky et.al.|[2410.16048v1](http://arxiv.org/abs/2410.16048v1)|null| +|**2024-10-21**|**Visual Representation Learning Guided By Multi-modal Prior Knowledge**|Hongkuan Zhou et.al.|[2410.15981v1](http://arxiv.org/abs/2410.15981v1)|null| +|**2024-10-21**|**Diffusion Transformer Policy**|Zhi Hou et.al.|[2410.15959v1](http://arxiv.org/abs/2410.15959v1)|null| +|**2024-10-21**|**How to Build a Pre-trained Multimodal model for Simultaneously Chatting and Decision-making?**|Zuojin Tang et.al.|[2410.15885v1](http://arxiv.org/abs/2410.15885v1)|null| +|**2024-10-21**|**Robust Loop Closure by Textual Cues in Challenging Environments**|Tongxing Jin et.al.|[2410.15869v1](http://arxiv.org/abs/2410.15869v1)|null| +|**2024-10-21**|**LiOn-XA: Unsupervised Domain Adaptation via LiDAR-Only Cross-Modal Adversarial Training**|Thomas Kreutz et.al.|[2410.15833v1](http://arxiv.org/abs/2410.15833v1)|[link](https://github.com/jensle97/lion-xa)| +|**2024-10-21**|**Kaninfradet3D:A Road-side Camera-LiDAR Fusion 3D Perception Model based on Nonlinear Feature Extraction and Intrinsic Correlation**|Pei Liu et.al.|[2410.15814v1](http://arxiv.org/abs/2410.15814v1)|null| +|**2024-10-21**|**Data-Efficient CLIP-Powered Dual-Branch Networks for Source-Free Unsupervised Domain Adaptation**|Yongguang Li et.al.|[2410.15811v1](http://arxiv.org/abs/2410.15811v1)|null| +|**2024-10-21**|**WildOcc: A Benchmark for Off-Road 3D Semantic Occupancy Prediction**|Heng Zhai et.al.|[2410.15792v1](http://arxiv.org/abs/2410.15792v1)|null| +|**2024-10-21**|**Generalizing Motion Planners with Mixture of Experts for Autonomous Driving**|Qiao Sun et.al.|[2410.15774v1](http://arxiv.org/abs/2410.15774v1)|null| +|**2024-10-21**|**Unleashing the Potential of Vision-Language Pre-Training for 3D Zero-Shot Lesion Segmentation via Mask-Attribute Alignment**|Yankai Jiang et.al.|[2410.15744v1](http://arxiv.org/abs/2410.15744v1)|null| +|**2024-10-21**|**AutoTrain: No-code training for state-of-the-art models**|Abhishek Thakur et.al.|[2410.15735v1](http://arxiv.org/abs/2410.15735v1)|[link](https://github.com/huggingface/autotrain-advanced)| +|**2024-10-21**|**Enhancing SNN-based Spatio-Temporal Learning: A Benchmark Dataset and Cross-Modality Attention Model**|Shibo Zhou et.al.|[2410.15689v1](http://arxiv.org/abs/2410.15689v1)|null| +|**2024-10-21**|**Voice-Enabled AI Agents can Perform Common Scams**|Richard Fang et.al.|[2410.15650v1](http://arxiv.org/abs/2410.15650v1)|null| +|**2024-10-21**|**Test-time Adaptation for Cross-modal Retrieval with Query Shift**|Haobin Li et.al.|[2410.15624v1](http://arxiv.org/abs/2410.15624v1)|null| +|**2024-10-21**|**Topology-Aware Exploration of Circle of Willis for CTA and MRA: Segmentation, Detection, and Classification**|Minghui Zhang et.al.|[2410.15614v1](http://arxiv.org/abs/2410.15614v1)|null| +|**2024-10-21**|**Multimodal Learning for Embryo Viability Prediction in Clinical IVF**|Junsik Kim et.al.|[2410.15581v1](http://arxiv.org/abs/2410.15581v1)|null| +|**2024-10-21**|**Bayesian Concept Bottleneck Models with LLM Priors**|Jean Feng et.al.|[2410.15555v1](http://arxiv.org/abs/2410.15555v1)|[link](https://github.com/jjfeng/bc-llm)| +|**2024-10-20**|**SceneGraMMi: Scene Graph-boosted Hybrid-fusion for Multi-Modal Misinformation Veracity Prediction**|Swarang Joshi et.al.|[2410.15517v1](http://arxiv.org/abs/2410.15517v1)|null| +|**2024-10-20**|**Evaluating Transferable Emotion Expressions for Zoomorphic Social Robots using VR Prototyping**|Shaun Macdonald et.al.|[2410.15486v1](http://arxiv.org/abs/2410.15486v1)|null| +|**2024-10-20**|**Generalized Multimodal Fusion via Poisson-Nernst-Planck Equation**|Jiayu Xiong et.al.|[2410.15475v1](http://arxiv.org/abs/2410.15475v1)|null| +|**2024-10-20**|**EVA: An Embodied World Model for Future Video Anticipation**|Xiaowei Chi et.al.|[2410.15461v1](http://arxiv.org/abs/2410.15461v1)|null| +|**2024-10-20**|**IPO: Interpretable Prompt Optimization for Vision-Language Models**|Yingjun Du et.al.|[2410.15397v1](http://arxiv.org/abs/2410.15397v1)|null| ### Alignment |Publish Date|Title|Authors|PDF|Code| @@ -1172,112 +1274,46 @@ |**2024-10-21**|**Addressing Spectral Bias of Deep Neural Networks by Multi-Grade Deep Learning**|Ronglong Fang et.al.|[2410.16105v1](http://arxiv.org/abs/2410.16105v1)|[link](https://github.com/ronglong-fang/addressingspectralbiasviamgdl)| |**2024-10-21**|**LMHaze: Intensity-aware Image Dehazing with a Large-scale Multi-intensity Real Haze Dataset**|Ruikun Zhang et.al.|[2410.16095v1](http://arxiv.org/abs/2410.16095v1)|[link](https://github.com/wangzrk/LMHaze)| |**2024-10-21**|**Multi-Sensor Fusion for UAV Classification Based on Feature Maps of Image and Radar Data**|Nikos Sakellariou et.al.|[2410.16089v1](http://arxiv.org/abs/2410.16089v1)|null| -|**2024-10-21**|**Deep Multimodal Representation Learning for Stellar Spectra**|Tobias Buck et.al.|[2410.16081v1](http://arxiv.org/abs/2410.16081v1)|null| -|**2024-10-21**|**Integrated Image-Text Based on Semi-supervised Learning for Small Sample Instance Segmentation**|Ruting Chi et.al.|[2410.16063v1](http://arxiv.org/abs/2410.16063v1)|null| -|**2024-10-21**|**Label Filling via Mixed Supervision for Medical Image Segmentation from Noisy Annotations**|Ming Li et.al.|[2410.16057v1](http://arxiv.org/abs/2410.16057v1)|null| -|**2024-10-21**|**Continuous Speech Synthesis using per-token Latent Diffusion**|Arnon Turetzky et.al.|[2410.16048v1](http://arxiv.org/abs/2410.16048v1)|null| - -### Text and Image Generation -|Publish Date|Title|Authors|PDF|Code| -| :---: | :---: | :---: | :---: | :---: | -|**2024-10-21**|**Origin of the metal-rich vs. metal-poor globular clusters dichotomies in the Milky Way: A sign of low black hole natal kicks**|Ali Rostami-Shirazi et.al.|[2410.16276v1](http://arxiv.org/abs/2410.16276v1)|null| -|**2024-10-21**|**The period-luminosity relation of long-period variables in the Large Magellanic Cloud observed with ATLAS**|Daniel Hey et.al.|[2410.16273v1](http://arxiv.org/abs/2410.16273v1)|null| -|**2024-10-21**|**Cosmic Shimmering: the Gravitational Wave Signal of Time-Resolved Cosmic Shear Observations**|Giorgio Mentasti et.al.|[2410.16274v1](http://arxiv.org/abs/2410.16274v1)|null| -|**2024-10-21**|**Multiparticle scalar dark matter with $\mathbb{Z}_N$ symmetry**|Subhaditya Bhattacharya et.al.|[2410.16275v1](http://arxiv.org/abs/2410.16275v1)|null| -|**2024-10-21**|**FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors**|Chin-Yang Lin et.al.|[2410.16271v1](http://arxiv.org/abs/2410.16271v1)|null| -|**2024-10-21**|**MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors**|Honghua Chen et.al.|[2410.16272v1](http://arxiv.org/abs/2410.16272v1)|null| -|**2024-10-21**|**Reflection-Bench: probing AI intelligence with reflection**|Lingyu Li et.al.|[2410.16270v1](http://arxiv.org/abs/2410.16270v1)|[link](https://github.com/yabyum/reflectionbench)| -|**2024-10-21**|**Imaging supermoire relaxation and conductive domain walls in helical trilayer graphene**|Jesse C. Hoke et.al.|[2410.16269v1](http://arxiv.org/abs/2410.16269v1)|null| -|**2024-10-21**|**SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree**|Shuangrui Ding et.al.|[2410.16268v1](http://arxiv.org/abs/2410.16268v1)|[link](https://github.com/mark12ding/sam2long)| -|**2024-10-21**|**xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs**|Michael S. Ryoo et.al.|[2410.16267v1](http://arxiv.org/abs/2410.16267v1)|null| -|**2024-10-21**|**3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors**|Xi Liu et.al.|[2410.16266v1](http://arxiv.org/abs/2410.16266v1)|null| -|**2024-10-21**|**Quantifying the advantages of applying quantum approximate algorithms to portfolio optimisation**|Haomu Yuan et.al.|[2410.16265v1](http://arxiv.org/abs/2410.16265v1)|null| -|**2024-10-21**|**Hyperbolicity in scalar-Gauss-Bonnet gravity: a gauge invariant study for spherical evolution**|Farid Thaalba et.al.|[2410.16264v1](http://arxiv.org/abs/2410.16264v1)|null| -|**2024-10-21**|**Surface acoustic waves Brillouin photonics on a silicon nitride chip**|Yvan Klaver et.al.|[2410.16263v1](http://arxiv.org/abs/2410.16263v1)|null| -|**2024-10-21**|**Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance**|Zhangwei Gao et.al.|[2410.16261v1](http://arxiv.org/abs/2410.16261v1)|[link](https://github.com/opengvlab/internvl)| -|**2024-10-21**|**Characterizing the Effect of Electrode Shift & Sensor Reapplication on Common sEMG Features in Lower Limb Muscles**|Fraser Douglas et.al.|[2410.16262v1](http://arxiv.org/abs/2410.16262v1)|null| -|**2024-10-21**|**Multi-product Zeno effect with higher order convergence rates**|Tim Möbus et.al.|[2410.16260v1](http://arxiv.org/abs/2410.16260v1)|null| -|**2024-10-21**|**Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos**|Gengshan Yang et.al.|[2410.16259v1](http://arxiv.org/abs/2410.16259v1)|null| -|**2024-10-21**|**The microscale organization of directed hypergraphs**|Quintino Francesco Lotito et.al.|[2410.16258v1](http://arxiv.org/abs/2410.16258v1)|null| -|**2024-10-21**|**Elucidating the design space of language models for image generation**|Xuantong Liu et.al.|[2410.16257v1](http://arxiv.org/abs/2410.16257v1)|null| -|**2024-10-21**|**CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution**|Maosong Cao et.al.|[2410.16256v1](http://arxiv.org/abs/2410.16256v1)|[link](https://github.com/open-compass/compassjudger)| -|**2024-10-21**|**Revisiting Deep Feature Reconstruction for Logical and Structural Industrial Anomaly Detection**|Sukanya Patra et.al.|[2410.16255v1](http://arxiv.org/abs/2410.16255v1)|[link](https://github.com/sukanyapatra1997/ulsad-2024)| -|**2024-10-21**|**Loss of 12 Starlink Satellites Due to Pre-conditioning of Intense Space Weather Activity Surrounding the Extreme Geomagnetic Storm of 10 May 2024**|Ayisha M Ashruf et.al.|[2410.16254v1](http://arxiv.org/abs/2410.16254v1)|null| -|**2024-10-21**|**Distribution Learning with Valid Outputs Beyond the Worst-Case**|Nick Rittler et.al.|[2410.16253v1](http://arxiv.org/abs/2410.16253v1)|null| -|**2024-10-21**|**Random Spin Committee Approach For Smooth Interatomic Potentials**|Vlad Cărare et.al.|[2410.16252v1](http://arxiv.org/abs/2410.16252v1)|null| -|**2024-10-21**|**Can Knowledge Editing Really Correct Hallucinations?**|Baixiang Huang et.al.|[2410.16251v1](http://arxiv.org/abs/2410.16251v1)|[link](https://github.com/llm-editing/HalluEditBench)| -|**2024-10-21**|**Cups and Gates I: Cohomology invariants and logical quantum operations**|Nikolas P. Breuckmann et.al.|[2410.16250v1](http://arxiv.org/abs/2410.16250v1)|null| -|**2024-10-21**|**Composing Optimized Stepsize Schedules for Gradient Descent**|Benjamin Grimmer et.al.|[2410.16249v1](http://arxiv.org/abs/2410.16249v1)|null| -|**2024-10-21**|**Hyperparameter Optimisation in Deep Learning from Ensemble Methods: Applications to Proton Structure**|Juan Cruz-Martinez et.al.|[2410.16248v1](http://arxiv.org/abs/2410.16248v1)|null| -|**2024-10-21**|**Implicit Regularization for Tubal Tensor Factorizations via Gradient Descent**|Santhosh Karnik et.al.|[2410.16247v1](http://arxiv.org/abs/2410.16247v1)|null| - -### VQA -|Publish Date|Title|Authors|PDF|Code| -| :---: | :---: | :---: | :---: | :---: | -|**2024-10-21**|**xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs**|Michael S. Ryoo et.al.|[2410.16267v1](http://arxiv.org/abs/2410.16267v1)|null| -|**2024-10-21**|**Can Knowledge Editing Really Correct Hallucinations?**|Baixiang Huang et.al.|[2410.16251v1](http://arxiv.org/abs/2410.16251v1)|[link](https://github.com/llm-editing/HalluEditBench)| -|**2024-10-21**|**Sketch2Code: Evaluating Vision-Language Models for Interactive Web Design Prototyping**|Ryan Li et.al.|[2410.16232v1](http://arxiv.org/abs/2410.16232v1)|null| -|**2024-10-21**|**A Trust-Region Method for Graphical Stein Variational Inference**|Liam Pavlovic et.al.|[2410.16195v1](http://arxiv.org/abs/2410.16195v1)|null| -|**2024-10-21**|**RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style**|Yantao Liu et.al.|[2410.16184v1](http://arxiv.org/abs/2410.16184v1)|[link](https://github.com/thu-keg/rm-bench)| -|**2024-10-21**|**Correcting for Selection Biases in the Determination of the Hubble Constant from Time-Delay Cosmography**|ian Li et.al.|[2410.16171v1](http://arxiv.org/abs/2410.16171v1)|null| -|**2024-10-21**|**Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models**|Yufei Zhan et.al.|[2410.16163v1](http://arxiv.org/abs/2410.16163v1)|[link](https://github.com/jefferyzhan/griffon)| -|**2024-10-21**|**Modelling Structured Data Learning with Restricted Boltzmann Machines in the Teacher-Student Setting**|Robin Thériault et.al.|[2410.16150v1](http://arxiv.org/abs/2410.16150v1)|null| -|**2024-10-21**|**1-bit AI Infra: Part 1.1, Fast and Lossless BitNet b1.58 Inference on CPUs**|Jinheng Wang et.al.|[2410.16144v1](http://arxiv.org/abs/2410.16144v1)|[link](https://github.com/microsoft/bitnet)| -|**2024-10-21**|**Beyond 2:4: exploring V:N:M sparsity for efficient transformer inference on GPUs**|Kang Zhao et.al.|[2410.16135v1](http://arxiv.org/abs/2410.16135v1)|null| -|**2024-10-21**|**SMART: Self-learning Meta-strategy Agent for Reasoning Tasks**|Rongxing Liu et.al.|[2410.16128v1](http://arxiv.org/abs/2410.16128v1)|[link](https://github.com/kumar-shridhar/smart)| -|**2024-10-21**|**Do LLMs write like humans? Variation in grammatical and rhetorical styles**|Alex Reinhart et.al.|[2410.16107v1](http://arxiv.org/abs/2410.16107v1)|null| -|**2024-10-21**|**Statistical Inference for Temporal Difference Learning with Linear Function Approximation**|Weichen Wu et.al.|[2410.16106v1](http://arxiv.org/abs/2410.16106v1)|null| -|**2024-10-21**|**Streaming and Communication Complexity of Load-Balancing via Matching Contractors**|Sepehr Assadi et.al.|[2410.16094v1](http://arxiv.org/abs/2410.16094v1)|null| -|**2024-10-21**|**Fine-Tuning LLMs for Reliable Medical Question-Answering Services**|Ali Anaissi et.al.|[2410.16088v1](http://arxiv.org/abs/2410.16088v1)|null| -|**2024-10-21**|**Enhanced $S$-factor for the $^{14}$N$(p,γ)^{15}$O reaction and its impact on the solar composition problem**|X. Chen et.al.|[2410.16086v1](http://arxiv.org/abs/2410.16086v1)|null| -|**2024-10-21**|**Unveiling Ferroelectric-Like Behavior in Leaky Dielectrics: A Microscopic Model for Polarization Dynamics and Hysteresis Inversion**|Igor Ricardo Filgueira e Silva et.al.|[2410.16084v1](http://arxiv.org/abs/2410.16084v1)|null| -|**2024-10-21**|**CartesianMoE: Boosting Knowledge Sharing among Experts via Cartesian Product Routing in Mixture-of-Experts**|Zhenpeng Su et.al.|[2410.16077v1](http://arxiv.org/abs/2410.16077v1)|null| -|**2024-10-21**|**On the Geometry of Regularization in Adversarial Training: High-Dimensional Asymptotics and Generalization Bounds**|Matteo Vilucchio et.al.|[2410.16073v1](http://arxiv.org/abs/2410.16073v1)|null| -|**2024-10-21**|**ComPO: Community Preferences for Language Model Personalization**|Sachin Kumar et.al.|[2410.16027v1](http://arxiv.org/abs/2410.16027v1)|null| -|**2024-10-21**|**START: A Generalized State Space Model with Saliency-Driven Token-Aware Transformation**|Jintao Guo et.al.|[2410.16020v1](http://arxiv.org/abs/2410.16020v1)|null| -|**2024-10-21**|**Semiparametric Bayesian Inference for a Conditional Moment Equality Model**|Christopher D. Walker et.al.|[2410.16017v1](http://arxiv.org/abs/2410.16017v1)|null| -|**2024-10-21**|**Identifying new high-confidence polluted white dwarf candidates using Gaia XP spectra and Self-Organizing Maps**|Xabier Pérez-Couto et.al.|[2410.16015v1](http://arxiv.org/abs/2410.16015v1)|null| -|**2024-10-21**|**Are Language Model Logits Calibrated?**|Charles Lovering et.al.|[2410.16007v1](http://arxiv.org/abs/2410.16007v1)|null| -|**2024-10-21**|**Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering**|Yu Zhao et.al.|[2410.15999v1](http://arxiv.org/abs/2410.15999v1)|null| -|**2024-10-21**|**Augmenting Legal Decision Support Systems with LLM-based NLI for Analyzing Social Media Evidence**|Ram Mohan Rao Kadiyala et.al.|[2410.15990v1](http://arxiv.org/abs/2410.15990v1)|null| -|**2024-10-21**|**Granularity Matters in Long-Tail Learning**|Shizhen Zhao et.al.|[2410.15980v1](http://arxiv.org/abs/2410.15980v1)|null| -|**2024-10-21**|**Partial Orientation Retrieval of Proteins From Coulomb Explosions**|Tomas André et.al.|[2410.15965v1](http://arxiv.org/abs/2410.15965v1)|null| -|**2024-10-21**|**CamI2V: Camera-Controlled Image-to-Video Diffusion Model**|Guangcong Zheng et.al.|[2410.15957v1](http://arxiv.org/abs/2410.15957v1)|[link](https://github.com/ZGCTroy/CamI2V)| -|**2024-10-21**|**The mutual arrangement of Wright-Fisher diffusion path measures and its impact on parameter estimation**|Paul A. Jenkins et.al.|[2410.15955v1](http://arxiv.org/abs/2410.15955v1)|null| +|**2024-10-21**|**Deep Multimodal Representation Learning for Stellar Spectra**|Tobias Buck et.al.|[2410.16081v1](http://arxiv.org/abs/2410.16081v1)|null| +|**2024-10-21**|**Integrated Image-Text Based on Semi-supervised Learning for Small Sample Instance Segmentation**|Ruting Chi et.al.|[2410.16063v1](http://arxiv.org/abs/2410.16063v1)|null| +|**2024-10-21**|**Label Filling via Mixed Supervision for Medical Image Segmentation from Noisy Annotations**|Ming Li et.al.|[2410.16057v1](http://arxiv.org/abs/2410.16057v1)|null| +|**2024-10-21**|**Continuous Speech Synthesis using per-token Latent Diffusion**|Arnon Turetzky et.al.|[2410.16048v1](http://arxiv.org/abs/2410.16048v1)|null| -### Multi-modal +## Contrastive Learning + +### Contrastive Learning |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | +|**2024-10-21**|**FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors**|Chin-Yang Lin et.al.|[2410.16271v1](http://arxiv.org/abs/2410.16271v1)|null| +|**2024-10-21**|**xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs**|Michael S. Ryoo et.al.|[2410.16267v1](http://arxiv.org/abs/2410.16267v1)|null| +|**2024-10-21**|**Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos**|Gengshan Yang et.al.|[2410.16259v1](http://arxiv.org/abs/2410.16259v1)|null| |**2024-10-21**|**Elucidating the design space of language models for image generation**|Xuantong Liu et.al.|[2410.16257v1](http://arxiv.org/abs/2410.16257v1)|null| +|**2024-10-21**|**Revisiting Deep Feature Reconstruction for Logical and Structural Industrial Anomaly Detection**|Sukanya Patra et.al.|[2410.16255v1](http://arxiv.org/abs/2410.16255v1)|[link](https://github.com/sukanyapatra1997/ulsad-2024)| +|**2024-10-21**|**Distribution Learning with Valid Outputs Beyond the Worst-Case**|Nick Rittler et.al.|[2410.16253v1](http://arxiv.org/abs/2410.16253v1)|null| +|**2024-10-21**|**Random Spin Committee Approach For Smooth Interatomic Potentials**|Vlad Cărare et.al.|[2410.16252v1](http://arxiv.org/abs/2410.16252v1)|null| +|**2024-10-21**|**Hyperparameter Optimisation in Deep Learning from Ensemble Methods: Applications to Proton Structure**|Juan Cruz-Martinez et.al.|[2410.16248v1](http://arxiv.org/abs/2410.16248v1)|null| +|**2024-10-21**|**Implicit Regularization for Tubal Tensor Factorizations via Gradient Descent**|Santhosh Karnik et.al.|[2410.16247v1](http://arxiv.org/abs/2410.16247v1)|null| +|**2024-10-21**|**Analyzing Context Contributions in LLM-based Machine Translation**|Emmanouil Zaranis et.al.|[2410.16246v1](http://arxiv.org/abs/2410.16246v1)|null| |**2024-10-21**|**MoRE: Multi-Modal Contrastive Pre-training with Transformers on X-Rays, ECGs, and Diagnostic Report**|Samrajya Thapa et.al.|[2410.16239v1](http://arxiv.org/abs/2410.16239v1)|null| -|**2024-10-21**|**A Framework for Evaluating Predictive Models Using Synthetic Image Covariates and Longitudinal Data**|Simon Deltadahl et.al.|[2410.16177v1](http://arxiv.org/abs/2410.16177v1)|null| -|**2024-10-21**|**Beyond Filtering: Adaptive Image-Text Quality Enhancement for MLLM Pretraining**|Han Huang et.al.|[2410.16166v1](http://arxiv.org/abs/2410.16166v1)|[link](https://github.com/hanhuang22/aitqe)| -|**2024-10-21**|**Multi-Sensor Fusion for UAV Classification Based on Feature Maps of Image and Radar Data**|Nikos Sakellariou et.al.|[2410.16089v1](http://arxiv.org/abs/2410.16089v1)|null| -|**2024-10-21**|**Deep Multimodal Representation Learning for Stellar Spectra**|Tobias Buck et.al.|[2410.16081v1](http://arxiv.org/abs/2410.16081v1)|null| -|**2024-10-21**|**Label Filling via Mixed Supervision for Medical Image Segmentation from Noisy Annotations**|Ming Li et.al.|[2410.16057v1](http://arxiv.org/abs/2410.16057v1)|null| -|**2024-10-21**|**Continuous Speech Synthesis using per-token Latent Diffusion**|Arnon Turetzky et.al.|[2410.16048v1](http://arxiv.org/abs/2410.16048v1)|null| -|**2024-10-21**|**Visual Representation Learning Guided By Multi-modal Prior Knowledge**|Hongkuan Zhou et.al.|[2410.15981v1](http://arxiv.org/abs/2410.15981v1)|null| -|**2024-10-21**|**Diffusion Transformer Policy**|Zhi Hou et.al.|[2410.15959v1](http://arxiv.org/abs/2410.15959v1)|null| -|**2024-10-21**|**How to Build a Pre-trained Multimodal model for Simultaneously Chatting and Decision-making?**|Zuojin Tang et.al.|[2410.15885v1](http://arxiv.org/abs/2410.15885v1)|null| -|**2024-10-21**|**Robust Loop Closure by Textual Cues in Challenging Environments**|Tongxing Jin et.al.|[2410.15869v1](http://arxiv.org/abs/2410.15869v1)|null| -|**2024-10-21**|**LiOn-XA: Unsupervised Domain Adaptation via LiDAR-Only Cross-Modal Adversarial Training**|Thomas Kreutz et.al.|[2410.15833v1](http://arxiv.org/abs/2410.15833v1)|[link](https://github.com/jensle97/lion-xa)| -|**2024-10-21**|**Kaninfradet3D:A Road-side Camera-LiDAR Fusion 3D Perception Model based on Nonlinear Feature Extraction and Intrinsic Correlation**|Pei Liu et.al.|[2410.15814v1](http://arxiv.org/abs/2410.15814v1)|null| -|**2024-10-21**|**Data-Efficient CLIP-Powered Dual-Branch Networks for Source-Free Unsupervised Domain Adaptation**|Yongguang Li et.al.|[2410.15811v1](http://arxiv.org/abs/2410.15811v1)|null| -|**2024-10-21**|**WildOcc: A Benchmark for Off-Road 3D Semantic Occupancy Prediction**|Heng Zhai et.al.|[2410.15792v1](http://arxiv.org/abs/2410.15792v1)|null| -|**2024-10-21**|**Generalizing Motion Planners with Mixture of Experts for Autonomous Driving**|Qiao Sun et.al.|[2410.15774v1](http://arxiv.org/abs/2410.15774v1)|null| -|**2024-10-21**|**Unleashing the Potential of Vision-Language Pre-Training for 3D Zero-Shot Lesion Segmentation via Mask-Attribute Alignment**|Yankai Jiang et.al.|[2410.15744v1](http://arxiv.org/abs/2410.15744v1)|null| -|**2024-10-21**|**AutoTrain: No-code training for state-of-the-art models**|Abhishek Thakur et.al.|[2410.15735v1](http://arxiv.org/abs/2410.15735v1)|[link](https://github.com/huggingface/autotrain-advanced)| -|**2024-10-21**|**Enhancing SNN-based Spatio-Temporal Learning: A Benchmark Dataset and Cross-Modality Attention Model**|Shibo Zhou et.al.|[2410.15689v1](http://arxiv.org/abs/2410.15689v1)|null| -|**2024-10-21**|**Voice-Enabled AI Agents can Perform Common Scams**|Richard Fang et.al.|[2410.15650v1](http://arxiv.org/abs/2410.15650v1)|null| -|**2024-10-21**|**Test-time Adaptation for Cross-modal Retrieval with Query Shift**|Haobin Li et.al.|[2410.15624v1](http://arxiv.org/abs/2410.15624v1)|null| -|**2024-10-21**|**Topology-Aware Exploration of Circle of Willis for CTA and MRA: Segmentation, Detection, and Classification**|Minghui Zhang et.al.|[2410.15614v1](http://arxiv.org/abs/2410.15614v1)|null| -|**2024-10-21**|**Multimodal Learning for Embryo Viability Prediction in Clinical IVF**|Junsik Kim et.al.|[2410.15581v1](http://arxiv.org/abs/2410.15581v1)|null| -|**2024-10-21**|**Bayesian Concept Bottleneck Models with LLM Priors**|Jean Feng et.al.|[2410.15555v1](http://arxiv.org/abs/2410.15555v1)|[link](https://github.com/jjfeng/bc-llm)| -|**2024-10-20**|**SceneGraMMi: Scene Graph-boosted Hybrid-fusion for Multi-Modal Misinformation Veracity Prediction**|Swarang Joshi et.al.|[2410.15517v1](http://arxiv.org/abs/2410.15517v1)|null| -|**2024-10-20**|**Evaluating Transferable Emotion Expressions for Zoomorphic Social Robots using VR Prototyping**|Shaun Macdonald et.al.|[2410.15486v1](http://arxiv.org/abs/2410.15486v1)|null| -|**2024-10-20**|**Generalized Multimodal Fusion via Poisson-Nernst-Planck Equation**|Jiayu Xiong et.al.|[2410.15475v1](http://arxiv.org/abs/2410.15475v1)|null| -|**2024-10-20**|**EVA: An Embodied World Model for Future Video Anticipation**|Xiaowei Chi et.al.|[2410.15461v1](http://arxiv.org/abs/2410.15461v1)|null| -|**2024-10-20**|**IPO: Interpretable Prompt Optimization for Vision-Language Models**|Yingjun Du et.al.|[2410.15397v1](http://arxiv.org/abs/2410.15397v1)|null| +|**2024-10-21**|**Deep Radiomics Detection of Clinically Significant Prostate Cancer on Multicenter MRI: Initial Comparison to PI-RADS Assessment**|G. A. Nketiah et.al.|[2410.16238v1](http://arxiv.org/abs/2410.16238v1)|null| +|**2024-10-21**|**ToW: Thoughts of Words Improve Reasoning in Large Language Models**|Zhikun Xu et.al.|[2410.16235v1](http://arxiv.org/abs/2410.16235v1)|null| +|**2024-10-21**|**Building A Coding Assistant via the Retrieval-Augmented Language Model**|Xinze Li et.al.|[2410.16229v1](http://arxiv.org/abs/2410.16229v1)|null| +|**2024-10-21**|**IR finite correlation functions in de Sitter space, a smooth massless limit, and an autonomous equation**|Alexander Kamenshchik et.al.|[2410.16226v1](http://arxiv.org/abs/2410.16226v1)|null| +|**2024-10-21**|**A Realistic Threat Model for Large Language Model Jailbreaks**|Valentyn Boreiko et.al.|[2410.16222v1](http://arxiv.org/abs/2410.16222v1)|null| +|**2024-10-21**|**Sample Optimal and Memory Efficient Quantum State Tomography**|Yanglin Hu et.al.|[2410.16220v1](http://arxiv.org/abs/2410.16220v1)|null| +|**2024-10-21**|**Pre-training Distillation for Large Language Models: A Design Space Exploration**|Hao Peng et.al.|[2410.16215v1](http://arxiv.org/abs/2410.16215v1)|null| +|**2024-10-21**|**Asymmetries in Financial Spillovers**|Florian Huber et.al.|[2410.16214v1](http://arxiv.org/abs/2410.16214v1)|null| +|**2024-10-21**|**Comprehensive benchmarking of large language models for RNA secondary structure prediction**|L. I. Zablocki et.al.|[2410.16212v1](http://arxiv.org/abs/2410.16212v1)|[link](https://github.com/sinc-lab/rna-llm-folding)| +|**2024-10-21**|**Compute-Constrained Data Selection**|Junjie Oscar Yin et.al.|[2410.16208v1](http://arxiv.org/abs/2410.16208v1)|[link](https://github.com/oseyosey/ccds)| +|**2024-10-21**|**CoT-TL: Low-Resource Temporal Knowledge Representation of Planning Instructions Using Chain-of-Thought Reasoning**|Kumar Manas et.al.|[2410.16207v1](http://arxiv.org/abs/2410.16207v1)|null| +|**2024-10-21**|**Systematic Review: Text Processing Algorithms in Machine Learning and Deep Learning for Mental Health Detection on Social Media**|Yuchen Cao et.al.|[2410.16204v1](http://arxiv.org/abs/2410.16204v1)|null| +|**2024-10-21**|**Theoretical Limitations of Ensembles in the Age of Overparameterization**|Niclas Dern et.al.|[2410.16201v1](http://arxiv.org/abs/2410.16201v1)|null| +|**2024-10-21**|**Improve Vision Language Model Chain-of-thought Reasoning**|Ruohong Zhang et.al.|[2410.16198v1](http://arxiv.org/abs/2410.16198v1)|null| +|**2024-10-21**|**A Trust-Region Method for Graphical Stein Variational Inference**|Liam Pavlovic et.al.|[2410.16195v1](http://arxiv.org/abs/2410.16195v1)|null| +|**2024-10-21**|**Training Better Deep Learning Models Using Human Saliency**|Aidan Boyd et.al.|[2410.16190v1](http://arxiv.org/abs/2410.16190v1)|null| +|**2024-10-21**|**Interpretable Prediction Rule Ensembles in the Presence of Missing Data**|Vincent Schroeder et.al.|[2410.16187v1](http://arxiv.org/abs/2410.16187v1)|null| +|**2024-10-21**|**RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style**|Yantao Liu et.al.|[2410.16184v1](http://arxiv.org/abs/2410.16184v1)|[link](https://github.com/thu-keg/rm-bench)| +|**2024-10-21**|**MagicPIG: LSH Sampling for Efficient LLM Generation**|Zhuoming Chen et.al.|[2410.16179v1](http://arxiv.org/abs/2410.16179v1)|null| ## Transformer @@ -1349,42 +1385,6 @@ |**2024-10-21**|**TS-ACL: A Time Series Analytic Continual Learning Framework for Privacy-Preserving and Class-Incremental Pattern Recognition**|Kejia Fan et.al.|[2410.15954v1](http://arxiv.org/abs/2410.15954v1)|null| |**2024-10-21**|**Redefining Finance: The Influence of Artificial Intelligence (AI) and Machine Learning (ML)**|Animesh Kumar et.al.|[2410.15951v1](http://arxiv.org/abs/2410.15951v1)|null| -## Graph Neural Network - -### Graph Neural Network -|Publish Date|Title|Authors|PDF|Code| -| :---: | :---: | :---: | :---: | :---: | -|**2024-10-21**|**FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors**|Chin-Yang Lin et.al.|[2410.16271v1](http://arxiv.org/abs/2410.16271v1)|null| -|**2024-10-21**|**MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors**|Honghua Chen et.al.|[2410.16272v1](http://arxiv.org/abs/2410.16272v1)|null| -|**2024-10-21**|**Imaging supermoire relaxation and conductive domain walls in helical trilayer graphene**|Jesse C. Hoke et.al.|[2410.16269v1](http://arxiv.org/abs/2410.16269v1)|null| -|**2024-10-21**|**The microscale organization of directed hypergraphs**|Quintino Francesco Lotito et.al.|[2410.16258v1](http://arxiv.org/abs/2410.16258v1)|null| -|**2024-10-21**|**Revisiting Deep Feature Reconstruction for Logical and Structural Industrial Anomaly Detection**|Sukanya Patra et.al.|[2410.16255v1](http://arxiv.org/abs/2410.16255v1)|[link](https://github.com/sukanyapatra1997/ulsad-2024)| -|**2024-10-21**|**Hyperparameter Optimisation in Deep Learning from Ensemble Methods: Applications to Proton Structure**|Juan Cruz-Martinez et.al.|[2410.16248v1](http://arxiv.org/abs/2410.16248v1)|null| -|**2024-10-21**|**Implicit Regularization for Tubal Tensor Factorizations via Gradient Descent**|Santhosh Karnik et.al.|[2410.16247v1](http://arxiv.org/abs/2410.16247v1)|null| -|**2024-10-21**|**IBGP: Imperfect Byzantine Generals Problem for Zero-Shot Robustness in Communicative Multi-Agent Systems**|Yihuan Mao et.al.|[2410.16237v1](http://arxiv.org/abs/2410.16237v1)|null| -|**2024-10-21**|**Unique subgraphs are rare**|Domagoj Bradač et.al.|[2410.16233v1](http://arxiv.org/abs/2410.16233v1)|null| -|**2024-10-21**|**A Quantum Optimization Algorithm for Optimal Electric Vehicle Charging Station Placement for Intercity Trips**|Tina Radvand et.al.|[2410.16231v1](http://arxiv.org/abs/2410.16231v1)|null| -|**2024-10-21**|**Managing Bandwidth: The Key to Cloud-Assisted Autonomous Driving**|Alexander Krentsel et.al.|[2410.16227v1](http://arxiv.org/abs/2410.16227v1)|null| -|**2024-10-21**|**On Creating an English-Thai Code-switched Machine Translation in Medical Domain**|Parinthapat Pengpun et.al.|[2410.16221v1](http://arxiv.org/abs/2410.16221v1)|null| -|**2024-10-21**|**Theoretical Limitations of Ensembles in the Age of Overparameterization**|Niclas Dern et.al.|[2410.16201v1](http://arxiv.org/abs/2410.16201v1)|null| -|**2024-10-21**|**Information for Conversation Generation: Proposals Utilising Knowledge Graphs**|Alex Clay et.al.|[2410.16196v1](http://arxiv.org/abs/2410.16196v1)|null| -|**2024-10-21**|**Training Better Deep Learning Models Using Human Saliency**|Aidan Boyd et.al.|[2410.16190v1](http://arxiv.org/abs/2410.16190v1)|null| -|**2024-10-21**|**Spiking Neural Networks as a Controller for Emergent Swarm Agents**|Kevin Zhu et.al.|[2410.16175v1](http://arxiv.org/abs/2410.16175v1)|null| -|**2024-10-21**|**Learning How to Vote With Principles: Axiomatic Insights Into the Collective Decisions of Neural Networks**|Levin Hornischer et.al.|[2410.16170v1](http://arxiv.org/abs/2410.16170v1)|null| -|**2024-10-21**|**Metric as Transform: Exploring beyond Affine Transform for Interpretable Neural Network**|Suman Sapkota et.al.|[2410.16159v1](http://arxiv.org/abs/2410.16159v1)|null| -|**2024-10-21**|**Networks: The Visual Language of Complexity**|Blai Vidiella et.al.|[2410.16158v1](http://arxiv.org/abs/2410.16158v1)|null| -|**2024-10-21**|**A Troublemaker with Contagious Jailbreak Makes Chaos in Honest Towns**|Tianyi Men et.al.|[2410.16155v1](http://arxiv.org/abs/2410.16155v1)|null| -|**2024-10-21**|**Unsupervised Replay Strategies for Continual Learning with Limited Data**|Anthony Bazhenov et.al.|[2410.16154v1](http://arxiv.org/abs/2410.16154v1)|null| -|**2024-10-21**|**Small Contributions, Small Networks: Efficient Neural Network Pruning Based on Relative Importance**|Mostafa Hussien et.al.|[2410.16151v1](http://arxiv.org/abs/2410.16151v1)|null| -|**2024-10-21**|**Modelling Structured Data Learning with Restricted Boltzmann Machines in the Teacher-Student Setting**|Robin Thériault et.al.|[2410.16150v1](http://arxiv.org/abs/2410.16150v1)|null| -|**2024-10-21**|**Towards Combating Frequency Simplicity-biased Learning for Domain Generalization**|Xilin He et.al.|[2410.16146v1](http://arxiv.org/abs/2410.16146v1)|[link](https://github.com/c0notsilly/advfrequency)| -|**2024-10-21**|**An Explainable Contrastive-based Dilated Convolutional Network with Transformer for Pediatric Pneumonia Detection**|Chandravardhan Singh Raghaw et.al.|[2410.16143v1](http://arxiv.org/abs/2410.16143v1)|null| -|**2024-10-21**|**AdChain: Decentralized Header Bidding**|Behkish Nassirzadeh et.al.|[2410.16141v1](http://arxiv.org/abs/2410.16141v1)|null| -|**2024-10-21**|**Cooperative Multistatic Target Detection in Cell-Free Communication Networks**|Tianyu Yang et.al.|[2410.16140v1](http://arxiv.org/abs/2410.16140v1)|null| -|**2024-10-21**|**Theoretical Insights into Line Graph Transformation on Graph Learning**|Fan Yang et.al.|[2410.16138v1](http://arxiv.org/abs/2410.16138v1)|[link](https://github.com/lukeyf/graphs-and-lines)| -|**2024-10-21**|**Modeling dynamic neural activity by combining naturalistic video stimuli and stimulus-independent latent factors**|Finn Schmidt et.al.|[2410.16136v1](http://arxiv.org/abs/2410.16136v1)|null| -|**2024-10-21**|**A Data-driven Crowd Simulation Framework Integrating Physics-informed Machine Learning with Navigation Potential Fields**|Runkang Guo et.al.|[2410.16132v1](http://arxiv.org/abs/2410.16132v1)|null| - ## Reinforcement Learning ### Reinforcement Learning @@ -1404,196 +1404,58 @@ |**2024-10-21**|**Long-distance Geomagnetic Navigation in GNSS-denied Environments with Deep Reinforcement Learning**|Wenqi Bai et.al.|[2410.15837v1](http://arxiv.org/abs/2410.15837v1)|null| |**2024-10-21**|**Offline reinforcement learning for job-shop scheduling problems**|Imanol Echeverria et.al.|[2410.15714v1](http://arxiv.org/abs/2410.15714v1)|null| |**2024-10-21**|**Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces**|Jifeng Hu et.al.|[2410.15698v1](http://arxiv.org/abs/2410.15698v1)|null| -|**2024-10-21**|**Understanding and Alleviating Memory Consumption in RLHF for LLMs**|Jin Zhou et.al.|[2410.15651v1](http://arxiv.org/abs/2410.15651v1)|null| -|**2024-10-21**|**Exploring Stronger Transformer Representation Learning for Occluded Person Re-Identificatio**|Zhangjian Ji et.al.|[2410.15613v1](http://arxiv.org/abs/2410.15613v1)|null| -|**2024-10-21**|**In-Trajectory Inverse Reinforcement Learning: Learn Incrementally From An Ongoing Trajectory**|Shicheng Liu et.al.|[2410.15612v1](http://arxiv.org/abs/2410.15612v1)|null| -|**2024-10-21**|**On The Global Convergence Of Online RLHF With Neural Parametrization**|Mudit Gaur et.al.|[2410.15610v1](http://arxiv.org/abs/2410.15610v1)|null| -|**2024-10-21**|**Reinforced Imitative Trajectory Planning for Urban Automated Driving**|Di Zeng et.al.|[2410.15607v1](http://arxiv.org/abs/2410.15607v1)|[link](https://github.com/zigned/nuplan_zigned)| -|**2024-10-21**|**Patrol Security Game: Defending Against Adversary with Freedom in Attack Timing, Location, and Duration**|Hao-Tsung Yang et.al.|[2410.15600v1](http://arxiv.org/abs/2410.15600v1)|null| -|**2024-10-21**|**A Comprehensive Survey of Datasets, Theories, Variants, and Applications in Direct Preference Optimization**|Wenyi Xiao et.al.|[2410.15595v1](http://arxiv.org/abs/2410.15595v1)|null| -|**2024-10-21**|**A Plug-and-Play Fully On-the-Job Real-Time Reinforcement Learning Algorithm for a Direct-Drive Tandem-Wing Experiment Platforms Under Multiple Random Operating Conditions**|Zhang Minghao et.al.|[2410.15554v1](http://arxiv.org/abs/2410.15554v1)|null| -|**2024-10-20**|**Reinforcement Learning for Dynamic Memory Allocation**|Arisrei Lim et.al.|[2410.15492v1](http://arxiv.org/abs/2410.15492v1)|null| -|**2024-10-20**|**Optimizing Backward Policies in GFlowNets via Trajectory Likelihood Maximization**|Timofei Gritsaev et.al.|[2410.15474v1](http://arxiv.org/abs/2410.15474v1)|null| -|**2024-10-20**|**AssemblyComplete: 3D Combinatorial Construction with Deep Reinforcement Learning**|Alan Chen et.al.|[2410.15469v1](http://arxiv.org/abs/2410.15469v1)|null| -|**2024-10-20**|**Heterogeneous Graph Reinforcement Learning for Dependency-aware Multi-task Allocation in Spatial Crowdsourcing**|Yong Zhao et.al.|[2410.15449v1](http://arxiv.org/abs/2410.15449v1)|null| -|**2024-10-20**|**Reinforcement Learning-Based REST API Testing with Multi-Coverage**|Tien-Quang Nguyen et.al.|[2410.15399v1](http://arxiv.org/abs/2410.15399v1)|null| -|**2024-10-20**|**A Distributed Primal-Dual Method for Constrained Multi-agent Reinforcement Learning with General Parameterization**|Ali Kahe et.al.|[2410.15335v1](http://arxiv.org/abs/2410.15335v1)|null| -|**2024-10-20**|**Training Language Models to Critique With Multi-agent Feedback**|Tian Lan et.al.|[2410.15287v1](http://arxiv.org/abs/2410.15287v1)|null| -|**2024-10-19**|**A Semidefinite Relaxation Approach for Fair Graph Clustering**|Sina Baharlouei et.al.|[2410.15233v1](http://arxiv.org/abs/2410.15233v1)|null| - -## Contrastive Learning - -### Contrastive Learning -|Publish Date|Title|Authors|PDF|Code| -| :---: | :---: | :---: | :---: | :---: | -|**2024-10-21**|**FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors**|Chin-Yang Lin et.al.|[2410.16271v1](http://arxiv.org/abs/2410.16271v1)|null| -|**2024-10-21**|**xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs**|Michael S. Ryoo et.al.|[2410.16267v1](http://arxiv.org/abs/2410.16267v1)|null| -|**2024-10-21**|**Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos**|Gengshan Yang et.al.|[2410.16259v1](http://arxiv.org/abs/2410.16259v1)|null| -|**2024-10-21**|**Elucidating the design space of language models for image generation**|Xuantong Liu et.al.|[2410.16257v1](http://arxiv.org/abs/2410.16257v1)|null| -|**2024-10-21**|**Revisiting Deep Feature Reconstruction for Logical and Structural Industrial Anomaly Detection**|Sukanya Patra et.al.|[2410.16255v1](http://arxiv.org/abs/2410.16255v1)|[link](https://github.com/sukanyapatra1997/ulsad-2024)| -|**2024-10-21**|**Distribution Learning with Valid Outputs Beyond the Worst-Case**|Nick Rittler et.al.|[2410.16253v1](http://arxiv.org/abs/2410.16253v1)|null| -|**2024-10-21**|**Random Spin Committee Approach For Smooth Interatomic Potentials**|Vlad Cărare et.al.|[2410.16252v1](http://arxiv.org/abs/2410.16252v1)|null| -|**2024-10-21**|**Hyperparameter Optimisation in Deep Learning from Ensemble Methods: Applications to Proton Structure**|Juan Cruz-Martinez et.al.|[2410.16248v1](http://arxiv.org/abs/2410.16248v1)|null| -|**2024-10-21**|**Implicit Regularization for Tubal Tensor Factorizations via Gradient Descent**|Santhosh Karnik et.al.|[2410.16247v1](http://arxiv.org/abs/2410.16247v1)|null| -|**2024-10-21**|**Analyzing Context Contributions in LLM-based Machine Translation**|Emmanouil Zaranis et.al.|[2410.16246v1](http://arxiv.org/abs/2410.16246v1)|null| -|**2024-10-21**|**MoRE: Multi-Modal Contrastive Pre-training with Transformers on X-Rays, ECGs, and Diagnostic Report**|Samrajya Thapa et.al.|[2410.16239v1](http://arxiv.org/abs/2410.16239v1)|null| -|**2024-10-21**|**Deep Radiomics Detection of Clinically Significant Prostate Cancer on Multicenter MRI: Initial Comparison to PI-RADS Assessment**|G. A. Nketiah et.al.|[2410.16238v1](http://arxiv.org/abs/2410.16238v1)|null| -|**2024-10-21**|**ToW: Thoughts of Words Improve Reasoning in Large Language Models**|Zhikun Xu et.al.|[2410.16235v1](http://arxiv.org/abs/2410.16235v1)|null| -|**2024-10-21**|**Building A Coding Assistant via the Retrieval-Augmented Language Model**|Xinze Li et.al.|[2410.16229v1](http://arxiv.org/abs/2410.16229v1)|null| -|**2024-10-21**|**IR finite correlation functions in de Sitter space, a smooth massless limit, and an autonomous equation**|Alexander Kamenshchik et.al.|[2410.16226v1](http://arxiv.org/abs/2410.16226v1)|null| -|**2024-10-21**|**A Realistic Threat Model for Large Language Model Jailbreaks**|Valentyn Boreiko et.al.|[2410.16222v1](http://arxiv.org/abs/2410.16222v1)|null| -|**2024-10-21**|**Sample Optimal and Memory Efficient Quantum State Tomography**|Yanglin Hu et.al.|[2410.16220v1](http://arxiv.org/abs/2410.16220v1)|null| -|**2024-10-21**|**Pre-training Distillation for Large Language Models: A Design Space Exploration**|Hao Peng et.al.|[2410.16215v1](http://arxiv.org/abs/2410.16215v1)|null| -|**2024-10-21**|**Asymmetries in Financial Spillovers**|Florian Huber et.al.|[2410.16214v1](http://arxiv.org/abs/2410.16214v1)|null| -|**2024-10-21**|**Comprehensive benchmarking of large language models for RNA secondary structure prediction**|L. I. Zablocki et.al.|[2410.16212v1](http://arxiv.org/abs/2410.16212v1)|[link](https://github.com/sinc-lab/rna-llm-folding)| -|**2024-10-21**|**Compute-Constrained Data Selection**|Junjie Oscar Yin et.al.|[2410.16208v1](http://arxiv.org/abs/2410.16208v1)|[link](https://github.com/oseyosey/ccds)| -|**2024-10-21**|**CoT-TL: Low-Resource Temporal Knowledge Representation of Planning Instructions Using Chain-of-Thought Reasoning**|Kumar Manas et.al.|[2410.16207v1](http://arxiv.org/abs/2410.16207v1)|null| -|**2024-10-21**|**Systematic Review: Text Processing Algorithms in Machine Learning and Deep Learning for Mental Health Detection on Social Media**|Yuchen Cao et.al.|[2410.16204v1](http://arxiv.org/abs/2410.16204v1)|null| -|**2024-10-21**|**Theoretical Limitations of Ensembles in the Age of Overparameterization**|Niclas Dern et.al.|[2410.16201v1](http://arxiv.org/abs/2410.16201v1)|null| -|**2024-10-21**|**Improve Vision Language Model Chain-of-thought Reasoning**|Ruohong Zhang et.al.|[2410.16198v1](http://arxiv.org/abs/2410.16198v1)|null| -|**2024-10-21**|**A Trust-Region Method for Graphical Stein Variational Inference**|Liam Pavlovic et.al.|[2410.16195v1](http://arxiv.org/abs/2410.16195v1)|null| -|**2024-10-21**|**Training Better Deep Learning Models Using Human Saliency**|Aidan Boyd et.al.|[2410.16190v1](http://arxiv.org/abs/2410.16190v1)|null| -|**2024-10-21**|**Interpretable Prediction Rule Ensembles in the Presence of Missing Data**|Vincent Schroeder et.al.|[2410.16187v1](http://arxiv.org/abs/2410.16187v1)|null| -|**2024-10-21**|**RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style**|Yantao Liu et.al.|[2410.16184v1](http://arxiv.org/abs/2410.16184v1)|[link](https://github.com/thu-keg/rm-bench)| -|**2024-10-21**|**MagicPIG: LSH Sampling for Efficient LLM Generation**|Zhuoming Chen et.al.|[2410.16179v1](http://arxiv.org/abs/2410.16179v1)|null| - -## Robotics - -### SLAM -|Publish Date|Title|Authors|PDF|Code| -| :---: | :---: | :---: | :---: | :---: | -|**2024-10-19**|**EndoMetric: Near-light metric scale monocular SLAM**|Raúl Iranzo et.al.|[2410.15065v1](http://arxiv.org/abs/2410.15065v1)|null| -|**2024-10-17**|**Automatic Navigation and Voice Cloning Technology Deployment on a Humanoid Robot**|Dongkun Han et.al.|[2410.13612v1](http://arxiv.org/abs/2410.13612v1)|null| -|**2024-10-17**|**TRLO: An Efficient LiDAR Odometry with 3D Dynamic Object Tracking and Removal**|Yanpeng Jia et.al.|[2410.13240v1](http://arxiv.org/abs/2410.13240v1)|null| -|**2024-10-16**|**QueensCAMP: an RGB-D dataset for robust Visual SLAM**|Hudson M. S. Bruno et.al.|[2410.12520v1](http://arxiv.org/abs/2410.12520v1)|[link](https://github.com/larocs/queenscamp-dataset)| -|**2024-10-16**|**PAPL-SLAM: Principal Axis-Anchored Monocular Point-Line SLAM**|Guanghao Li et.al.|[2410.12324v2](http://arxiv.org/abs/2410.12324v2)|null| -|**2024-10-16**|**Towards Autonomous Indoor Parking: A Globally Consistent Semantic SLAM System and A Semantic Localization Subsystem**|Yichen Sha et.al.|[2410.12169v1](http://arxiv.org/abs/2410.12169v1)|null| -|**2024-10-15**|**V3D-SLAM: Robust RGB-D SLAM in Dynamic Environments with 3D Semantic Geometry Voting**|Tuan Dang et.al.|[2410.12068v1](http://arxiv.org/abs/2410.12068v1)|[link](https://github.com/tuantdang/v3d-slam)| -|**2024-10-15**|**GSORB-SLAM: Gaussian Splatting SLAM benefits from ORB features and Transmittance information**|Wancai Zheng et.al.|[2410.11356v1](http://arxiv.org/abs/2410.11356v1)|null| -|**2024-10-15**|**Multiview Scene Graph**|Juexiao Zhang et.al.|[2410.11187v1](http://arxiv.org/abs/2410.11187v1)|null| -|**2024-10-14**|**MLP-SLAM: Multilayer Perceptron-Based Simultaneous Localization and Mapping With a Dynamic and Static Object Discriminator**|Taozhe Li et.al.|[2410.10669v1](http://arxiv.org/abs/2410.10669v1)|null| -|**2024-10-13**|**Markerless Aerial-Terrestrial Co-Registration of Forest Point Clouds using a Deformable Pose Graph**|Benoit Casseau et.al.|[2410.09896v1](http://arxiv.org/abs/2410.09896v1)|null| -|**2024-10-12**|**SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs**|Wenxi Chen et.al.|[2410.09503v1](http://arxiv.org/abs/2410.09503v1)|[link](https://github.com/X-LANCE/SLAM-LLM)| -|**2024-10-12**|**An Expeditious Spatial Mean Radiant Temperature Mapping Framework using Visual SLAM and Semantic Segmentation**|Wei Liang et.al.|[2410.09443v1](http://arxiv.org/abs/2410.09443v1)|null| -|**2024-10-12**|**ESVO2: Direct Visual-Inertial Odometry with Stereo Event Cameras**|Junkai Niu et.al.|[2410.09374v1](http://arxiv.org/abs/2410.09374v1)|[link](https://github.com/nail-hnu/esvo2)| -|**2024-10-11**|**Voxel-SLAM: A Complete, Accurate, and Versatile LiDAR-Inertial SLAM System**|Zheng Liu et.al.|[2410.08935v1](http://arxiv.org/abs/2410.08935v1)|[link](https://github.com/hku-mars/Voxel-SLAM)| -|**2024-10-11**|**Optimizing NeRF-based SLAM with Trajectory Smoothness Constraints**|Yicheng He et.al.|[2410.08780v1](http://arxiv.org/abs/2410.08780v1)|null| -|**2024-10-10**|**ROMAN: Open-Set Object Map Alignment for Robust View-Invariant Global Localization**|Mason B. Peterson et.al.|[2410.08262v1](http://arxiv.org/abs/2410.08262v1)|null| -|**2024-10-10**|**IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera**|Jian Huang et.al.|[2410.08107v2](http://arxiv.org/abs/2410.08107v2)|[link](https://github.com/wu-cvgl/inceventgs)| -|**2024-10-08**|**Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching**|Gongxin Yao et.al.|[2410.06285v1](http://arxiv.org/abs/2410.06285v1)|null| -|**2024-10-08**|**Submodular Optimization for Keyframe Selection & Usage in SLAM**|David Thorne et.al.|[2410.05576v1](http://arxiv.org/abs/2410.05576v1)|null| -|**2024-10-07**|**SharpSLAM: 3D Object-Oriented Visual SLAM with Deblurring for Agile Drones**|Denis Davletshin et.al.|[2410.05405v1](http://arxiv.org/abs/2410.05405v1)|null| -|**2024-10-07**|**Enhanced Multi-Robot SLAM System with Cross-Validation Matching and Exponential Threshold Keyframe Selection**|Ang He et.al.|[2410.05017v1](http://arxiv.org/abs/2410.05017v1)|null| -|**2024-10-05**|**A Framework for Reproducible Benchmarking and Performance Diagnosis of SLAM Systems**|Nikola Radulov et.al.|[2410.04242v1](http://arxiv.org/abs/2410.04242v1)|[link](https://github.com/nikolaradulov/slamfuse)| -|**2024-10-05**|**High-Speed Stereo Visual SLAM for Low-Powered Computing Devices**|Ashish Kumar et.al.|[2410.04090v1](http://arxiv.org/abs/2410.04090v1)|null| -|**2024-10-04**|**Estimating Body and Hand Motion in an Ego-sensed World**|Brent Yi et.al.|[2410.03665v2](http://arxiv.org/abs/2410.03665v2)|null| -|**2024-10-04**|**EvenNICER-SLAM: Event-based Neural Implicit Encoding SLAM**|Shi Chen et.al.|[2410.03812v1](http://arxiv.org/abs/2410.03812v1)|null| -|**2024-10-03**|**LiDAR Inertial Odometry And Mapping Using Learned Registration-Relevant Features**|Zihao Dong et.al.|[2410.02961v1](http://arxiv.org/abs/2410.02961v1)|null| -|**2024-10-02**|**ReFeree: Radar-Based Lightweight and Robust Localization using Feature and Free space**|Hogyun Kim et.al.|[2410.01325v1](http://arxiv.org/abs/2410.01325v1)|null| -|**2024-10-01**|**Under Pressure: Altimeter-Aided ICP for 3D Maps Consistency**|William Dubois et.al.|[2410.00758v1](http://arxiv.org/abs/2410.00758v1)|null| -|**2024-10-01**|**CaRtGS: Computational Alignment for Real-Time Gaussian Splatting SLAM**|Dapeng Feng et.al.|[2410.00486v2](http://arxiv.org/abs/2410.00486v2)|[link](https://github.com/DapengFeng/cartgs)| - -### SFM -|Publish Date|Title|Authors|PDF|Code| -| :---: | :---: | :---: | :---: | :---: | -|**2024-10-21**|**Origin of the metal-rich vs. metal-poor globular clusters dichotomies in the Milky Way: A sign of low black hole natal kicks**|Ali Rostami-Shirazi et.al.|[2410.16276v1](http://arxiv.org/abs/2410.16276v1)|null| -|**2024-10-21**|**The period-luminosity relation of long-period variables in the Large Magellanic Cloud observed with ATLAS**|Daniel Hey et.al.|[2410.16273v1](http://arxiv.org/abs/2410.16273v1)|null| -|**2024-10-21**|**MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors**|Honghua Chen et.al.|[2410.16272v1](http://arxiv.org/abs/2410.16272v1)|null| -|**2024-10-21**|**Reflection-Bench: probing AI intelligence with reflection**|Lingyu Li et.al.|[2410.16270v1](http://arxiv.org/abs/2410.16270v1)|[link](https://github.com/yabyum/reflectionbench)| -|**2024-10-21**|**SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree**|Shuangrui Ding et.al.|[2410.16268v1](http://arxiv.org/abs/2410.16268v1)|[link](https://github.com/mark12ding/sam2long)| -|**2024-10-21**|**3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors**|Xi Liu et.al.|[2410.16266v1](http://arxiv.org/abs/2410.16266v1)|null| -|**2024-10-21**|**Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance**|Zhangwei Gao et.al.|[2410.16261v1](http://arxiv.org/abs/2410.16261v1)|[link](https://github.com/opengvlab/internvl)| -|**2024-10-21**|**Characterizing the Effect of Electrode Shift & Sensor Reapplication on Common sEMG Features in Lower Limb Muscles**|Fraser Douglas et.al.|[2410.16262v1](http://arxiv.org/abs/2410.16262v1)|null| -|**2024-10-21**|**Multi-product Zeno effect with higher order convergence rates**|Tim Möbus et.al.|[2410.16260v1](http://arxiv.org/abs/2410.16260v1)|null| -|**2024-10-21**|**Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos**|Gengshan Yang et.al.|[2410.16259v1](http://arxiv.org/abs/2410.16259v1)|null| -|**2024-10-21**|**Elucidating the design space of language models for image generation**|Xuantong Liu et.al.|[2410.16257v1](http://arxiv.org/abs/2410.16257v1)|null| -|**2024-10-21**|**Revisiting Deep Feature Reconstruction for Logical and Structural Industrial Anomaly Detection**|Sukanya Patra et.al.|[2410.16255v1](http://arxiv.org/abs/2410.16255v1)|[link](https://github.com/sukanyapatra1997/ulsad-2024)| -|**2024-10-21**|**Loss of 12 Starlink Satellites Due to Pre-conditioning of Intense Space Weather Activity Surrounding the Extreme Geomagnetic Storm of 10 May 2024**|Ayisha M Ashruf et.al.|[2410.16254v1](http://arxiv.org/abs/2410.16254v1)|null| -|**2024-10-21**|**Can Knowledge Editing Really Correct Hallucinations?**|Baixiang Huang et.al.|[2410.16251v1](http://arxiv.org/abs/2410.16251v1)|[link](https://github.com/llm-editing/HalluEditBench)| -|**2024-10-21**|**Cups and Gates I: Cohomology invariants and logical quantum operations**|Nikolas P. Breuckmann et.al.|[2410.16250v1](http://arxiv.org/abs/2410.16250v1)|null| -|**2024-10-21**|**Composing Optimized Stepsize Schedules for Gradient Descent**|Benjamin Grimmer et.al.|[2410.16249v1](http://arxiv.org/abs/2410.16249v1)|null| -|**2024-10-21**|**Hyperparameter Optimisation in Deep Learning from Ensemble Methods: Applications to Proton Structure**|Juan Cruz-Martinez et.al.|[2410.16248v1](http://arxiv.org/abs/2410.16248v1)|null| -|**2024-10-21**|**Separations in query complexity for total search problems**|Shalev Ben-David et.al.|[2410.16245v1](http://arxiv.org/abs/2410.16245v1)|null| -|**2024-10-21**|**Semiconductor Circuits for Quantum Computing with Electronic Wave Packets**|David Pomaranski et.al.|[2410.16244v1](http://arxiv.org/abs/2410.16244v1)|null| -|**2024-10-21**|**Serendipitous detection of an intense X-ray flare in the weak-line T Tauri star KM Ori with SRG/eROSITA**|Savithri H. Ezhikode et.al.|[2410.16241v1](http://arxiv.org/abs/2410.16241v1)|null| -|**2024-10-21**|**Deep Radiomics Detection of Clinically Significant Prostate Cancer on Multicenter MRI: Initial Comparison to PI-RADS Assessment**|G. A. Nketiah et.al.|[2410.16238v1](http://arxiv.org/abs/2410.16238v1)|null| -|**2024-10-21**|**IBGP: Imperfect Byzantine Generals Problem for Zero-Shot Robustness in Communicative Multi-Agent Systems**|Yihuan Mao et.al.|[2410.16237v1](http://arxiv.org/abs/2410.16237v1)|null| -|**2024-10-21**|**LLaVA-KD: A Framework of Distilling Multimodal Large Language Models**|Yuxuan Cai et.al.|[2410.16236v1](http://arxiv.org/abs/2410.16236v1)|null| -|**2024-10-21**|**ToW: Thoughts of Words Improve Reasoning in Large Language Models**|Zhikun Xu et.al.|[2410.16235v1](http://arxiv.org/abs/2410.16235v1)|null| -|**2024-10-21**|**Using a quantum SWAP engine to experimentally validate thermodynamic uncertainty relations**|Krishna Shende et.al.|[2410.16230v1](http://arxiv.org/abs/2410.16230v1)|null| -|**2024-10-21**|**Building A Coding Assistant via the Retrieval-Augmented Language Model**|Xinze Li et.al.|[2410.16229v1](http://arxiv.org/abs/2410.16229v1)|null| -|**2024-10-21**|**Lipschitz Stability of Travel Time Data**|Joonas Ilmavirta et.al.|[2410.16224v1](http://arxiv.org/abs/2410.16224v1)|null| -|**2024-10-21**|**A Realistic Threat Model for Large Language Model Jailbreaks**|Valentyn Boreiko et.al.|[2410.16222v1](http://arxiv.org/abs/2410.16222v1)|null| -|**2024-10-21**|**PuLsE: Accurate and Robust Ultrasound-based Continuous Heart-Rate Monitoring on a Wrist-Worn IoT Device**|Marco Giordano et.al.|[2410.16219v1](http://arxiv.org/abs/2410.16219v1)|null| -|**2024-10-21**|**Hikita surjectivity for $\mathcal N /// T$**|Linus Setiabrata et.al.|[2410.16217v1](http://arxiv.org/abs/2410.16217v1)|null| - -### Robotics -|Publish Date|Title|Authors|PDF|Code| -| :---: | :---: | :---: | :---: | :---: | -|**2024-10-21**|**Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos**|Gengshan Yang et.al.|[2410.16259v1](http://arxiv.org/abs/2410.16259v1)|null| -|**2024-10-21**|**CoT-TL: Low-Resource Temporal Knowledge Representation of Planning Instructions Using Chain-of-Thought Reasoning**|Kumar Manas et.al.|[2410.16207v1](http://arxiv.org/abs/2410.16207v1)|null| -|**2024-10-21**|**LASER: Script Execution by Autonomous Agents for On-demand Traffic Simulation**|Hao Gao et.al.|[2410.16197v1](http://arxiv.org/abs/2410.16197v1)|null| -|**2024-10-21**|**Spiking Neural Networks as a Controller for Emergent Swarm Agents**|Kevin Zhu et.al.|[2410.16175v1](http://arxiv.org/abs/2410.16175v1)|null| -|**2024-10-21**|**Continuum Robot Shape Estimation Using Magnetic Ball Chains**|Giovanni Pittiglio et.al.|[2410.16025v1](http://arxiv.org/abs/2410.16025v1)|null| -|**2024-10-21**|**ARCADE: Scalable Demonstration Collection and Generation via Augmented Reality for Imitation Learning**|Yue Yang et.al.|[2410.15994v1](http://arxiv.org/abs/2410.15994v1)|null| -|**2024-10-21**|**Analyzing Closed-loop Training Techniques for Realistic Traffic Agent Models in Autonomous Highway Driving Simulations**|Matthias Bitzer et.al.|[2410.15987v1](http://arxiv.org/abs/2410.15987v1)|null| -|**2024-10-21**|**Learning Quadrotor Control From Visual Features Using Differentiable Simulation**|Johannes Heeg et.al.|[2410.15979v1](http://arxiv.org/abs/2410.15979v1)|null| -|**2024-10-21**|**Diffusion Transformer Policy**|Zhi Hou et.al.|[2410.15959v1](http://arxiv.org/abs/2410.15959v1)|null| -|**2024-10-21**|**Neural Predictor for Flight Control with Payload**|Ao Jin et.al.|[2410.15946v1](http://arxiv.org/abs/2410.15946v1)|null| -|**2024-10-21**|**Yeah, Un, Oh: Continuous and Real-time Backchannel Prediction with Fine-tuning of Voice Activity Projection**|Koji Inoue et.al.|[2410.15929v1](http://arxiv.org/abs/2410.15929v1)|null| -|**2024-10-21**|**Fully distributed and resilient source seeking for robot swarms**|Jesús Bautista et.al.|[2410.15921v1](http://arxiv.org/abs/2410.15921v1)|null| -|**2024-10-21**|**Bench4Merge: A Comprehensive Benchmark for Merging in Realistic Dense Traffic with Micro-Interactive Vehicles**|Zhengming Wang et.al.|[2410.15912v1](http://arxiv.org/abs/2410.15912v1)|null| -|**2024-10-21**|**Distributed Learning for UAV Swarms**|Chen Hu et.al.|[2410.15882v1](http://arxiv.org/abs/2410.15882v1)|null| -|**2024-10-21**|**Triplane Grasping: Efficient 6-DoF Grasping with Single RGB Images**|Yiming Li et.al.|[2410.15879v1](http://arxiv.org/abs/2410.15879v1)|null| -|**2024-10-21**|**Safety-critical Control with Control Barrier Functions: A Hierarchical Optimization Framework**|Junjun Xie et.al.|[2410.15877v1](http://arxiv.org/abs/2410.15877v1)|null| -|**2024-10-21**|**Robust Loop Closure by Textual Cues in Challenging Environments**|Tongxing Jin et.al.|[2410.15869v1](http://arxiv.org/abs/2410.15869v1)|null| -|**2024-10-21**|**Task-oriented Robotic Manipulation with Vision Language Models**|Nurhan Bulus Guran et.al.|[2410.15863v1](http://arxiv.org/abs/2410.15863v1)|null| -|**2024-10-21**|**Long-distance Geomagnetic Navigation in GNSS-denied Environments with Deep Reinforcement Learning**|Wenqi Bai et.al.|[2410.15837v1](http://arxiv.org/abs/2410.15837v1)|null| -|**2024-10-21**|**Assisted Physical Interaction: Autonomous Aerial Robots with Neural Network Detection, Navigation, and Safety Layers**|Andrea Berra et.al.|[2410.15802v1](http://arxiv.org/abs/2410.15802v1)|null| -|**2024-10-21**|**Flying through Moving Gates without Full State Estimation**|Ralf Römer et.al.|[2410.15799v1](http://arxiv.org/abs/2410.15799v1)|null| -|**2024-10-21**|**Design of a Flexible Robot Arm for Safe Aerial Physical Interaction**|Julien Mellet et.al.|[2410.15797v1](http://arxiv.org/abs/2410.15797v1)|null| -|**2024-10-21**|**WildOcc: A Benchmark for Off-Road 3D Semantic Occupancy Prediction**|Heng Zhai et.al.|[2410.15792v1](http://arxiv.org/abs/2410.15792v1)|null| -|**2024-10-21**|**Generalizing Motion Planners with Mixture of Experts for Autonomous Driving**|Qiao Sun et.al.|[2410.15774v1](http://arxiv.org/abs/2410.15774v1)|null| -|**2024-10-21**|**IPM-LSTM: A Learning-Based Interior Point Method for Solving Nonlinear Programs**|Xi Gao et.al.|[2410.15731v1](http://arxiv.org/abs/2410.15731v1)|null| -|**2024-10-21**|**MSGField: A Unified Scene Representation Integrating Motion, Semantics, and Geometry for Robotic Manipulation**|Yu Sheng et.al.|[2410.15730v1](http://arxiv.org/abs/2410.15730v1)|null| -|**2024-10-21**|**Efficient Non-Myopic Layered Bayesian Optimization For Large-Scale Bathymetric Informative Path Planning**|Alexander Kiessling et.al.|[2410.15720v1](http://arxiv.org/abs/2410.15720v1)|null| -|**2024-10-21**|**Hierarchical Search-Based Cooperative Motion Planning**|Yuchen Wu et.al.|[2410.15710v1](http://arxiv.org/abs/2410.15710v1)|null| -|**2024-10-21**|**PALMS: Plane-based Accessible Indoor Localization Using Mobile Smartphones**|Yunqian Cheng et.al.|[2410.15694v1](http://arxiv.org/abs/2410.15694v1)|[link](https://github.com/head-inthe-cloud/palms-indoor-localization)| -|**2024-10-21**|**RANSAC Back to SOTA: A Two-stage Consensus Filtering for Real-time 3D Registration**|Pengcheng Shi et.al.|[2410.15682v1](http://arxiv.org/abs/2410.15682v1)|[link](https://github.com/shipc-ai/tcf)| +|**2024-10-21**|**Understanding and Alleviating Memory Consumption in RLHF for LLMs**|Jin Zhou et.al.|[2410.15651v1](http://arxiv.org/abs/2410.15651v1)|null| +|**2024-10-21**|**Exploring Stronger Transformer Representation Learning for Occluded Person Re-Identificatio**|Zhangjian Ji et.al.|[2410.15613v1](http://arxiv.org/abs/2410.15613v1)|null| +|**2024-10-21**|**In-Trajectory Inverse Reinforcement Learning: Learn Incrementally From An Ongoing Trajectory**|Shicheng Liu et.al.|[2410.15612v1](http://arxiv.org/abs/2410.15612v1)|null| +|**2024-10-21**|**On The Global Convergence Of Online RLHF With Neural Parametrization**|Mudit Gaur et.al.|[2410.15610v1](http://arxiv.org/abs/2410.15610v1)|null| +|**2024-10-21**|**Reinforced Imitative Trajectory Planning for Urban Automated Driving**|Di Zeng et.al.|[2410.15607v1](http://arxiv.org/abs/2410.15607v1)|[link](https://github.com/zigned/nuplan_zigned)| +|**2024-10-21**|**Patrol Security Game: Defending Against Adversary with Freedom in Attack Timing, Location, and Duration**|Hao-Tsung Yang et.al.|[2410.15600v1](http://arxiv.org/abs/2410.15600v1)|null| +|**2024-10-21**|**A Comprehensive Survey of Datasets, Theories, Variants, and Applications in Direct Preference Optimization**|Wenyi Xiao et.al.|[2410.15595v1](http://arxiv.org/abs/2410.15595v1)|null| +|**2024-10-21**|**A Plug-and-Play Fully On-the-Job Real-Time Reinforcement Learning Algorithm for a Direct-Drive Tandem-Wing Experiment Platforms Under Multiple Random Operating Conditions**|Zhang Minghao et.al.|[2410.15554v1](http://arxiv.org/abs/2410.15554v1)|null| +|**2024-10-20**|**Reinforcement Learning for Dynamic Memory Allocation**|Arisrei Lim et.al.|[2410.15492v1](http://arxiv.org/abs/2410.15492v1)|null| +|**2024-10-20**|**Optimizing Backward Policies in GFlowNets via Trajectory Likelihood Maximization**|Timofei Gritsaev et.al.|[2410.15474v1](http://arxiv.org/abs/2410.15474v1)|null| +|**2024-10-20**|**AssemblyComplete: 3D Combinatorial Construction with Deep Reinforcement Learning**|Alan Chen et.al.|[2410.15469v1](http://arxiv.org/abs/2410.15469v1)|null| +|**2024-10-20**|**Heterogeneous Graph Reinforcement Learning for Dependency-aware Multi-task Allocation in Spatial Crowdsourcing**|Yong Zhao et.al.|[2410.15449v1](http://arxiv.org/abs/2410.15449v1)|null| +|**2024-10-20**|**Reinforcement Learning-Based REST API Testing with Multi-Coverage**|Tien-Quang Nguyen et.al.|[2410.15399v1](http://arxiv.org/abs/2410.15399v1)|null| +|**2024-10-20**|**A Distributed Primal-Dual Method for Constrained Multi-agent Reinforcement Learning with General Parameterization**|Ali Kahe et.al.|[2410.15335v1](http://arxiv.org/abs/2410.15335v1)|null| +|**2024-10-20**|**Training Language Models to Critique With Multi-agent Feedback**|Tian Lan et.al.|[2410.15287v1](http://arxiv.org/abs/2410.15287v1)|null| +|**2024-10-19**|**A Semidefinite Relaxation Approach for Fair Graph Clustering**|Sina Baharlouei et.al.|[2410.15233v1](http://arxiv.org/abs/2410.15233v1)|null| -### Visual Localization +## Graph Neural Network + +### Graph Neural Network |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | -|**2024-10-21**|**Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos**|Gengshan Yang et.al.|[2410.16259v1](http://arxiv.org/abs/2410.16259v1)|null| -|**2024-10-21**|**A new method of reconstructing images of gamma-ray telescopes applied to the LST-1 of CTAO**|CTA-LST Project et.al.|[2410.16042v1](http://arxiv.org/abs/2410.16042v1)|null| -|**2024-10-21**|**Diffusion Transformer Policy**|Zhi Hou et.al.|[2410.15959v1](http://arxiv.org/abs/2410.15959v1)|null| -|**2024-10-21**|**CamI2V: Camera-Controlled Image-to-Video Diffusion Model**|Guangcong Zheng et.al.|[2410.15957v1](http://arxiv.org/abs/2410.15957v1)|[link](https://github.com/ZGCTroy/CamI2V)| -|**2024-10-21**|**MINFLUX -- molecular resolution with minimal photons**|Lukas Scheiderer et.al.|[2410.15902v1](http://arxiv.org/abs/2410.15902v1)|null| -|**2024-10-21**|**R2I-rPPG: A Robust Region of Interest Selection Method for Remote Photoplethysmography to Extract Heart Rate**|Sandeep Nagar et.al.|[2410.15851v1](http://arxiv.org/abs/2410.15851v1)|null| -|**2024-10-21**|**Kaninfradet3D:A Road-side Camera-LiDAR Fusion 3D Perception Model based on Nonlinear Feature Extraction and Intrinsic Correlation**|Pei Liu et.al.|[2410.15814v1](http://arxiv.org/abs/2410.15814v1)|null| -|**2024-10-21**|**MSGField: A Unified Scene Representation Integrating Motion, Semantics, and Geometry for Robotic Manipulation**|Yu Sheng et.al.|[2410.15730v1](http://arxiv.org/abs/2410.15730v1)|null| -|**2024-10-21**|**Exploring Stronger Transformer Representation Learning for Occluded Person Re-Identificatio**|Zhangjian Ji et.al.|[2410.15613v1](http://arxiv.org/abs/2410.15613v1)|null| -|**2024-10-20**|**Exploring Curriculum Learning for Vision-Language Tasks: A Study on Small-Scale Multimodal Training**|Rohan Saha et.al.|[2410.15509v1](http://arxiv.org/abs/2410.15509v1)|null| -|**2024-10-20**|**Event-based Sensor Fusion and Application on Odometry: A Survey**|Jiaqiang Zhang et.al.|[2410.15480v1](http://arxiv.org/abs/2410.15480v1)|null| -|**2024-10-20**|**EF-3DGS: Event-Aided Free-Trajectory 3D Gaussian Splatting**|Bohao Liao et.al.|[2410.15392v1](http://arxiv.org/abs/2410.15392v1)|null| -|**2024-10-20**|**The Massive and Distant Clusters of WISE Survey 2: Second Data Release**|Khunanon Thongkham et.al.|[2410.15303v1](http://arxiv.org/abs/2410.15303v1)|null| -|**2024-10-19**|**Low-cost Robust Night-time Aerial Material Segmentation through Hyperspectral Data and Sparse Spatio-Temporal Learning**|Chandrajit Bajaj et.al.|[2410.15208v1](http://arxiv.org/abs/2410.15208v1)|null| -|**2024-10-19**|**EndoMetric: Near-light metric scale monocular SLAM**|Raúl Iranzo et.al.|[2410.15065v1](http://arxiv.org/abs/2410.15065v1)|null| -|**2024-10-19**|**Wave (from) Polarized Light Learning (WPLL) method: high resolution spatio-temporal measurements of water surface waves in laboratory setups**|Noam Ginio et.al.|[2410.14988v1](http://arxiv.org/abs/2410.14988v1)|null| -|**2024-10-19**|**Subversive Characters and Stereotyping Readers: Characterizing Queer Relationalities with Dialogue-Based Relation Extraction**|Kent K. Chang et.al.|[2410.14978v1](http://arxiv.org/abs/2410.14978v1)|null| -|**2024-10-19**|**3D Multi-Object Tracking Employing MS-GLMB Filter for Autonomous Driving**|Linh Van Ma et.al.|[2410.14977v1](http://arxiv.org/abs/2410.14977v1)|[link](https://github.com/linh-gist/ms-glmb-nuScenes)| -|**2024-10-19**|**AugInsert: Learning Robust Visual-Force Policies via Data Augmentation for Object Assembly Tasks**|Ryan Diaz et.al.|[2410.14968v1](http://arxiv.org/abs/2410.14968v1)|null| -|**2024-10-18**|**Neural Real-Time Recalibration for Infrared Multi-Camera Systems**|Benyamin Mehmandar et.al.|[2410.14505v1](http://arxiv.org/abs/2410.14505v1)|[link](https://github.com/theictlab/neural-recalibration)| -|**2024-10-18**|**FashionR2R: Texture-preserving Rendered-to-Real Image Translation with Diffusion Models**|Rui Hu et.al.|[2410.14429v1](http://arxiv.org/abs/2410.14429v1)|null| -|**2024-10-18**|**Variable Aperture Bokeh Rendering via Customized Focal Plane Guidance**|Kang Chen et.al.|[2410.14400v1](http://arxiv.org/abs/2410.14400v1)|[link](https://github.com/motong-ai-studio/vabm)| -|**2024-10-18**|**Non-Invasive Qualitative Vibration Analysis using Event Camera**|Dwijay Bane et.al.|[2410.14364v1](http://arxiv.org/abs/2410.14364v1)|null| -|**2024-10-18**|**Evaluating the evaluators: Towards human-aligned metrics for missing markers reconstruction**|Taras Kucherenko et.al.|[2410.14334v1](http://arxiv.org/abs/2410.14334v1)|null| -|**2024-10-18**|**Towards High-Speed Passive Visible Light Communication with Event Cameras and Digital Micro-Mirrors**|Yanxiang Wang et.al.|[2410.14228v2](http://arxiv.org/abs/2410.14228v2)|null| -|**2024-10-18**|**MambaSCI: Efficient Mamba-UNet for Quad-Bayer Patterned Video Snapshot Compressive Imaging**|Zhenghao Pan et.al.|[2410.14214v1](http://arxiv.org/abs/2410.14214v1)|null| -|**2024-10-18**|**MediTOD: An English Dialogue Dataset for Medical History Taking with Comprehensive Annotations**|Vishal Vivek Saley et.al.|[2410.14204v1](http://arxiv.org/abs/2410.14204v1)|null| -|**2024-10-17**|**UniDrive: Towards Universal Driving Perception Across Camera Configurations**|Ye Li et.al.|[2410.13864v1](http://arxiv.org/abs/2410.13864v1)|[link](https://github.com/ywyeli/unidrive)| -|**2024-10-17**|**VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding**|Runsen Xu et.al.|[2410.13860v1](http://arxiv.org/abs/2410.13860v1)|[link](https://github.com/openrobotlab/vlm-grounder)| -|**2024-10-17**|**VidPanos: Generative Panoramic Videos from Casual Panning Videos**|Jingwei Ma et.al.|[2410.13832v1](http://arxiv.org/abs/2410.13832v1)|null| +|**2024-10-21**|**FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors**|Chin-Yang Lin et.al.|[2410.16271v1](http://arxiv.org/abs/2410.16271v1)|null| +|**2024-10-21**|**MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors**|Honghua Chen et.al.|[2410.16272v1](http://arxiv.org/abs/2410.16272v1)|null| +|**2024-10-21**|**Imaging supermoire relaxation and conductive domain walls in helical trilayer graphene**|Jesse C. Hoke et.al.|[2410.16269v1](http://arxiv.org/abs/2410.16269v1)|null| +|**2024-10-21**|**The microscale organization of directed hypergraphs**|Quintino Francesco Lotito et.al.|[2410.16258v1](http://arxiv.org/abs/2410.16258v1)|null| +|**2024-10-21**|**Revisiting Deep Feature Reconstruction for Logical and Structural Industrial Anomaly Detection**|Sukanya Patra et.al.|[2410.16255v1](http://arxiv.org/abs/2410.16255v1)|[link](https://github.com/sukanyapatra1997/ulsad-2024)| +|**2024-10-21**|**Hyperparameter Optimisation in Deep Learning from Ensemble Methods: Applications to Proton Structure**|Juan Cruz-Martinez et.al.|[2410.16248v1](http://arxiv.org/abs/2410.16248v1)|null| +|**2024-10-21**|**Implicit Regularization for Tubal Tensor Factorizations via Gradient Descent**|Santhosh Karnik et.al.|[2410.16247v1](http://arxiv.org/abs/2410.16247v1)|null| +|**2024-10-21**|**IBGP: Imperfect Byzantine Generals Problem for Zero-Shot Robustness in Communicative Multi-Agent Systems**|Yihuan Mao et.al.|[2410.16237v1](http://arxiv.org/abs/2410.16237v1)|null| +|**2024-10-21**|**Unique subgraphs are rare**|Domagoj Bradač et.al.|[2410.16233v1](http://arxiv.org/abs/2410.16233v1)|null| +|**2024-10-21**|**A Quantum Optimization Algorithm for Optimal Electric Vehicle Charging Station Placement for Intercity Trips**|Tina Radvand et.al.|[2410.16231v1](http://arxiv.org/abs/2410.16231v1)|null| +|**2024-10-21**|**Managing Bandwidth: The Key to Cloud-Assisted Autonomous Driving**|Alexander Krentsel et.al.|[2410.16227v1](http://arxiv.org/abs/2410.16227v1)|null| +|**2024-10-21**|**On Creating an English-Thai Code-switched Machine Translation in Medical Domain**|Parinthapat Pengpun et.al.|[2410.16221v1](http://arxiv.org/abs/2410.16221v1)|null| +|**2024-10-21**|**Theoretical Limitations of Ensembles in the Age of Overparameterization**|Niclas Dern et.al.|[2410.16201v1](http://arxiv.org/abs/2410.16201v1)|null| +|**2024-10-21**|**Information for Conversation Generation: Proposals Utilising Knowledge Graphs**|Alex Clay et.al.|[2410.16196v1](http://arxiv.org/abs/2410.16196v1)|null| +|**2024-10-21**|**Training Better Deep Learning Models Using Human Saliency**|Aidan Boyd et.al.|[2410.16190v1](http://arxiv.org/abs/2410.16190v1)|null| +|**2024-10-21**|**Spiking Neural Networks as a Controller for Emergent Swarm Agents**|Kevin Zhu et.al.|[2410.16175v1](http://arxiv.org/abs/2410.16175v1)|null| +|**2024-10-21**|**Learning How to Vote With Principles: Axiomatic Insights Into the Collective Decisions of Neural Networks**|Levin Hornischer et.al.|[2410.16170v1](http://arxiv.org/abs/2410.16170v1)|null| +|**2024-10-21**|**Metric as Transform: Exploring beyond Affine Transform for Interpretable Neural Network**|Suman Sapkota et.al.|[2410.16159v1](http://arxiv.org/abs/2410.16159v1)|null| +|**2024-10-21**|**Networks: The Visual Language of Complexity**|Blai Vidiella et.al.|[2410.16158v1](http://arxiv.org/abs/2410.16158v1)|null| +|**2024-10-21**|**A Troublemaker with Contagious Jailbreak Makes Chaos in Honest Towns**|Tianyi Men et.al.|[2410.16155v1](http://arxiv.org/abs/2410.16155v1)|null| +|**2024-10-21**|**Unsupervised Replay Strategies for Continual Learning with Limited Data**|Anthony Bazhenov et.al.|[2410.16154v1](http://arxiv.org/abs/2410.16154v1)|null| +|**2024-10-21**|**Small Contributions, Small Networks: Efficient Neural Network Pruning Based on Relative Importance**|Mostafa Hussien et.al.|[2410.16151v1](http://arxiv.org/abs/2410.16151v1)|null| +|**2024-10-21**|**Modelling Structured Data Learning with Restricted Boltzmann Machines in the Teacher-Student Setting**|Robin Thériault et.al.|[2410.16150v1](http://arxiv.org/abs/2410.16150v1)|null| +|**2024-10-21**|**Towards Combating Frequency Simplicity-biased Learning for Domain Generalization**|Xilin He et.al.|[2410.16146v1](http://arxiv.org/abs/2410.16146v1)|[link](https://github.com/c0notsilly/advfrequency)| +|**2024-10-21**|**An Explainable Contrastive-based Dilated Convolutional Network with Transformer for Pediatric Pneumonia Detection**|Chandravardhan Singh Raghaw et.al.|[2410.16143v1](http://arxiv.org/abs/2410.16143v1)|null| +|**2024-10-21**|**AdChain: Decentralized Header Bidding**|Behkish Nassirzadeh et.al.|[2410.16141v1](http://arxiv.org/abs/2410.16141v1)|null| +|**2024-10-21**|**Cooperative Multistatic Target Detection in Cell-Free Communication Networks**|Tianyu Yang et.al.|[2410.16140v1](http://arxiv.org/abs/2410.16140v1)|null| +|**2024-10-21**|**Theoretical Insights into Line Graph Transformation on Graph Learning**|Fan Yang et.al.|[2410.16138v1](http://arxiv.org/abs/2410.16138v1)|[link](https://github.com/lukeyf/graphs-and-lines)| +|**2024-10-21**|**Modeling dynamic neural activity by combining naturalistic video stimuli and stimulus-independent latent factors**|Finn Schmidt et.al.|[2410.16136v1](http://arxiv.org/abs/2410.16136v1)|null| +|**2024-10-21**|**A Data-driven Crowd Simulation Framework Integrating Physics-informed Machine Learning with Navigation Potential Fields**|Runkang Guo et.al.|[2410.16132v1](http://arxiv.org/abs/2410.16132v1)|null| ## Medical Application @@ -1699,6 +1561,144 @@ |**2024-10-21**|**Continuous Speech Synthesis using per-token Latent Diffusion**|Arnon Turetzky et.al.|[2410.16048v1](http://arxiv.org/abs/2410.16048v1)|null| |**2024-10-21**|**Improving the Multi-label Atomic Activity Recognition by Robust Visual Feature and Advanced Attention @ ROAD++ Atomic Activity Recognition 2024**|Jiamin Cao et.al.|[2410.16037v1](http://arxiv.org/abs/2410.16037v1)|null| +## Robotics + +### Robotics +|Publish Date|Title|Authors|PDF|Code| +| :---: | :---: | :---: | :---: | :---: | +|**2024-10-21**|**Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos**|Gengshan Yang et.al.|[2410.16259v1](http://arxiv.org/abs/2410.16259v1)|null| +|**2024-10-21**|**CoT-TL: Low-Resource Temporal Knowledge Representation of Planning Instructions Using Chain-of-Thought Reasoning**|Kumar Manas et.al.|[2410.16207v1](http://arxiv.org/abs/2410.16207v1)|null| +|**2024-10-21**|**LASER: Script Execution by Autonomous Agents for On-demand Traffic Simulation**|Hao Gao et.al.|[2410.16197v1](http://arxiv.org/abs/2410.16197v1)|null| +|**2024-10-21**|**Spiking Neural Networks as a Controller for Emergent Swarm Agents**|Kevin Zhu et.al.|[2410.16175v1](http://arxiv.org/abs/2410.16175v1)|null| +|**2024-10-21**|**Continuum Robot Shape Estimation Using Magnetic Ball Chains**|Giovanni Pittiglio et.al.|[2410.16025v1](http://arxiv.org/abs/2410.16025v1)|null| +|**2024-10-21**|**ARCADE: Scalable Demonstration Collection and Generation via Augmented Reality for Imitation Learning**|Yue Yang et.al.|[2410.15994v1](http://arxiv.org/abs/2410.15994v1)|null| +|**2024-10-21**|**Analyzing Closed-loop Training Techniques for Realistic Traffic Agent Models in Autonomous Highway Driving Simulations**|Matthias Bitzer et.al.|[2410.15987v1](http://arxiv.org/abs/2410.15987v1)|null| +|**2024-10-21**|**Learning Quadrotor Control From Visual Features Using Differentiable Simulation**|Johannes Heeg et.al.|[2410.15979v1](http://arxiv.org/abs/2410.15979v1)|null| +|**2024-10-21**|**Diffusion Transformer Policy**|Zhi Hou et.al.|[2410.15959v1](http://arxiv.org/abs/2410.15959v1)|null| +|**2024-10-21**|**Neural Predictor for Flight Control with Payload**|Ao Jin et.al.|[2410.15946v1](http://arxiv.org/abs/2410.15946v1)|null| +|**2024-10-21**|**Yeah, Un, Oh: Continuous and Real-time Backchannel Prediction with Fine-tuning of Voice Activity Projection**|Koji Inoue et.al.|[2410.15929v1](http://arxiv.org/abs/2410.15929v1)|null| +|**2024-10-21**|**Fully distributed and resilient source seeking for robot swarms**|Jesús Bautista et.al.|[2410.15921v1](http://arxiv.org/abs/2410.15921v1)|null| +|**2024-10-21**|**Bench4Merge: A Comprehensive Benchmark for Merging in Realistic Dense Traffic with Micro-Interactive Vehicles**|Zhengming Wang et.al.|[2410.15912v1](http://arxiv.org/abs/2410.15912v1)|null| +|**2024-10-21**|**Distributed Learning for UAV Swarms**|Chen Hu et.al.|[2410.15882v1](http://arxiv.org/abs/2410.15882v1)|null| +|**2024-10-21**|**Triplane Grasping: Efficient 6-DoF Grasping with Single RGB Images**|Yiming Li et.al.|[2410.15879v1](http://arxiv.org/abs/2410.15879v1)|null| +|**2024-10-21**|**Safety-critical Control with Control Barrier Functions: A Hierarchical Optimization Framework**|Junjun Xie et.al.|[2410.15877v1](http://arxiv.org/abs/2410.15877v1)|null| +|**2024-10-21**|**Robust Loop Closure by Textual Cues in Challenging Environments**|Tongxing Jin et.al.|[2410.15869v1](http://arxiv.org/abs/2410.15869v1)|null| +|**2024-10-21**|**Task-oriented Robotic Manipulation with Vision Language Models**|Nurhan Bulus Guran et.al.|[2410.15863v1](http://arxiv.org/abs/2410.15863v1)|null| +|**2024-10-21**|**Long-distance Geomagnetic Navigation in GNSS-denied Environments with Deep Reinforcement Learning**|Wenqi Bai et.al.|[2410.15837v1](http://arxiv.org/abs/2410.15837v1)|null| +|**2024-10-21**|**Assisted Physical Interaction: Autonomous Aerial Robots with Neural Network Detection, Navigation, and Safety Layers**|Andrea Berra et.al.|[2410.15802v1](http://arxiv.org/abs/2410.15802v1)|null| +|**2024-10-21**|**Flying through Moving Gates without Full State Estimation**|Ralf Römer et.al.|[2410.15799v1](http://arxiv.org/abs/2410.15799v1)|null| +|**2024-10-21**|**Design of a Flexible Robot Arm for Safe Aerial Physical Interaction**|Julien Mellet et.al.|[2410.15797v1](http://arxiv.org/abs/2410.15797v1)|null| +|**2024-10-21**|**WildOcc: A Benchmark for Off-Road 3D Semantic Occupancy Prediction**|Heng Zhai et.al.|[2410.15792v1](http://arxiv.org/abs/2410.15792v1)|null| +|**2024-10-21**|**Generalizing Motion Planners with Mixture of Experts for Autonomous Driving**|Qiao Sun et.al.|[2410.15774v1](http://arxiv.org/abs/2410.15774v1)|null| +|**2024-10-21**|**IPM-LSTM: A Learning-Based Interior Point Method for Solving Nonlinear Programs**|Xi Gao et.al.|[2410.15731v1](http://arxiv.org/abs/2410.15731v1)|null| +|**2024-10-21**|**MSGField: A Unified Scene Representation Integrating Motion, Semantics, and Geometry for Robotic Manipulation**|Yu Sheng et.al.|[2410.15730v1](http://arxiv.org/abs/2410.15730v1)|null| +|**2024-10-21**|**Efficient Non-Myopic Layered Bayesian Optimization For Large-Scale Bathymetric Informative Path Planning**|Alexander Kiessling et.al.|[2410.15720v1](http://arxiv.org/abs/2410.15720v1)|null| +|**2024-10-21**|**Hierarchical Search-Based Cooperative Motion Planning**|Yuchen Wu et.al.|[2410.15710v1](http://arxiv.org/abs/2410.15710v1)|null| +|**2024-10-21**|**PALMS: Plane-based Accessible Indoor Localization Using Mobile Smartphones**|Yunqian Cheng et.al.|[2410.15694v1](http://arxiv.org/abs/2410.15694v1)|[link](https://github.com/head-inthe-cloud/palms-indoor-localization)| +|**2024-10-21**|**RANSAC Back to SOTA: A Two-stage Consensus Filtering for Real-time 3D Registration**|Pengcheng Shi et.al.|[2410.15682v1](http://arxiv.org/abs/2410.15682v1)|[link](https://github.com/shipc-ai/tcf)| + +### SFM +|Publish Date|Title|Authors|PDF|Code| +| :---: | :---: | :---: | :---: | :---: | +|**2024-10-21**|**Origin of the metal-rich vs. metal-poor globular clusters dichotomies in the Milky Way: A sign of low black hole natal kicks**|Ali Rostami-Shirazi et.al.|[2410.16276v1](http://arxiv.org/abs/2410.16276v1)|null| +|**2024-10-21**|**The period-luminosity relation of long-period variables in the Large Magellanic Cloud observed with ATLAS**|Daniel Hey et.al.|[2410.16273v1](http://arxiv.org/abs/2410.16273v1)|null| +|**2024-10-21**|**MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors**|Honghua Chen et.al.|[2410.16272v1](http://arxiv.org/abs/2410.16272v1)|null| +|**2024-10-21**|**Reflection-Bench: probing AI intelligence with reflection**|Lingyu Li et.al.|[2410.16270v1](http://arxiv.org/abs/2410.16270v1)|[link](https://github.com/yabyum/reflectionbench)| +|**2024-10-21**|**SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree**|Shuangrui Ding et.al.|[2410.16268v1](http://arxiv.org/abs/2410.16268v1)|[link](https://github.com/mark12ding/sam2long)| +|**2024-10-21**|**3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors**|Xi Liu et.al.|[2410.16266v1](http://arxiv.org/abs/2410.16266v1)|null| +|**2024-10-21**|**Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance**|Zhangwei Gao et.al.|[2410.16261v1](http://arxiv.org/abs/2410.16261v1)|[link](https://github.com/opengvlab/internvl)| +|**2024-10-21**|**Characterizing the Effect of Electrode Shift & Sensor Reapplication on Common sEMG Features in Lower Limb Muscles**|Fraser Douglas et.al.|[2410.16262v1](http://arxiv.org/abs/2410.16262v1)|null| +|**2024-10-21**|**Multi-product Zeno effect with higher order convergence rates**|Tim Möbus et.al.|[2410.16260v1](http://arxiv.org/abs/2410.16260v1)|null| +|**2024-10-21**|**Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos**|Gengshan Yang et.al.|[2410.16259v1](http://arxiv.org/abs/2410.16259v1)|null| +|**2024-10-21**|**Elucidating the design space of language models for image generation**|Xuantong Liu et.al.|[2410.16257v1](http://arxiv.org/abs/2410.16257v1)|null| +|**2024-10-21**|**Revisiting Deep Feature Reconstruction for Logical and Structural Industrial Anomaly Detection**|Sukanya Patra et.al.|[2410.16255v1](http://arxiv.org/abs/2410.16255v1)|[link](https://github.com/sukanyapatra1997/ulsad-2024)| +|**2024-10-21**|**Loss of 12 Starlink Satellites Due to Pre-conditioning of Intense Space Weather Activity Surrounding the Extreme Geomagnetic Storm of 10 May 2024**|Ayisha M Ashruf et.al.|[2410.16254v1](http://arxiv.org/abs/2410.16254v1)|null| +|**2024-10-21**|**Can Knowledge Editing Really Correct Hallucinations?**|Baixiang Huang et.al.|[2410.16251v1](http://arxiv.org/abs/2410.16251v1)|[link](https://github.com/llm-editing/HalluEditBench)| +|**2024-10-21**|**Cups and Gates I: Cohomology invariants and logical quantum operations**|Nikolas P. Breuckmann et.al.|[2410.16250v1](http://arxiv.org/abs/2410.16250v1)|null| +|**2024-10-21**|**Composing Optimized Stepsize Schedules for Gradient Descent**|Benjamin Grimmer et.al.|[2410.16249v1](http://arxiv.org/abs/2410.16249v1)|null| +|**2024-10-21**|**Hyperparameter Optimisation in Deep Learning from Ensemble Methods: Applications to Proton Structure**|Juan Cruz-Martinez et.al.|[2410.16248v1](http://arxiv.org/abs/2410.16248v1)|null| +|**2024-10-21**|**Separations in query complexity for total search problems**|Shalev Ben-David et.al.|[2410.16245v1](http://arxiv.org/abs/2410.16245v1)|null| +|**2024-10-21**|**Semiconductor Circuits for Quantum Computing with Electronic Wave Packets**|David Pomaranski et.al.|[2410.16244v1](http://arxiv.org/abs/2410.16244v1)|null| +|**2024-10-21**|**Serendipitous detection of an intense X-ray flare in the weak-line T Tauri star KM Ori with SRG/eROSITA**|Savithri H. Ezhikode et.al.|[2410.16241v1](http://arxiv.org/abs/2410.16241v1)|null| +|**2024-10-21**|**Deep Radiomics Detection of Clinically Significant Prostate Cancer on Multicenter MRI: Initial Comparison to PI-RADS Assessment**|G. A. Nketiah et.al.|[2410.16238v1](http://arxiv.org/abs/2410.16238v1)|null| +|**2024-10-21**|**IBGP: Imperfect Byzantine Generals Problem for Zero-Shot Robustness in Communicative Multi-Agent Systems**|Yihuan Mao et.al.|[2410.16237v1](http://arxiv.org/abs/2410.16237v1)|null| +|**2024-10-21**|**LLaVA-KD: A Framework of Distilling Multimodal Large Language Models**|Yuxuan Cai et.al.|[2410.16236v1](http://arxiv.org/abs/2410.16236v1)|null| +|**2024-10-21**|**ToW: Thoughts of Words Improve Reasoning in Large Language Models**|Zhikun Xu et.al.|[2410.16235v1](http://arxiv.org/abs/2410.16235v1)|null| +|**2024-10-21**|**Using a quantum SWAP engine to experimentally validate thermodynamic uncertainty relations**|Krishna Shende et.al.|[2410.16230v1](http://arxiv.org/abs/2410.16230v1)|null| +|**2024-10-21**|**Building A Coding Assistant via the Retrieval-Augmented Language Model**|Xinze Li et.al.|[2410.16229v1](http://arxiv.org/abs/2410.16229v1)|null| +|**2024-10-21**|**Lipschitz Stability of Travel Time Data**|Joonas Ilmavirta et.al.|[2410.16224v1](http://arxiv.org/abs/2410.16224v1)|null| +|**2024-10-21**|**A Realistic Threat Model for Large Language Model Jailbreaks**|Valentyn Boreiko et.al.|[2410.16222v1](http://arxiv.org/abs/2410.16222v1)|null| +|**2024-10-21**|**PuLsE: Accurate and Robust Ultrasound-based Continuous Heart-Rate Monitoring on a Wrist-Worn IoT Device**|Marco Giordano et.al.|[2410.16219v1](http://arxiv.org/abs/2410.16219v1)|null| +|**2024-10-21**|**Hikita surjectivity for $\mathcal N /// T$**|Linus Setiabrata et.al.|[2410.16217v1](http://arxiv.org/abs/2410.16217v1)|null| + +### SLAM +|Publish Date|Title|Authors|PDF|Code| +| :---: | :---: | :---: | :---: | :---: | +|**2024-10-19**|**EndoMetric: Near-light metric scale monocular SLAM**|Raúl Iranzo et.al.|[2410.15065v1](http://arxiv.org/abs/2410.15065v1)|null| +|**2024-10-17**|**Automatic Navigation and Voice Cloning Technology Deployment on a Humanoid Robot**|Dongkun Han et.al.|[2410.13612v1](http://arxiv.org/abs/2410.13612v1)|null| +|**2024-10-17**|**TRLO: An Efficient LiDAR Odometry with 3D Dynamic Object Tracking and Removal**|Yanpeng Jia et.al.|[2410.13240v1](http://arxiv.org/abs/2410.13240v1)|null| +|**2024-10-16**|**QueensCAMP: an RGB-D dataset for robust Visual SLAM**|Hudson M. S. Bruno et.al.|[2410.12520v1](http://arxiv.org/abs/2410.12520v1)|[link](https://github.com/larocs/queenscamp-dataset)| +|**2024-10-16**|**PAPL-SLAM: Principal Axis-Anchored Monocular Point-Line SLAM**|Guanghao Li et.al.|[2410.12324v2](http://arxiv.org/abs/2410.12324v2)|null| +|**2024-10-16**|**Towards Autonomous Indoor Parking: A Globally Consistent Semantic SLAM System and A Semantic Localization Subsystem**|Yichen Sha et.al.|[2410.12169v1](http://arxiv.org/abs/2410.12169v1)|null| +|**2024-10-15**|**V3D-SLAM: Robust RGB-D SLAM in Dynamic Environments with 3D Semantic Geometry Voting**|Tuan Dang et.al.|[2410.12068v1](http://arxiv.org/abs/2410.12068v1)|[link](https://github.com/tuantdang/v3d-slam)| +|**2024-10-15**|**GSORB-SLAM: Gaussian Splatting SLAM benefits from ORB features and Transmittance information**|Wancai Zheng et.al.|[2410.11356v1](http://arxiv.org/abs/2410.11356v1)|null| +|**2024-10-15**|**Multiview Scene Graph**|Juexiao Zhang et.al.|[2410.11187v1](http://arxiv.org/abs/2410.11187v1)|null| +|**2024-10-14**|**MLP-SLAM: Multilayer Perceptron-Based Simultaneous Localization and Mapping With a Dynamic and Static Object Discriminator**|Taozhe Li et.al.|[2410.10669v1](http://arxiv.org/abs/2410.10669v1)|null| +|**2024-10-13**|**Markerless Aerial-Terrestrial Co-Registration of Forest Point Clouds using a Deformable Pose Graph**|Benoit Casseau et.al.|[2410.09896v1](http://arxiv.org/abs/2410.09896v1)|null| +|**2024-10-12**|**SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs**|Wenxi Chen et.al.|[2410.09503v1](http://arxiv.org/abs/2410.09503v1)|[link](https://github.com/X-LANCE/SLAM-LLM)| +|**2024-10-12**|**An Expeditious Spatial Mean Radiant Temperature Mapping Framework using Visual SLAM and Semantic Segmentation**|Wei Liang et.al.|[2410.09443v1](http://arxiv.org/abs/2410.09443v1)|null| +|**2024-10-12**|**ESVO2: Direct Visual-Inertial Odometry with Stereo Event Cameras**|Junkai Niu et.al.|[2410.09374v1](http://arxiv.org/abs/2410.09374v1)|[link](https://github.com/nail-hnu/esvo2)| +|**2024-10-11**|**Voxel-SLAM: A Complete, Accurate, and Versatile LiDAR-Inertial SLAM System**|Zheng Liu et.al.|[2410.08935v1](http://arxiv.org/abs/2410.08935v1)|[link](https://github.com/hku-mars/Voxel-SLAM)| +|**2024-10-11**|**Optimizing NeRF-based SLAM with Trajectory Smoothness Constraints**|Yicheng He et.al.|[2410.08780v1](http://arxiv.org/abs/2410.08780v1)|null| +|**2024-10-10**|**ROMAN: Open-Set Object Map Alignment for Robust View-Invariant Global Localization**|Mason B. Peterson et.al.|[2410.08262v1](http://arxiv.org/abs/2410.08262v1)|null| +|**2024-10-10**|**IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera**|Jian Huang et.al.|[2410.08107v2](http://arxiv.org/abs/2410.08107v2)|[link](https://github.com/wu-cvgl/inceventgs)| +|**2024-10-08**|**Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching**|Gongxin Yao et.al.|[2410.06285v1](http://arxiv.org/abs/2410.06285v1)|null| +|**2024-10-08**|**Submodular Optimization for Keyframe Selection & Usage in SLAM**|David Thorne et.al.|[2410.05576v1](http://arxiv.org/abs/2410.05576v1)|null| +|**2024-10-07**|**SharpSLAM: 3D Object-Oriented Visual SLAM with Deblurring for Agile Drones**|Denis Davletshin et.al.|[2410.05405v1](http://arxiv.org/abs/2410.05405v1)|null| +|**2024-10-07**|**Enhanced Multi-Robot SLAM System with Cross-Validation Matching and Exponential Threshold Keyframe Selection**|Ang He et.al.|[2410.05017v1](http://arxiv.org/abs/2410.05017v1)|null| +|**2024-10-05**|**A Framework for Reproducible Benchmarking and Performance Diagnosis of SLAM Systems**|Nikola Radulov et.al.|[2410.04242v1](http://arxiv.org/abs/2410.04242v1)|[link](https://github.com/nikolaradulov/slamfuse)| +|**2024-10-05**|**High-Speed Stereo Visual SLAM for Low-Powered Computing Devices**|Ashish Kumar et.al.|[2410.04090v1](http://arxiv.org/abs/2410.04090v1)|null| +|**2024-10-04**|**Estimating Body and Hand Motion in an Ego-sensed World**|Brent Yi et.al.|[2410.03665v2](http://arxiv.org/abs/2410.03665v2)|null| +|**2024-10-04**|**EvenNICER-SLAM: Event-based Neural Implicit Encoding SLAM**|Shi Chen et.al.|[2410.03812v1](http://arxiv.org/abs/2410.03812v1)|null| +|**2024-10-03**|**LiDAR Inertial Odometry And Mapping Using Learned Registration-Relevant Features**|Zihao Dong et.al.|[2410.02961v1](http://arxiv.org/abs/2410.02961v1)|null| +|**2024-10-02**|**ReFeree: Radar-Based Lightweight and Robust Localization using Feature and Free space**|Hogyun Kim et.al.|[2410.01325v1](http://arxiv.org/abs/2410.01325v1)|null| +|**2024-10-01**|**Under Pressure: Altimeter-Aided ICP for 3D Maps Consistency**|William Dubois et.al.|[2410.00758v1](http://arxiv.org/abs/2410.00758v1)|null| +|**2024-10-01**|**CaRtGS: Computational Alignment for Real-Time Gaussian Splatting SLAM**|Dapeng Feng et.al.|[2410.00486v2](http://arxiv.org/abs/2410.00486v2)|[link](https://github.com/DapengFeng/cartgs)| + +### Visual Localization +|Publish Date|Title|Authors|PDF|Code| +| :---: | :---: | :---: | :---: | :---: | +|**2024-10-21**|**Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos**|Gengshan Yang et.al.|[2410.16259v1](http://arxiv.org/abs/2410.16259v1)|null| +|**2024-10-21**|**A new method of reconstructing images of gamma-ray telescopes applied to the LST-1 of CTAO**|CTA-LST Project et.al.|[2410.16042v1](http://arxiv.org/abs/2410.16042v1)|null| +|**2024-10-21**|**Diffusion Transformer Policy**|Zhi Hou et.al.|[2410.15959v1](http://arxiv.org/abs/2410.15959v1)|null| +|**2024-10-21**|**CamI2V: Camera-Controlled Image-to-Video Diffusion Model**|Guangcong Zheng et.al.|[2410.15957v1](http://arxiv.org/abs/2410.15957v1)|[link](https://github.com/ZGCTroy/CamI2V)| +|**2024-10-21**|**MINFLUX -- molecular resolution with minimal photons**|Lukas Scheiderer et.al.|[2410.15902v1](http://arxiv.org/abs/2410.15902v1)|null| +|**2024-10-21**|**R2I-rPPG: A Robust Region of Interest Selection Method for Remote Photoplethysmography to Extract Heart Rate**|Sandeep Nagar et.al.|[2410.15851v1](http://arxiv.org/abs/2410.15851v1)|null| +|**2024-10-21**|**Kaninfradet3D:A Road-side Camera-LiDAR Fusion 3D Perception Model based on Nonlinear Feature Extraction and Intrinsic Correlation**|Pei Liu et.al.|[2410.15814v1](http://arxiv.org/abs/2410.15814v1)|null| +|**2024-10-21**|**MSGField: A Unified Scene Representation Integrating Motion, Semantics, and Geometry for Robotic Manipulation**|Yu Sheng et.al.|[2410.15730v1](http://arxiv.org/abs/2410.15730v1)|null| +|**2024-10-21**|**Exploring Stronger Transformer Representation Learning for Occluded Person Re-Identificatio**|Zhangjian Ji et.al.|[2410.15613v1](http://arxiv.org/abs/2410.15613v1)|null| +|**2024-10-20**|**Exploring Curriculum Learning for Vision-Language Tasks: A Study on Small-Scale Multimodal Training**|Rohan Saha et.al.|[2410.15509v1](http://arxiv.org/abs/2410.15509v1)|null| +|**2024-10-20**|**Event-based Sensor Fusion and Application on Odometry: A Survey**|Jiaqiang Zhang et.al.|[2410.15480v1](http://arxiv.org/abs/2410.15480v1)|null| +|**2024-10-20**|**EF-3DGS: Event-Aided Free-Trajectory 3D Gaussian Splatting**|Bohao Liao et.al.|[2410.15392v1](http://arxiv.org/abs/2410.15392v1)|null| +|**2024-10-20**|**The Massive and Distant Clusters of WISE Survey 2: Second Data Release**|Khunanon Thongkham et.al.|[2410.15303v1](http://arxiv.org/abs/2410.15303v1)|null| +|**2024-10-19**|**Low-cost Robust Night-time Aerial Material Segmentation through Hyperspectral Data and Sparse Spatio-Temporal Learning**|Chandrajit Bajaj et.al.|[2410.15208v1](http://arxiv.org/abs/2410.15208v1)|null| +|**2024-10-19**|**EndoMetric: Near-light metric scale monocular SLAM**|Raúl Iranzo et.al.|[2410.15065v1](http://arxiv.org/abs/2410.15065v1)|null| +|**2024-10-19**|**Wave (from) Polarized Light Learning (WPLL) method: high resolution spatio-temporal measurements of water surface waves in laboratory setups**|Noam Ginio et.al.|[2410.14988v1](http://arxiv.org/abs/2410.14988v1)|null| +|**2024-10-19**|**Subversive Characters and Stereotyping Readers: Characterizing Queer Relationalities with Dialogue-Based Relation Extraction**|Kent K. Chang et.al.|[2410.14978v1](http://arxiv.org/abs/2410.14978v1)|null| +|**2024-10-19**|**3D Multi-Object Tracking Employing MS-GLMB Filter for Autonomous Driving**|Linh Van Ma et.al.|[2410.14977v1](http://arxiv.org/abs/2410.14977v1)|[link](https://github.com/linh-gist/ms-glmb-nuScenes)| +|**2024-10-19**|**AugInsert: Learning Robust Visual-Force Policies via Data Augmentation for Object Assembly Tasks**|Ryan Diaz et.al.|[2410.14968v1](http://arxiv.org/abs/2410.14968v1)|null| +|**2024-10-18**|**Neural Real-Time Recalibration for Infrared Multi-Camera Systems**|Benyamin Mehmandar et.al.|[2410.14505v1](http://arxiv.org/abs/2410.14505v1)|[link](https://github.com/theictlab/neural-recalibration)| +|**2024-10-18**|**FashionR2R: Texture-preserving Rendered-to-Real Image Translation with Diffusion Models**|Rui Hu et.al.|[2410.14429v1](http://arxiv.org/abs/2410.14429v1)|null| +|**2024-10-18**|**Variable Aperture Bokeh Rendering via Customized Focal Plane Guidance**|Kang Chen et.al.|[2410.14400v1](http://arxiv.org/abs/2410.14400v1)|[link](https://github.com/motong-ai-studio/vabm)| +|**2024-10-18**|**Non-Invasive Qualitative Vibration Analysis using Event Camera**|Dwijay Bane et.al.|[2410.14364v1](http://arxiv.org/abs/2410.14364v1)|null| +|**2024-10-18**|**Evaluating the evaluators: Towards human-aligned metrics for missing markers reconstruction**|Taras Kucherenko et.al.|[2410.14334v1](http://arxiv.org/abs/2410.14334v1)|null| +|**2024-10-18**|**Towards High-Speed Passive Visible Light Communication with Event Cameras and Digital Micro-Mirrors**|Yanxiang Wang et.al.|[2410.14228v2](http://arxiv.org/abs/2410.14228v2)|null| +|**2024-10-18**|**MambaSCI: Efficient Mamba-UNet for Quad-Bayer Patterned Video Snapshot Compressive Imaging**|Zhenghao Pan et.al.|[2410.14214v1](http://arxiv.org/abs/2410.14214v1)|null| +|**2024-10-18**|**MediTOD: An English Dialogue Dataset for Medical History Taking with Comprehensive Annotations**|Vishal Vivek Saley et.al.|[2410.14204v1](http://arxiv.org/abs/2410.14204v1)|null| +|**2024-10-17**|**UniDrive: Towards Universal Driving Perception Across Camera Configurations**|Ye Li et.al.|[2410.13864v1](http://arxiv.org/abs/2410.13864v1)|[link](https://github.com/ywyeli/unidrive)| +|**2024-10-17**|**VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding**|Runsen Xu et.al.|[2410.13860v1](http://arxiv.org/abs/2410.13860v1)|[link](https://github.com/openrobotlab/vlm-grounder)| +|**2024-10-17**|**VidPanos: Generative Panoramic Videos from Casual Panning Videos**|Jingwei Ma et.al.|[2410.13832v1](http://arxiv.org/abs/2410.13832v1)|null| + ## Edge Computing ### Secure @@ -1735,18 +1735,21 @@ |**2024-10-21**|**IR finite correlation functions in de Sitter space, a smooth massless limit, and an autonomous equation**|Alexander Kamenshchik et.al.|[2410.16226v1](http://arxiv.org/abs/2410.16226v1)|null| |**2024-10-21**|**A Realistic Threat Model for Large Language Model Jailbreaks**|Valentyn Boreiko et.al.|[2410.16222v1](http://arxiv.org/abs/2410.16222v1)|null| -### Privacy +### Performance |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | +|**2024-10-21**|**Origin of the metal-rich vs. metal-poor globular clusters dichotomies in the Milky Way: A sign of low black hole natal kicks**|Ali Rostami-Shirazi et.al.|[2410.16276v1](http://arxiv.org/abs/2410.16276v1)|null| |**2024-10-21**|**Cosmic Shimmering: the Gravitational Wave Signal of Time-Resolved Cosmic Shear Observations**|Giorgio Mentasti et.al.|[2410.16274v1](http://arxiv.org/abs/2410.16274v1)|null| |**2024-10-21**|**FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors**|Chin-Yang Lin et.al.|[2410.16271v1](http://arxiv.org/abs/2410.16271v1)|null| |**2024-10-21**|**MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors**|Honghua Chen et.al.|[2410.16272v1](http://arxiv.org/abs/2410.16272v1)|null| +|**2024-10-21**|**Reflection-Bench: probing AI intelligence with reflection**|Lingyu Li et.al.|[2410.16270v1](http://arxiv.org/abs/2410.16270v1)|[link](https://github.com/yabyum/reflectionbench)| |**2024-10-21**|**Imaging supermoire relaxation and conductive domain walls in helical trilayer graphene**|Jesse C. Hoke et.al.|[2410.16269v1](http://arxiv.org/abs/2410.16269v1)|null| |**2024-10-21**|**SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree**|Shuangrui Ding et.al.|[2410.16268v1](http://arxiv.org/abs/2410.16268v1)|[link](https://github.com/mark12ding/sam2long)| |**2024-10-21**|**xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs**|Michael S. Ryoo et.al.|[2410.16267v1](http://arxiv.org/abs/2410.16267v1)|null| |**2024-10-21**|**3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors**|Xi Liu et.al.|[2410.16266v1](http://arxiv.org/abs/2410.16266v1)|null| |**2024-10-21**|**Quantifying the advantages of applying quantum approximate algorithms to portfolio optimisation**|Haomu Yuan et.al.|[2410.16265v1](http://arxiv.org/abs/2410.16265v1)|null| |**2024-10-21**|**Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance**|Zhangwei Gao et.al.|[2410.16261v1](http://arxiv.org/abs/2410.16261v1)|[link](https://github.com/opengvlab/internvl)| +|**2024-10-21**|**Characterizing the Effect of Electrode Shift & Sensor Reapplication on Common sEMG Features in Lower Limb Muscles**|Fraser Douglas et.al.|[2410.16262v1](http://arxiv.org/abs/2410.16262v1)|null| |**2024-10-21**|**Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos**|Gengshan Yang et.al.|[2410.16259v1](http://arxiv.org/abs/2410.16259v1)|null| |**2024-10-21**|**The microscale organization of directed hypergraphs**|Quintino Francesco Lotito et.al.|[2410.16258v1](http://arxiv.org/abs/2410.16258v1)|null| |**2024-10-21**|**Elucidating the design space of language models for image generation**|Xuantong Liu et.al.|[2410.16257v1](http://arxiv.org/abs/2410.16257v1)|null| @@ -1765,11 +1768,8 @@ |**2024-10-21**|**ToW: Thoughts of Words Improve Reasoning in Large Language Models**|Zhikun Xu et.al.|[2410.16235v1](http://arxiv.org/abs/2410.16235v1)|null| |**2024-10-21**|**Sketch2Code: Evaluating Vision-Language Models for Interactive Web Design Prototyping**|Ryan Li et.al.|[2410.16232v1](http://arxiv.org/abs/2410.16232v1)|null| |**2024-10-21**|**Building A Coding Assistant via the Retrieval-Augmented Language Model**|Xinze Li et.al.|[2410.16229v1](http://arxiv.org/abs/2410.16229v1)|null| -|**2024-10-21**|**Managing Bandwidth: The Key to Cloud-Assisted Autonomous Driving**|Alexander Krentsel et.al.|[2410.16227v1](http://arxiv.org/abs/2410.16227v1)|null| -|**2024-10-21**|**IR finite correlation functions in de Sitter space, a smooth massless limit, and an autonomous equation**|Alexander Kamenshchik et.al.|[2410.16226v1](http://arxiv.org/abs/2410.16226v1)|null| -|**2024-10-21**|**A Realistic Threat Model for Large Language Model Jailbreaks**|Valentyn Boreiko et.al.|[2410.16222v1](http://arxiv.org/abs/2410.16222v1)|null| -### Trust +### Privacy |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | |**2024-10-21**|**Cosmic Shimmering: the Gravitational Wave Signal of Time-Resolved Cosmic Shear Observations**|Giorgio Mentasti et.al.|[2410.16274v1](http://arxiv.org/abs/2410.16274v1)|null| @@ -1803,21 +1803,18 @@ |**2024-10-21**|**IR finite correlation functions in de Sitter space, a smooth massless limit, and an autonomous equation**|Alexander Kamenshchik et.al.|[2410.16226v1](http://arxiv.org/abs/2410.16226v1)|null| |**2024-10-21**|**A Realistic Threat Model for Large Language Model Jailbreaks**|Valentyn Boreiko et.al.|[2410.16222v1](http://arxiv.org/abs/2410.16222v1)|null| -### Performance +### Efficient |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | -|**2024-10-21**|**Origin of the metal-rich vs. metal-poor globular clusters dichotomies in the Milky Way: A sign of low black hole natal kicks**|Ali Rostami-Shirazi et.al.|[2410.16276v1](http://arxiv.org/abs/2410.16276v1)|null| |**2024-10-21**|**Cosmic Shimmering: the Gravitational Wave Signal of Time-Resolved Cosmic Shear Observations**|Giorgio Mentasti et.al.|[2410.16274v1](http://arxiv.org/abs/2410.16274v1)|null| |**2024-10-21**|**FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors**|Chin-Yang Lin et.al.|[2410.16271v1](http://arxiv.org/abs/2410.16271v1)|null| |**2024-10-21**|**MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors**|Honghua Chen et.al.|[2410.16272v1](http://arxiv.org/abs/2410.16272v1)|null| -|**2024-10-21**|**Reflection-Bench: probing AI intelligence with reflection**|Lingyu Li et.al.|[2410.16270v1](http://arxiv.org/abs/2410.16270v1)|[link](https://github.com/yabyum/reflectionbench)| |**2024-10-21**|**Imaging supermoire relaxation and conductive domain walls in helical trilayer graphene**|Jesse C. Hoke et.al.|[2410.16269v1](http://arxiv.org/abs/2410.16269v1)|null| |**2024-10-21**|**SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree**|Shuangrui Ding et.al.|[2410.16268v1](http://arxiv.org/abs/2410.16268v1)|[link](https://github.com/mark12ding/sam2long)| |**2024-10-21**|**xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs**|Michael S. Ryoo et.al.|[2410.16267v1](http://arxiv.org/abs/2410.16267v1)|null| |**2024-10-21**|**3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors**|Xi Liu et.al.|[2410.16266v1](http://arxiv.org/abs/2410.16266v1)|null| |**2024-10-21**|**Quantifying the advantages of applying quantum approximate algorithms to portfolio optimisation**|Haomu Yuan et.al.|[2410.16265v1](http://arxiv.org/abs/2410.16265v1)|null| |**2024-10-21**|**Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance**|Zhangwei Gao et.al.|[2410.16261v1](http://arxiv.org/abs/2410.16261v1)|[link](https://github.com/opengvlab/internvl)| -|**2024-10-21**|**Characterizing the Effect of Electrode Shift & Sensor Reapplication on Common sEMG Features in Lower Limb Muscles**|Fraser Douglas et.al.|[2410.16262v1](http://arxiv.org/abs/2410.16262v1)|null| |**2024-10-21**|**Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos**|Gengshan Yang et.al.|[2410.16259v1](http://arxiv.org/abs/2410.16259v1)|null| |**2024-10-21**|**The microscale organization of directed hypergraphs**|Quintino Francesco Lotito et.al.|[2410.16258v1](http://arxiv.org/abs/2410.16258v1)|null| |**2024-10-21**|**Elucidating the design space of language models for image generation**|Xuantong Liu et.al.|[2410.16257v1](http://arxiv.org/abs/2410.16257v1)|null| @@ -1835,9 +1832,12 @@ |**2024-10-21**|**LLaVA-KD: A Framework of Distilling Multimodal Large Language Models**|Yuxuan Cai et.al.|[2410.16236v1](http://arxiv.org/abs/2410.16236v1)|null| |**2024-10-21**|**ToW: Thoughts of Words Improve Reasoning in Large Language Models**|Zhikun Xu et.al.|[2410.16235v1](http://arxiv.org/abs/2410.16235v1)|null| |**2024-10-21**|**Sketch2Code: Evaluating Vision-Language Models for Interactive Web Design Prototyping**|Ryan Li et.al.|[2410.16232v1](http://arxiv.org/abs/2410.16232v1)|null| +|**2024-10-21**|**A Quantum Optimization Algorithm for Optimal Electric Vehicle Charging Station Placement for Intercity Trips**|Tina Radvand et.al.|[2410.16231v1](http://arxiv.org/abs/2410.16231v1)|null| |**2024-10-21**|**Building A Coding Assistant via the Retrieval-Augmented Language Model**|Xinze Li et.al.|[2410.16229v1](http://arxiv.org/abs/2410.16229v1)|null| +|**2024-10-21**|**Managing Bandwidth: The Key to Cloud-Assisted Autonomous Driving**|Alexander Krentsel et.al.|[2410.16227v1](http://arxiv.org/abs/2410.16227v1)|null| +|**2024-10-21**|**IR finite correlation functions in de Sitter space, a smooth massless limit, and an autonomous equation**|Alexander Kamenshchik et.al.|[2410.16226v1](http://arxiv.org/abs/2410.16226v1)|null| -### Scalability +### Trust |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | |**2024-10-21**|**Cosmic Shimmering: the Gravitational Wave Signal of Time-Resolved Cosmic Shear Observations**|Giorgio Mentasti et.al.|[2410.16274v1](http://arxiv.org/abs/2410.16274v1)|null| @@ -1848,7 +1848,6 @@ |**2024-10-21**|**xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs**|Michael S. Ryoo et.al.|[2410.16267v1](http://arxiv.org/abs/2410.16267v1)|null| |**2024-10-21**|**3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors**|Xi Liu et.al.|[2410.16266v1](http://arxiv.org/abs/2410.16266v1)|null| |**2024-10-21**|**Quantifying the advantages of applying quantum approximate algorithms to portfolio optimisation**|Haomu Yuan et.al.|[2410.16265v1](http://arxiv.org/abs/2410.16265v1)|null| -|**2024-10-21**|**Surface acoustic waves Brillouin photonics on a silicon nitride chip**|Yvan Klaver et.al.|[2410.16263v1](http://arxiv.org/abs/2410.16263v1)|null| |**2024-10-21**|**Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance**|Zhangwei Gao et.al.|[2410.16261v1](http://arxiv.org/abs/2410.16261v1)|[link](https://github.com/opengvlab/internvl)| |**2024-10-21**|**Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos**|Gengshan Yang et.al.|[2410.16259v1](http://arxiv.org/abs/2410.16259v1)|null| |**2024-10-21**|**The microscale organization of directed hypergraphs**|Quintino Francesco Lotito et.al.|[2410.16258v1](http://arxiv.org/abs/2410.16258v1)|null| @@ -1870,19 +1869,23 @@ |**2024-10-21**|**Building A Coding Assistant via the Retrieval-Augmented Language Model**|Xinze Li et.al.|[2410.16229v1](http://arxiv.org/abs/2410.16229v1)|null| |**2024-10-21**|**Managing Bandwidth: The Key to Cloud-Assisted Autonomous Driving**|Alexander Krentsel et.al.|[2410.16227v1](http://arxiv.org/abs/2410.16227v1)|null| |**2024-10-21**|**IR finite correlation functions in de Sitter space, a smooth massless limit, and an autonomous equation**|Alexander Kamenshchik et.al.|[2410.16226v1](http://arxiv.org/abs/2410.16226v1)|null| +|**2024-10-21**|**A Realistic Threat Model for Large Language Model Jailbreaks**|Valentyn Boreiko et.al.|[2410.16222v1](http://arxiv.org/abs/2410.16222v1)|null| -### Efficient +### Reliability |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | +|**2024-10-21**|**The period-luminosity relation of long-period variables in the Large Magellanic Cloud observed with ATLAS**|Daniel Hey et.al.|[2410.16273v1](http://arxiv.org/abs/2410.16273v1)|null| |**2024-10-21**|**Cosmic Shimmering: the Gravitational Wave Signal of Time-Resolved Cosmic Shear Observations**|Giorgio Mentasti et.al.|[2410.16274v1](http://arxiv.org/abs/2410.16274v1)|null| |**2024-10-21**|**FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors**|Chin-Yang Lin et.al.|[2410.16271v1](http://arxiv.org/abs/2410.16271v1)|null| |**2024-10-21**|**MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors**|Honghua Chen et.al.|[2410.16272v1](http://arxiv.org/abs/2410.16272v1)|null| +|**2024-10-21**|**Reflection-Bench: probing AI intelligence with reflection**|Lingyu Li et.al.|[2410.16270v1](http://arxiv.org/abs/2410.16270v1)|[link](https://github.com/yabyum/reflectionbench)| |**2024-10-21**|**Imaging supermoire relaxation and conductive domain walls in helical trilayer graphene**|Jesse C. Hoke et.al.|[2410.16269v1](http://arxiv.org/abs/2410.16269v1)|null| |**2024-10-21**|**SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree**|Shuangrui Ding et.al.|[2410.16268v1](http://arxiv.org/abs/2410.16268v1)|[link](https://github.com/mark12ding/sam2long)| |**2024-10-21**|**xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs**|Michael S. Ryoo et.al.|[2410.16267v1](http://arxiv.org/abs/2410.16267v1)|null| |**2024-10-21**|**3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors**|Xi Liu et.al.|[2410.16266v1](http://arxiv.org/abs/2410.16266v1)|null| |**2024-10-21**|**Quantifying the advantages of applying quantum approximate algorithms to portfolio optimisation**|Haomu Yuan et.al.|[2410.16265v1](http://arxiv.org/abs/2410.16265v1)|null| |**2024-10-21**|**Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance**|Zhangwei Gao et.al.|[2410.16261v1](http://arxiv.org/abs/2410.16261v1)|[link](https://github.com/opengvlab/internvl)| +|**2024-10-21**|**Characterizing the Effect of Electrode Shift & Sensor Reapplication on Common sEMG Features in Lower Limb Muscles**|Fraser Douglas et.al.|[2410.16262v1](http://arxiv.org/abs/2410.16262v1)|null| |**2024-10-21**|**Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos**|Gengshan Yang et.al.|[2410.16259v1](http://arxiv.org/abs/2410.16259v1)|null| |**2024-10-21**|**The microscale organization of directed hypergraphs**|Quintino Francesco Lotito et.al.|[2410.16258v1](http://arxiv.org/abs/2410.16258v1)|null| |**2024-10-21**|**Elucidating the design space of language models for image generation**|Xuantong Liu et.al.|[2410.16257v1](http://arxiv.org/abs/2410.16257v1)|null| @@ -1900,26 +1903,21 @@ |**2024-10-21**|**LLaVA-KD: A Framework of Distilling Multimodal Large Language Models**|Yuxuan Cai et.al.|[2410.16236v1](http://arxiv.org/abs/2410.16236v1)|null| |**2024-10-21**|**ToW: Thoughts of Words Improve Reasoning in Large Language Models**|Zhikun Xu et.al.|[2410.16235v1](http://arxiv.org/abs/2410.16235v1)|null| |**2024-10-21**|**Sketch2Code: Evaluating Vision-Language Models for Interactive Web Design Prototyping**|Ryan Li et.al.|[2410.16232v1](http://arxiv.org/abs/2410.16232v1)|null| -|**2024-10-21**|**A Quantum Optimization Algorithm for Optimal Electric Vehicle Charging Station Placement for Intercity Trips**|Tina Radvand et.al.|[2410.16231v1](http://arxiv.org/abs/2410.16231v1)|null| |**2024-10-21**|**Building A Coding Assistant via the Retrieval-Augmented Language Model**|Xinze Li et.al.|[2410.16229v1](http://arxiv.org/abs/2410.16229v1)|null| -|**2024-10-21**|**Managing Bandwidth: The Key to Cloud-Assisted Autonomous Driving**|Alexander Krentsel et.al.|[2410.16227v1](http://arxiv.org/abs/2410.16227v1)|null| -|**2024-10-21**|**IR finite correlation functions in de Sitter space, a smooth massless limit, and an autonomous equation**|Alexander Kamenshchik et.al.|[2410.16226v1](http://arxiv.org/abs/2410.16226v1)|null| -### Reliability +### Scalability |Publish Date|Title|Authors|PDF|Code| | :---: | :---: | :---: | :---: | :---: | -|**2024-10-21**|**The period-luminosity relation of long-period variables in the Large Magellanic Cloud observed with ATLAS**|Daniel Hey et.al.|[2410.16273v1](http://arxiv.org/abs/2410.16273v1)|null| |**2024-10-21**|**Cosmic Shimmering: the Gravitational Wave Signal of Time-Resolved Cosmic Shear Observations**|Giorgio Mentasti et.al.|[2410.16274v1](http://arxiv.org/abs/2410.16274v1)|null| |**2024-10-21**|**FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors**|Chin-Yang Lin et.al.|[2410.16271v1](http://arxiv.org/abs/2410.16271v1)|null| |**2024-10-21**|**MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors**|Honghua Chen et.al.|[2410.16272v1](http://arxiv.org/abs/2410.16272v1)|null| -|**2024-10-21**|**Reflection-Bench: probing AI intelligence with reflection**|Lingyu Li et.al.|[2410.16270v1](http://arxiv.org/abs/2410.16270v1)|[link](https://github.com/yabyum/reflectionbench)| |**2024-10-21**|**Imaging supermoire relaxation and conductive domain walls in helical trilayer graphene**|Jesse C. Hoke et.al.|[2410.16269v1](http://arxiv.org/abs/2410.16269v1)|null| |**2024-10-21**|**SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree**|Shuangrui Ding et.al.|[2410.16268v1](http://arxiv.org/abs/2410.16268v1)|[link](https://github.com/mark12ding/sam2long)| |**2024-10-21**|**xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs**|Michael S. Ryoo et.al.|[2410.16267v1](http://arxiv.org/abs/2410.16267v1)|null| |**2024-10-21**|**3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors**|Xi Liu et.al.|[2410.16266v1](http://arxiv.org/abs/2410.16266v1)|null| |**2024-10-21**|**Quantifying the advantages of applying quantum approximate algorithms to portfolio optimisation**|Haomu Yuan et.al.|[2410.16265v1](http://arxiv.org/abs/2410.16265v1)|null| +|**2024-10-21**|**Surface acoustic waves Brillouin photonics on a silicon nitride chip**|Yvan Klaver et.al.|[2410.16263v1](http://arxiv.org/abs/2410.16263v1)|null| |**2024-10-21**|**Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance**|Zhangwei Gao et.al.|[2410.16261v1](http://arxiv.org/abs/2410.16261v1)|[link](https://github.com/opengvlab/internvl)| -|**2024-10-21**|**Characterizing the Effect of Electrode Shift & Sensor Reapplication on Common sEMG Features in Lower Limb Muscles**|Fraser Douglas et.al.|[2410.16262v1](http://arxiv.org/abs/2410.16262v1)|null| |**2024-10-21**|**Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos**|Gengshan Yang et.al.|[2410.16259v1](http://arxiv.org/abs/2410.16259v1)|null| |**2024-10-21**|**The microscale organization of directed hypergraphs**|Quintino Francesco Lotito et.al.|[2410.16258v1](http://arxiv.org/abs/2410.16258v1)|null| |**2024-10-21**|**Elucidating the design space of language models for image generation**|Xuantong Liu et.al.|[2410.16257v1](http://arxiv.org/abs/2410.16257v1)|null| @@ -1938,6 +1936,8 @@ |**2024-10-21**|**ToW: Thoughts of Words Improve Reasoning in Large Language Models**|Zhikun Xu et.al.|[2410.16235v1](http://arxiv.org/abs/2410.16235v1)|null| |**2024-10-21**|**Sketch2Code: Evaluating Vision-Language Models for Interactive Web Design Prototyping**|Ryan Li et.al.|[2410.16232v1](http://arxiv.org/abs/2410.16232v1)|null| |**2024-10-21**|**Building A Coding Assistant via the Retrieval-Augmented Language Model**|Xinze Li et.al.|[2410.16229v1](http://arxiv.org/abs/2410.16229v1)|null| +|**2024-10-21**|**Managing Bandwidth: The Key to Cloud-Assisted Autonomous Driving**|Alexander Krentsel et.al.|[2410.16227v1](http://arxiv.org/abs/2410.16227v1)|null| +|**2024-10-21**|**IR finite correlation functions in de Sitter space, a smooth massless limit, and an autonomous equation**|Alexander Kamenshchik et.al.|[2410.16226v1](http://arxiv.org/abs/2410.16226v1)|null| ### Edge Computing |Publish Date|Title|Authors|PDF|Code|