New submissions for Fri, 23 Jun 23 #381
Labels
abstract meaning representation
argument mining
citation context analysis
computational social science
contrastive
cross-language information retrieval
cross-lingual information retrieval
data augmentation
extreme multi-label
knowledge discovery
knowledge graph
legal text
legal
mixup
multi-task
paraphrase
passage generation
plagiarism
robustness
scholarly document processing
scholarly
semantic similarity
similarity measure
simplification
summarization
text generation
Keyword: contrastive
SituatedGen: Incorporating Geographical and Temporal Contexts into Generative Commonsense Reasoning
Authors: Yunxiang Zhang, Xiaojun WanArxiv: https://arxiv.org/abs/2306.12552
TLDR: Recently, commonsense reasoning in text generation has attracted much attention. Generative commonsense thinking is the task that requires machines, given a group of keywords, to compose a single coherent sentence with commonsense plausibility. While existing datasets targeting generative commonsenses reasoning focus on everyday scenarios, it is unclear how well machines reason under specific geographical and temporal contexts. We formalize this challenging task as SituatedGen, where machines with commonsenses should generate a pair of contrastive sentences given a
Repo: None
NoRefER: a Referenceless Quality Metric for Automatic Speech Recognition via Semi-Supervised Language Model Fine-Tuning with Contrastive Learning
Authors: Kamer Ali Yuksel, Thiago Ferreira, Golara Javadi, Mohamed El-Badrashiny, Ahmet GunduzArxiv: https://arxiv.org/abs/2306.12577
TLDR: This paper introduces NoRefER, a novel referenceless quality metric for automatic speech recognition (ASR) systems. Traditional reference-based metrics for evaluating ASR systems require costly ground-truth transcripts. NoRefer overcomes this limitation by fine-tuning a multilingual language model for pair-wise ranking ASR hypotheses using contrastive learning with Siamese network architecture. The self-supervised NoRef ER exploits the known quality relationships between hypotheses from multiple compression levels of an
Repo: None
Keyword: data augmentation
Off the Radar: Uncertainty-Aware Radar Place Recognition with Introspective Querying and Map Maintenance
Authors: Jianhao Yuan, Paul Newman, Matthew GaddArxiv: https://arxiv.org/abs/2306.12556
TLDR: Localisation with Frequency-Modulated Continuous-Wave (FMCW) radar has gained increasing interest due to its inherent resistance to challenging environments. However, complex artefacts of the radar measurement process require appropriate uncertainty estimation to ensure the safe and reliable application of this promising sensor modality. In this work, we propose a multi-session map management system which constructs the best maps for further localisation based on learned variance properties in an embedding space. Using the same variance properties, we
Repo: None
Beyond OOD State Actions: Supported Cross-Domain Offline Reinforcement Learning
Authors: Jinxin Liu, Ziqi Zhang, Zhenyu Wei, Zifeng Zhuang, Yachen Kang, Sibo Gai, Donglin WangArxiv: https://arxiv.org/abs/2306.12755
TLDR: Offline reinforcement learning (RL) aims to learn a policy using only pre-collected and fixed data. Although avoiding the time-consuming online interactions in RL, it poses challenges for out-of-distribution (OOD) state actions and often suffers from data inefficiency for training. Despite many efforts being devoted to addressing OOD state actions, the latter (data inefficiency) receives little attention in offline RL. To address this, this paper proposes the cross-domain offline RL,
Repo: None
AugDMC: Data Augmentation Guided Deep Multiple Clustering
Authors: Jiawei Yao, Enbei Liu, Maham Rashid, Juhua HuArxiv: https://arxiv.org/abs/2306.13023
TLDR: Clustering aims to group similar objects together while separating dissimilar ones apart. Thereafter, structures hidden in data can be identified to help understand data in an unsupervised manner. Traditional clustering methods such as k-means provide only a single clustering for one data set. Deep clustering algorithms such as auto-encoder based clustering techniques have shown a better performance, but still provide a single clusterering. However, a given dataset might have multiple clustering structures
Repo: None
Data augmentation for recommender system: A semi-supervised approach using maximum margin matrix factorization
Authors: Shamal Shaikh, Venkateswara Rao Kagita, Vikas Kumar, Arun K PujariArxiv: https://arxiv.org/abs/2306.13050
TLDR: Collaborative filtering (CF) has become a popular method for developing recommender systems (RS) where ratings of a user for new items is predicted based on her past preferences and available preference information of other users. Despite the popularity of CF-based methods, their performance is often greatly limited by the sparsity of observed entries. In this study, we explore the data augmentation and refinement aspects of Maximum Margin Matrix Factorization (MMMF), a widely accepted CF technique for the
Repo: None
Keyword: knowledge graph
Explainable Representations for Relation Prediction in Knowledge Graphs
Authors: Rita T. Sousa, Sara Silva, Catia PesquitaArxiv: https://arxiv.org/abs/2306.12687
TLDR: Knowledge graphs represent real-world entities and their relations in a semantically-rich structure supported by ontologies. Exploring this data with machine learning methods often relies on knowledge graph embeddings, which produce latent representations of entities that preserve structural and local graph neighbourhood properties, but sacrifice explainability. However, in tasks such as link or relation prediction, understanding which specific features better explain a relation is crucial to support complex or critical applications. We propose SEEK, a novel approach for
Repo: None
Otter-Knowledge: benchmarks of multimodal knowledge graph representation learning from different sources for drug discovery
Authors: Hoang Thanh Lam, Marco Luca Sbodio, Marcos Martínez Gallindo, Mykhaylo Zayats, Raúl Fernández-Díaz, Víctor Valls, Gabriele Picco, Cesar Berrospi Ramis, Vanessa LópezArxiv: https://arxiv.org/abs/2306.12802
TLDR: Recent research in representation learning utilizes large databases of proteins or molecules to acquire knowledge of drug and protein structures through unsupervised learning techniques. These pre-trained representations have proven to significantly enhance the accuracy of subsequent tasks, such as predicting the affinity between drugs and target proteins. In this study, we demonstrate that by incorporating knowledge graphs from diverse sources and modalities into the sequences or SMILES representation, we can further enrich the representation and achieve state-of-the-art results on
Repo: None
Keyword: legal
Designing Individualized Policy and Technology Interventions to Improve Gig Work Conditions
Authors: Jane Hsieh, Oluwatobi Adisa, Sachi Bafna, Haiyi ZhuArxiv: https://arxiv.org/abs/2306.12972
TLDR: The gig economy is characterized by short-term contract work completed by independent workers who are paid to perform "gigs", and who have control over when, whether and how they conduct work. Gig economy platforms (e.g., Uber, Lyft, Instacart) offer workers increased job opportunities, lower barriers to entry, and improved flexibility. However, growing evidence suggests that worker well-being and gig work conditions have become significant societal issues. In designing public-facing policies and technologies
Repo: None
Keyword: multi-task
Multi-Task Learning with Loop Specific Attention for CDR Structure Prediction
Authors: Eleni Giovanoudi, Dimitrios RafailidisArxiv: https://arxiv.org/abs/2306.13045
TLDR: The Complementarity Determining Region (CDR) structure prediction of loops in antibody engineering has gained a lot of attraction by researchers. When designing antibodies, a main challenge is to predict the CDR structure of the H3 loop. Compared with the other CDR loops, that is the H1 and H2 loops, and the H2 loop is more challenging due to its varying length and flexible structure. In this paper, we propose a Multi-task learning model with Loop
Repo: None
Keyword: robustness
Verifying Global Neural Network Specifications using Hyperproperties
Authors: David Boetius, Stefan LeueArxiv: https://arxiv.org/abs/2306.12495
TLDR: Current approaches to neural network verification focus on specifications that target small regions around known input data points, such as local robustness. Thus, using these approaches, we can not obtain guarantees for inputs that are not close to known inputs. Yet, it is highly likely that a neural network will encounter such truly unseen inputs during its application. We study global specifications that - when satisfied - provide guarantees for all potential inputs. We introduce a hyperproperty formalism that allows for expressing global specifications such as
Repo: None
DGC-GNN: Descriptor-free Geometric-Color Graph Neural Network for 2D-3D Matching
Authors: Shuzhe Wang, Juho Kannala, Daniel BarathArxiv: https://arxiv.org/abs/2306.12547
TLDR: Direct matching of 2D keypoints in an input image to a 3D point cloud of the scene without requiring visual descriptors has garnered increased interest due to its lower memory requirements, inherent privacy preservation, and reduced need for expensive 3D model maintenance compared to visual descriptor-based methods. However, existing algorithms often compromise on performance, resulting in a significant deterioration compared to their descriptor-less counterparts. In this paper, we introduce DGC-GNN, a novel algorithm that employs a
Repo: None
Neural Spectro-polarimetric Fields
Authors: Youngchan Kim, Wonjoon Jin, Sunghyun Cho, Seung-Hwan BaekArxiv: https://arxiv.org/abs/2306.12562
TLDR: Modeling the spatial radiance distribution of light rays in a scene has been extensively explored for applications, including view synthesis. Spectrum and polarization, the wave properties of light, are often neglected due to their integration into three RGB spectral bands and their non-perceptibility to human vision. Despite this, these properties encompass substantial material and geometric information about a scene. In this work, we propose to model spectro-polarimetric fields, the spatial Stokes-vector distribution of
Repo: None
DP-BREM: Differentially-Private and Byzantine-Robust Federated Learning with Client Momentum
Authors: Xiaolan Gu, Ming Li, Li XiongArxiv: https://arxiv.org/abs/2306.12608
TLDR: Federated Learning (FL) allows multiple participating clients to train machine learning models collaboratively by keeping their datasets local and only exchanging the gradient or model updates with a coordinating server. Existing FL protocols were shown to be vulnerable to attacks that aim to compromise data privacy and/or model robustness. Recently proposed defenses focused on ensuring either privacy or robustness, but not both. In this paper, we focus on simultaneously achieving differential privacy (DP) and Byzantine robustness for cross-
Repo: None
RobustNeuralNetworks.jl: a Package for Machine Learning and Data-Driven Control with Certified Robustness
Authors: Nicholas H. Barbara, Max Revay, Ruigang Wang, Jing Cheng, Ian R. ManchesterArxiv: https://arxiv.org/abs/2306.12612
TLDR: Neural networks are typically sensitive to small input perturbations, leading to unexpected or brittle behaviour. We present RobustNeuralNetworks.jl: a Julia package for neural network models that are constructed to naturally satisfy a set of user-defined robustness constraints. The package is based on the recently proposed Recurrent Equilibrium Network (REN) and Lipschitz-Bounded Deep Network (LBDN) model classes, and is designed to interface directly with Julia
Repo: None
Recent Developments in Recommender Systems: A Survey
Authors: Yang Li, Kangbo Liu, Ranjan Satapathy, Suhang Wang, Erik CambriaArxiv: https://arxiv.org/abs/2306.12680
TLDR: In this technical survey, we comprehensively summarize the latest advancements in the field of recommender systems. The objective of this study is to provide an overview of the current state-of-the-art in the market and highlight the new directions for future research in the fields. The study starts with a comprehensive summary of the main taxonomy of recommend systems, including personalized and group recommender, and then delves into the category of knowledge-based recommender. In addition, the
Repo: None
CEMSSL: A Unified Framework for Multi-Solution Inverse Kinematic Model Learning of Robot Arms with High-Precision Manipulation
Authors: Qu Weiming, Liu Tianlin, Luo DingshengArxiv: https://arxiv.org/abs/2306.12718
TLDR: Multiple solutions mainly originate from the existence of redundant degrees of freedom in the robot arm, which may cause difficulties in inverse model learning, but they can also bring many benefits, such as higher flexibility and robustness. Current multi-solution inverse model training methods rely on conditional deep generative models, yet they often fail to achieve sufficient precision when learning multiple solutions. In this paper, we propose Conditional Embodied Self-Supervised Learning (CEMSSL) for robot arm multi-
Repo: None
Analysis of divergence-preserving unfitted finite element methods for the mixed Poisson problem
Authors: Christoph Lehrenfeld, Tim van Beeck, Igor VoulisArxiv: https://arxiv.org/abs/2306.12722
TLDR: In this paper we present a new H(div)-conforming unfitted finite element method for the mixed Poisson problem which is robust in the cut configuration and preserves conservation properties of body-fitted finite element methods. The key is to formulate the divergence-constraint on the active mesh, instead of the physical domain, in order to obtain robustness with respect to cut configurations without the need for a stabilization that pollutes the mass balance. This change in the formulation results in a
Repo: None
On the Robustness of Generative Retrieval Models: An Out-of-Distribution Perspective
Authors: Yu-An Liu, Ruqing Zhang, Jiafeng Guo, Wei Chen, Xueqi ChengArxiv: https://arxiv.org/abs/2306.12756
TLDR: Recently, we have witnessed generative retrieval increasingly gaining attention in the information retrieval (IR) field, which retrieves documents by directly generating their identifiers. So far, much effort has been devoted to developing effective generative retrieving models. There has been less attention paid to the robustness perspective. When a new retrieval paradigm enters into the real-world application, it is also critical to measure the out-of-distribution (OOD) generalization, i.e., how would gener
Repo: None
Overview of Robust and Multilingual Automatic Evaluation Metrics for Open-Domain Dialogue Systems at DSTC 11 Track 4
Authors: Mario Rodríguez-Cantelar, Chen Zhang, Chengguang Tang, Ke Shi, Sarik Ghazarian, João Sedoc, Luis Fernando D'Haro, Alexander RudnickyArxiv: https://arxiv.org/abs/2306.12794
TLDR: The advent and fast development of neural networks have revolutionized the research on dialogue systems and subsequently have triggered various challenges regarding their automatic evaluation. Automatic evaluation of open-domain dialogue systems as an open challenge has been the center of the attention of many researchers. Despite the consistent efforts to improve automatic metrics' correlations with human evaluation, there have been very few attempts to assess their robustness over multiple domains and dimensions. Also, their focus is mainly on the English language. All of these challenges prompt
Repo: None
Decentralized Multi-Agent Reinforcement Learning with Global State Prediction
Authors: Joshua Bloom, Pranjal Paliwal, Apratim Mukherjee, Carlo PinciroliArxiv: https://arxiv.org/abs/2306.12926
TLDR: Deep reinforcement learning (DRL) has seen remarkable success in the control of single robots. However, applying DRL to robot swarms presents significant challenges. A critical challenge is non-stationarity, which occurs when two or more robots update individual or shared policies concurrently, thereby engaging in an interdependent training process with no guarantees of convergence. Circumventing non-Stationarity typically involves training the robots with global information about other agents' states and/or actions. In contrast,
Repo: None
Robust Semantic Segmentation: Strong Adversarial Attacks and Fast Training of Robust Models
Authors: Francesco Croce, Naman D Singh, Matthias HeinArxiv: https://arxiv.org/abs/2306.12941
TLDR: While a large amount of work has focused on designing adversarial attacks against image classifiers, only a few methods exist to attack semantic segmentation models. We show that attacking segmentation model presents task-specific challenges, for which we propose novel solutions. Our final evaluation protocol outperforms existing methods, and shows that those can overestimate the robustness of the models. Additionally, so far adversarial training, the most successful way for obtaining robust image classifier, could not be successfully applied
Repo: None
Rate-Splitting Multiple Access for 6G Networks: Ten Promising Scenarios and Applications
Authors: Jeonghun Park, Byungju Lee, Jinseok Choi, Hoon Lee, Namyoon Lee, Seok-Hwan Park, Kyoung-Jae Lee, Junil Choi, Sung Ho Chae, Sang-Woon Jeon, Kyung Sup Kwak, Bruno Clerckx, Wonjae ShinArxiv: https://arxiv.org/abs/2306.12978
TLDR: In the upcoming 6G era, multiple access (MA) will play an essential role in achieving high throughput performances required in a wide range of wireless applications. Since MA and interference management are closely related issues, the conventional MA techniques are limited in that they cannot provide near-optimal performance in universal interference regimes. Recently, rate-splitting multi access (RSMA) has been gaining much attention. RSMA splits an individual message into two parts: a common part, decodable
Repo: None
Keyword: scholarly
Critical-Reflective Human-AI Collaboration: Exploring Computational Tools for Art Historical Image Retrieval
Authors: Katrin Glinka, Claudia Müller-BirnArxiv: https://arxiv.org/abs/2306.12843
TLDR: Just as other disciplines, the humanities explore how computational research approaches and tools can meaningfully contribute to scholarly knowledge production. We approach the design of computational tools through the analytical lens of 'human-AI collaboration.' However, there is no generalizable concept of what constitutes 'meaningful' human-AI collaborate. In terms of genuinely human competencies, we consider criticality and reflection as guiding principles of scholarly knowledge. Although (designing for) reflection is a recurring topic in CSCW
Repo: None
Designing Individualized Policy and Technology Interventions to Improve Gig Work Conditions
Authors: Jane Hsieh, Oluwatobi Adisa, Sachi Bafna, Haiyi ZhuArxiv: https://arxiv.org/abs/2306.12972
TLDR: The gig economy is characterized by short-term contract work completed by independent workers who are paid to perform "gigs", and who have control over when, whether and how they conduct work. Gig economy platforms (e.g., Uber, Lyft, Instacart) offer workers increased job opportunities, lower barriers to entry, and improved flexibility. However, growing evidence suggests that worker well-being and gig work conditions have become significant societal issues. In designing public-facing policies and technologies
Repo: None
Keyword: summarization
Cross-lingual Cross-temporal Summarization: Dataset, Models, Evaluation
Authors: Ran Zhang, Jihed Ouni, Steffen EgerArxiv: https://arxiv.org/abs/2306.12916
TLDR: While summarization has been extensively researched in natural language processing (NLP), cross-lingual cross-temporal summarization (CLCTS) is a largely unexplored area that has the potential to improve cross-cultural accessibility, information sharing, and understanding. This paper comprehensively addresses the CLCTS task, including dataset creation, modeling, and evaluation. We build the first CLCTs corpus, leveraging historical fictive texts and Wikipedia summaries in English and German, and
Repo: None
Keyword: text generation
SituatedGen: Incorporating Geographical and Temporal Contexts into Generative Commonsense Reasoning
Authors: Yunxiang Zhang, Xiaojun WanArxiv: https://arxiv.org/abs/2306.12552
TLDR: Recently, commonsense reasoning in text generation has attracted much attention. Generative commonsense thinking is the task that requires machines, given a group of keywords, to compose a single coherent sentence with commonsense plausibility. While existing datasets targeting generative commonsenses reasoning focus on everyday scenarios, it is unclear how well machines reason under specific geographical and temporal contexts. We formalize this challenging task as SituatedGen, where machines with commonsenses should generate a pair of contrastive sentences given a
Repo: None
The text was updated successfully, but these errors were encountered: