New submissions for Thu, 29 Jun 23 #385

e-tornike · 2023-06-29T03:07:23Z

Keyword: contrastive

ConKI: Contrastive Knowledge Injection for Multimodal Sentiment Analysis

Authors: Yakun Yu, Mingjun Zhao, Shi-ang Qi, Feiran Sun, Baoxun Wang, Weidong Guo, Xiaoli Wang, Lei Yang, Di Niu
Arxiv: https://arxiv.org/abs/2306.15796
TLDR: Multimodal Sentiment Analysis leverages multimodal signals to detect the sentiment of a speaker. Previous approaches concentrate on performing multimodual fusion and representation learning based on general knowledge obtained from pretrained models, which neglects the effect of domain-specific knowledge. In this paper, we propose Contrastive Knowledge Injection (ConKI) for multimodality sentiment analysis, where specific-knowledge representations for each modality can be learned together with general knowledge representations via knowledge injection based
Repo: None
GraSS: Contrastive Learning with Gradient Guided Sampling Strategy for Remote Sensing Image Semantic Segmentation
Authors: Zhaoyang Zhang, Zhen Ren, Chao Tao, Yunsheng Zhang, Chengli Peng, Haifeng Li
Arxiv: https://arxiv.org/abs/2306.15868
TLDR: Self-supervised contrastive learning (SSCL) has achieved significant milestones in remote sensing image (RSI) understanding. Its essence lies in designing an unsupervised instance discrimination based semantic segmentation task to extract image features from a large number of unlabeled images that are beneficial for downstream tasks. However, existing instances discrimination based SSCL suffer from two limitations when applied to the RSI semantic segmentations task: 1) Positive sample confounding issue; 2) Feature adaptation bias.
Repo: None
Hybrid Distillation: Connecting Masked Autoencoders with Contrastive Learners
Authors: Bowen Shi, Xiaopeng Zhang, Yaoming Wang, Jin Li, Wenrui Dai, Junni Zou, Hongkai Xiong, Qi Tian
Arxiv: https://arxiv.org/abs/2306.15876
TLDR: Representation learning has been evolving from traditional supervised training to Contrastive Learning (CL) and Masked Image Modeling (MIM). Previous works have demonstrated their pros and cons in specific scenarios, i.e., CL and supervised pre-training excel at capturing longer-range global patterns and enabling better feature discrimination, while MIM can introduce more local and diverse attention across all transformer layers. In this paper, we explore how to obtain a model that combines their strengths. We start by
Repo: None
Subclass-balancing Contrastive Learning for Long-tailed Recognition
Authors: Chengkai Hou, Jieyu Zhang, Haonan Wang, Tianyi Zhou
Arxiv: https://arxiv.org/abs/2306.15925
TLDR: Long-tailed recognition with imbalanced class distribution naturally emerges in practical machine learning applications. Existing methods such as data reweighing, resampling, and supervised contrastive learning enforce the class balance with a price of introducing imbalance between instances of head class and tail class, which may ignore the underlying rich semantic substructures of the former and exaggerate the biases in the latter. We overcome these drawbacks by a novel ``subclass-balancing contrastivelearning (SBC
Repo: None
Multi-network Contrastive Learning Based on Global and Local Representations
Authors: Weiquan Li, Xianzhong Long, Yun Li
Arxiv: https://arxiv.org/abs/2306.15930
TLDR: The popularity of self-supervised learning has made it possible to train models without relying on labeled data, which saves expensive annotation costs. However, most existing self-Supervised contrastive learning methods often overlook the combination of global and local feature information. This paper proposes a multi-network contrastive training framework based on global andlocal representations. We introduce global and global feature information for self-submitted contrastive learners through multiple networks. The model learns feature information at different scales of an
Repo: None
Challenges of Zero-Shot Recognition with Vision-Language Models: Granularity and Correctness
Authors: Zhenlin Xu, Yi Zhu, Tiffany Deng, Abhay Mittal, Yanbei Chen, Manchen Wang, Paolo Favaro, Joseph Tighe, Davide Modolo
Arxiv: https://arxiv.org/abs/2306.16048
TLDR: This paper investigates the challenges of applying vision-language models (VLMs) to zero-shot visual recognition tasks in an open-world setting, with a focus on contrastive vision- language models such as CLIP. We first examine the performance of VLMs on concepts of different granularity levels. We propose a way to fairly evaluate the performance discrepancy under two experimental setups and find that VlMs are better at recognizing fine-grained concepts. Furthermore, we find that the
Repo: None
Semantic Positive Pairs for Enhancing Contrastive Instance Discrimination
Authors: Mohammad Alkhalefi, Georgios Leontidis, Mingjun Zhong
Arxiv: https://arxiv.org/abs/2306.16122
TLDR: Self-supervised learning algorithms based on instance discrimination effectively prevent representation collapse and produce promising results in representation learning. However, the process of attracting positive pairs (i.e., two views of the same instance) in the embedding space and repelling all other instances (i, negative pairs) irrespective of their categories could result in discarding important features. To address this issue, we propose an approach to identifying those images with similar semantic content and treating them as positive instances, named semantic
Repo: None

Keyword: data augmentation

Graph Interpolation via Fast Fused-Gromovization

Authors: Xinyu Ma, Xu Chu, Yasha Wang, Yang Lin, Junfeng Zhao, Liantao Ma, Wenwu Zhu
Arxiv: https://arxiv.org/abs/2306.15963
TLDR: Graph data augmentation has proven to be effective in enhancing the generalizability and robustness of graph neural networks (GNNs) for graph-level classifications. However, existing methods mainly focus on augmenting the graph signal space and the graph structure space independently, overlooking their joint interaction. This paper addresses this limitation by formulating the problem as an optimal transport problem that aims to find an optimal strategy for matching nodes between graphs considering the interactions between graph structures and signals. To tackle
Repo: None
Pseudo-Bag Mixup Augmentation for Multiple Instance Learning Based Whole Slide Image Classification
Authors: Pei Liu, Luping Ji, Xinyu Zhang, Feng Ye
Arxiv: https://arxiv.org/abs/2306.16180
TLDR: Given the special situation of modeling gigapixel images, multiple instance learning (MIL) has become one of the most important frameworks for Whole Slide Image (WSI) classification. In current practice, most MIL networks often face two unavoidable problems in training: i) insufficient WSI data, and ii) the data memorization nature inherent in neural networks. These problems may hinder MIL models from adequate and efficient training, suppressing the continuous performance promotion of classification models on WSIs. Inspired by the
Repo: None

Keyword: knowledge discovery

Defining data science: a new field of inquiry

Authors: Michael L Brodie
Arxiv: https://arxiv.org/abs/2306.16177
TLDR: Data science is not a science. It is a research paradigm. Its power, scope, and scale will surpass science, our most powerful research paradigm, to enable knowledge discovery and change our world. We have yet to understand and define it, vital to realizing its potential and managing its risks. Modern data science is in its infancy. Emerging slowly since 1962 and rapidly since 2000, it is a fundamentally new field of inquiry, one of the most active, powerful, and rapidly evolving 21st
Repo: None

Keyword: knowledge graph

Knowledge-Enhanced Hierarchical Information Correlation Learning for Multi-Modal Rumor Detection

Authors: Jiawei Liu, Jingyi Xie, Fanrui Zhang, Qiang Zhang, Zheng-jun Zha
Arxiv: https://arxiv.org/abs/2306.15946
TLDR: The explosive growth of rumors with text and images on social media platforms has drawn great attention. Existing studies have made significant contributions to cross-modal information interaction and fusion, but they fail to fully explore hierarchical and complex semantic correlation across different modality content, severely limiting their performance on detecting multi-Modal rumor. In this work, we propose a novel knowledge-enhanced hierarchical information correlation learning approach (KhiCL) for multi-modual rumor detection by jointly modeling the
Repo: None
Enhancing Dialogue Generation via Dynamic Graph Knowledge Aggregation
Authors: Chen Tang, Hongbo Zhang, Tyler Loakman, Chenghua Lin, Frank Guerin
Arxiv: https://arxiv.org/abs/2306.16195
TLDR: Incorporating external graph knowledge into neural chatbot models has been proven effective for enhancing dialogue generation. However, in conventional graph neural networks (GNNs), message passing on a graph is independent from text, resulting in the graph representation hidden space differing from that of the text. This training regime of existing models therefore leads to a semantic gap between graph knowledge and text. In this study, we propose a novel framework for knowledge graph enhanced dialogue generation with pseudo nodes to involve the language
Repo: None
Relevant Entity Selection: Knowledge Graph Bootstrapping via Zero-Shot Analogical Pruning
Authors: Lucas Jarnac, Miguel Couceiro, Pierre Monnin
Arxiv: https://arxiv.org/abs/2306.16296
TLDR: Knowledge Graph Construction (KGC) can be seen as an iterative process starting from a high quality nucleus that is refined by knowledge extraction approaches in a virtuous loop. Such a nucleus can be obtained from knowledge existing in an open KG like Wikidata. However, due to the size of such generic KGs, integrating them as a whole may entail irrelevant content and scalability issues. We propose an analogy-based approach that starts from seed entities of interest in a generic K
Repo: None

Keyword: legal

ChatLaw: Open-Source Legal Large Language Model with Integrated External Knowledge Bases

Authors: Jiaxi Cui, Zongjian Li, Yang Yan, Bohua Chen, Li Yuan
Arxiv: https://arxiv.org/abs/2306.16092
TLDR: Large Language Models (LLMs) have shown the potential to revolutionize natural language processing tasks in various domains, sparking great interest in vertical-specific large models. However, unlike proprietary models such as BloombergGPT and FinGPT, which have leveraged their unique data accumulations to make strides in the finance domain, there hasn't not many similar large language models in the Chinese legal domain to facilitate its digital transformation. In this paper, we propose an open-source legal large language
Repo: None

Keyword: mixup

Dimension Independent Mixup for Hard Negative Sample in Collaborative Filtering

Authors: Xi Wu, Liangwei Yang, Jibing Gong, Chao Zhou, Tianyu Lin, Xiaolong Liu, Philip S. Yu
Arxiv: https://arxiv.org/abs/2306.15905
TLDR: Collaborative filtering (CF) is a widely employed technique that predicts user preferences based on past interactions. Negative sampling plays a vital role in training CF-based models with implicit feedback. In this paper, we propose a novel perspective based on the sampling area to revisit existing sampling methods. We point out that current sampling methods mainly focus on Point-wise or Line-wise sampling, lacking flexibility and leaving a significant portion of the hard sampling area un-explored. To address this limitation
Repo: None
Graph Interpolation via Fast Fused-Gromovization
Authors: Xinyu Ma, Xu Chu, Yasha Wang, Yang Lin, Junfeng Zhao, Liantao Ma, Wenwu Zhu
Arxiv: https://arxiv.org/abs/2306.15963
TLDR: Graph data augmentation has proven to be effective in enhancing the generalizability and robustness of graph neural networks (GNNs) for graph-level classifications. However, existing methods mainly focus on augmenting the graph signal space and the graph structure space independently, overlooking their joint interaction. This paper addresses this limitation by formulating the problem as an optimal transport problem that aims to find an optimal strategy for matching nodes between graphs considering the interactions between graph structures and signals. To tackle
Repo: None
Pseudo-Bag Mixup Augmentation for Multiple Instance Learning Based Whole Slide Image Classification
Authors: Pei Liu, Luping Ji, Xinyu Zhang, Feng Ye
Arxiv: https://arxiv.org/abs/2306.16180
TLDR: Given the special situation of modeling gigapixel images, multiple instance learning (MIL) has become one of the most important frameworks for Whole Slide Image (WSI) classification. In current practice, most MIL networks often face two unavoidable problems in training: i) insufficient WSI data, and ii) the data memorization nature inherent in neural networks. These problems may hinder MIL models from adequate and efficient training, suppressing the continuous performance promotion of classification models on WSIs. Inspired by the
Repo: None

Keyword: multi-task

Sentence-to-Label Generation Framework for Multi-task Learning of Japanese Sentence Classification and Named Entity Recognition

Authors: Chengguang Gan, Qinghao Zhang, Tatsunori Mori
Arxiv: https://arxiv.org/abs/2306.15978
TLDR: Information extraction(IE) is a crucial subfield within natural language processing. In this study, we introduce a Sentence Classification and Named Entity Recognition Multi-task (SCNM) approach that combines Sentence-to-Label Generation (NER) with Sentence Recognition (SC) to improve the accuracy of SC and NER-labels. In the present study, the authors describe the process of SCNM, NER, and SCNM. We develop a Sent
Repo: None
An Adversarial Multi-Task Learning Method for Chinese Text Correction with Semantic Detection
Authors: Fanyu Wang, Zhenping Xie
Arxiv: https://arxiv.org/abs/2306.16313
TLDR: Text correction, especially the semantic correction of more widely used scenes, is strongly required to improve, for the fluency and writing efficiency of the text. An adversarial multi-task learning method is proposed to enhance the modeling and detection ability of character polysemy in Chinese sentence context. Wherein, two models, the masked language model and scoring language model, are introduced as a pair of not only coupled but also adversarial learning tasks. Moreover, the Monte Carlo tree search strategy and
Repo: None

Keyword: plagiarism

A systematic literature review on source code similarity measurement and clone detection: techniques, applications, and challenges

Authors: Morteza Zakeri-Nasrabadi, Saeed Parsa, Mohammad Ramezani, Chanchal Roy, Masoud Ekhtiarzadeh
Arxiv: https://arxiv.org/abs/2306.16171
TLDR: Measuring and evaluating source code similarity is a fundamental software engineering activity that embraces a broad range of applications, including but not limited to code recommendation, duplicate code, plagiarism, malware, and smell detection. This paper proposes a systematic literature review and meta-analysis on code similarity measurement and evaluation techniques to shed light on the existing approaches and their characteristics in different applications. We initially found over 10000 articles by querying four digital libraries and ended up with 136 primary studies in the field. The
Repo: None

Keyword: robustness

Toward Mesh-Invariant 3D Generative Deep Learning with Geometric Measures

Authors: Thomas Besnier, Sylvain Arguillère, Emery Pierson, Mohamed Daoudi
Arxiv: https://arxiv.org/abs/2306.15762
TLDR: 3D generative modeling is accelerating as the technology allowing the capture of geometric data is developing. However, the acquired data is often inconsistent, resulting in unregistered meshes or point clouds. Many generative learning algorithms require correspondence between each point when comparing the predicted shape and the target shape. We propose an architecture able to cope with different parameterizations, even during the training phase. In particular, our loss function is built upon a kernel-based metric over a representation of meshes using geometric measures
Repo: None
MAT: Mixed-Strategy Game of Adversarial Training in Fine-tuning
Authors: Zhehua Zhong, Tianyi Chen, Zhen Wang
Arxiv: https://arxiv.org/abs/2306.15826
TLDR: Fine-tuning large-scale pre-trained language models has been demonstrated effective for various natural language processing (NLP) tasks. Previous studies have established that incorporating adversarial training during the fine-tuned stage can significantly enhance model generalization and robustness. However, from the perspective of game theory, such utilizations of adversarial Training correspond to pure-strategy games, which are inherently limited in terms of the scope of their strategies, thereby still having room for improvement.
Repo: None
Hierarchical Graph Neural Networks for Proprioceptive 6D Pose Estimation of In-hand Objects
Authors: Alireza Rezazadeh, Snehal Dikhale, Soshi Iba, Nawid Jamali
Arxiv: https://arxiv.org/abs/2306.15858
TLDR: Robotic manipulation, in particular in-hand object manipulation, often requires an accurate estimate of the object's 6D pose. To improve the accuracy of the estimated pose, state-of-the-art approaches in 6D object pose estimation use observational data from one or more modalities, e.g., RGB images, depth, and tactile readings. However, existing approaches make limited use of the underlying geometric structure of the objects captured by these modalities. This results in poor performance
Repo: None
Human Sensing via Passive Spectrum Monitoring
Authors: Huaizheng Mu, Liangqi Yuan, Jia Li
Arxiv: https://arxiv.org/abs/2306.15893
TLDR: Human sensing is significantly improving our lifestyle in many fields such as elderly healthcare and public safety. Research has demonstrated that human activity can alter the passive radio frequency (PRF) spectrum, which represents the passive reception of RF signals in the surrounding environment without actively transmitting a target signal. This paper proposes a novel passive human sensing method that utilizes PRF spectrum alteration as a biometrics modality for human authentication, localization, and activity recognition. The proposed method uses software-defined radio (S
Repo: None
Content-Aware Quantization Index Modulation:Leveraging Data Statistics for Enhanced Image Watermarking
Authors: Jun-Long Mao, Hui-Yi Tang, Shan-Xiang Lyu, Zheng-Chun Zhou, Xiao-Chun Cao
Arxiv: https://arxiv.org/abs/2306.15896
TLDR: Image watermarking techniques have continuously evolved to address new challenges and incorporate advanced features. The advent of data-driven approaches has enabled the processing and analysis of large volumes of data, extracting valuable insights and patterns. In this paper, we propose two content-aware quantization index modulation (QIM) algorithms: Content-Aware QIM (CA-QIM”) and Content-Assisted Minimum Distortion Qim (CAMD-Qim). These algorithms aim to
Repo: None
Bridging the Gap: Neural Collapse Inspired Prompt Tuning for Generalization under Class Imbalance
Authors: Didi Zhu, Yinchuan Li, Min Zhang, Junkun Yuan, Jiashuo Liu, Kun Kuang, Chao Wu
Arxiv: https://arxiv.org/abs/2306.15955
TLDR: Large-scale vision-language (V-L) models have demonstrated remarkable generalization capabilities for downstream tasks through prompt tuning. However, their performance suffers significantly in the presence of class imbalance, a common issue in real-world scenarios. In this paper, we investigate the effects of class imbalanced on the generalization performance of V-L models and extend Neural Collapse phenomenon to these models, revealing the geometric reasons behind the impact of class mismatch on their generalization ability. To address
Repo: None
Graph Interpolation via Fast Fused-Gromovization
Authors: Xinyu Ma, Xu Chu, Yasha Wang, Yang Lin, Junfeng Zhao, Liantao Ma, Wenwu Zhu
Arxiv: https://arxiv.org/abs/2306.15963
TLDR: Graph data augmentation has proven to be effective in enhancing the generalizability and robustness of graph neural networks (GNNs) for graph-level classifications. However, existing methods mainly focus on augmenting the graph signal space and the graph structure space independently, overlooking their joint interaction. This paper addresses this limitation by formulating the problem as an optimal transport problem that aims to find an optimal strategy for matching nodes between graphs considering the interactions between graph structures and signals. To tackle
Repo: None
Enrollment-stage Backdoor Attacks on Speaker Recognition Systems via Adversarial Ultrasound
Authors: Xinfeng Li, Junning Ze, Chen Yan, Yushi Cheng, Xiaoyu Ji, Wenyuan Xu
Arxiv: https://arxiv.org/abs/2306.16022
TLDR: Automatic Speaker Recognition Systems (SRSs) have been widely used in voice applications for personal identification and access control. A typical SRS consists of three stages, i.e., training, enrollment, and recognition. Previous work has revealed that SRSs can be bypassed by backdoor attacks at the training stage or by adversarial ultrasound modulation at the enrollment stage. In this paper, we propose TUNER, a new type of backdoor attack against the enrolment stage of
Repo: None
Evaluating Similitude and Robustness of Deep Image Denoising Models via Adversarial Attack
Authors: Jie Ning, Yao Li, Zhichang Guo
Arxiv: https://arxiv.org/abs/2306.16050
TLDR: Deep neural networks (DNNs) have a wide range of applications in the field of image denoising, and they are superior to traditional image denosising. However, DNNs inevitably show vulnerability, which is the weak robustness in the face of adversarial attacks. In this paper, we find some similitudes between existing deep image Denoising methods, as they are consistently fooled by adversarial attack. First, denoises-PGD is proposed which
Repo: None
Distributional Modeling for Location-Aware Adversarial Patches
Authors: Xingxing Wei, Shouwei Ruan, Yinpeng Dong, Hang Su
Arxiv: https://arxiv.org/abs/2306.16131
TLDR: Adversarial patch is one of the important forms of performing adversarial attacks in the physical world. To improve the naturalness and aggressiveness of existing adversarial patches, location-aware patches are proposed, where the patch's location on the target object is integrated into the optimization process to perform attacks. Although it is effective, efficiently finding the optimal location for placing the patches is challenging, especially under the black-box attack settings. In this paper, we propose the Distribution-Optim
Repo: None
Mitigating the Accuracy-Robustness Trade-off via Multi-Teacher Adversarial Distillation
Authors: Shiji Zhao, Xizhe Wang, Xingxing Wei
Arxiv: https://arxiv.org/abs/2306.16170
TLDR: Adversarial training is a practical approach for improving the robustness of deep neural networks against adversarial attacks. Although adversarial training has been shown to provide reliable robustness, the performance toward clean examples is negatively affected after adversarial learning, which means a trade-off exists between accuracy and robustness. Recently, some studies have tried to use knowledge distillation methods in adversarial Training, achieving competitive performance in improving the accuracy but the accuracy for clean samples is still limited. In this
Repo: None
FuzzyFlow: Leveraging Dataflow To Find and Squash Program Optimization Bugs
Authors: Philipp Schaad, Timo Schneider, Tal Ben-Nun, Alexandru Calotoiu, Alexandros Nikolaos Ziogas, Torsten Hoefler
Arxiv: https://arxiv.org/abs/2306.16178
TLDR: The current hardware landscape and application scale is driving performance engineers towards writing bespoke optimizations. Verifying such optimizations, and generating minimal failing cases, is important for robustness in the face of changing program conditions, such as inputs and sizes. However, isolation of minimal test-cases from existing applications and generating new configurations are often difficult due to side effects on the system state, mostly related to dataflow. This paper introduces FuzzyFlow: a fault localization and test case extraction framework designed
Repo: None
Pseudo-Bag Mixup Augmentation for Multiple Instance Learning Based Whole Slide Image Classification
Authors: Pei Liu, Luping Ji, Xinyu Zhang, Feng Ye
Arxiv: https://arxiv.org/abs/2306.16180
TLDR: Given the special situation of modeling gigapixel images, multiple instance learning (MIL) has become one of the most important frameworks for Whole Slide Image (WSI) classification. In current practice, most MIL networks often face two unavoidable problems in training: i) insufficient WSI data, and ii) the data memorization nature inherent in neural networks. These problems may hinder MIL models from adequate and efficient training, suppressing the continuous performance promotion of classification models on WSIs. Inspired by the
Repo: None
SE-PQA: Personalized Community Question Answering
Authors: Pranav Kasela, Gabriella Pasi, Raffaele Perego
Arxiv: https://arxiv.org/abs/2306.16261
TLDR: Personalization in Information Retrieval is a topic studied for a long time. Nevertheless, there is still a lack of high-quality, real-world datasets to conduct large-scale experiments and evaluate models for personalized search. This paper contributes to filling this gap by introducing SE-PQA (StackExchange - Personalized Question Answering), a new curated resource to design and evaluate personalized models related to the task of community Question AnSWering (cQA). The
Repo: None
Seeing is Believing: Detecting Sybil Attack in FANET by Matching Visual and Auditory Domains
Authors: Yanpeng Cui, Qixun Zhang, Zhiyong Feng, Xiong Li, Zhiqing Wei, Ping Zhang
Arxiv: https://arxiv.org/abs/2306.16339
TLDR: The flying ad hoc network (FANET) will play a crucial role in the B5G/6G era since it provides wide coverage and on-demand deployment services in a distributed manner. The detection of Sybil attacks is essential to ensure trusted communication in FANET. Nevertheless, the conventional methods only utilize the untrusted information that UAV nodes passively ``heard'' from the ``auditory" domain (AD), resulting in severe communication disruptions and even collision accidents.
Repo: None
MultiZoo & MultiBench: A Standardized Toolkit for Multimodal Deep Learning
Authors: Paul Pu Liang, Yiwei Lyu, Xiang Fan, Arav Agarwal, Yun Cheng, Louis-Philippe Morency, Ruslan Salakhutdinov
Arxiv: https://arxiv.org/abs/2306.16413
TLDR: Learning multimodal representations involves integrating information from multiple heterogeneous sources of data. In order to accelerate progress towards understudied modalities and tasks while ensuring real-world robustness, we release MultiZoo, a public toolkit consisting of standardized implementations of > 20 core multimodial algorithms and MultiBench, a large-scale benchmark spanning 15 datasets, 10 modalities, 20 prediction tasks, and 6 research areas. Together, these provide an automated end-to-end machine
Repo: None
On Practical Aspects of Aggregation Defenses against Data Poisoning Attacks
Authors: Wenxiao Wang, Soheil Feizi
Arxiv: https://arxiv.org/abs/2306.16415
TLDR: The increasing access to data poses both opportunities and risks in deep learning, as one can manipulate the behaviors of deep learning models with malicious training samples. Such attacks are known as data poisoning. Recent advances in defense strategies against data poisoning have highlighted the effectiveness of aggregation schemes in achieving state-of-the-art results in certified poisoning robustness. However, the practical implications of these approaches remain unclear. Here we focus on Deep Partition Aggregation, a representative aggregation defense, and assess its
Repo: None

Keyword: scholarly

OpenCitations Meta

Authors: Arcangelo Massari, Fabio Mariani, Ivan Heibi, Silvio Peroni, David Shotton
Arxiv: https://arxiv.org/abs/2306.16191
TLDR: OpenCitations Meta is a new database that contains bibliographic metadata of scholarly publications involved in citations indexed by the OpenCitations infrastructure. It adheres to Open Science principles and provides data under a CC0 license for maximum reuse. The data can be accessed through a SPARQL endpoint, REST APIs, and dumps. OpenCitation Meta serves three important purposes. Firstly, it enables disambiguation of citations between publications described using different identifiers from various sources. For example,
Repo: None

Keyword: semantic similarity

Large Language Models as Annotators: Enhancing Generalization of NLP Models at Minimal Cost

Authors: Parikshit Bansal, Amit Sharma
Arxiv: https://arxiv.org/abs/2306.15766
TLDR: State-of-the-art supervised NLP algorithms achieve high accuracy but are also susceptible to failures on inputs from low-data regimes, such as domains that are not represented in training data. As an approximation to collecting ground-truth labels for the specific domain, we study the use of large language models (LLMs) for annotating inputs and improving the generalization of NLP models. Specifically, given a budget for LLM annotations, we present an algorithm for sampling the most
Repo: None

Keyword: similarity measure

Toward Mesh-Invariant 3D Generative Deep Learning with Geometric Measures

Authors: Thomas Besnier, Sylvain Arguillère, Emery Pierson, Mohamed Daoudi
Arxiv: https://arxiv.org/abs/2306.15762
TLDR: 3D generative modeling is accelerating as the technology allowing the capture of geometric data is developing. However, the acquired data is often inconsistent, resulting in unregistered meshes or point clouds. Many generative learning algorithms require correspondence between each point when comparing the predicted shape and the target shape. We propose an architecture able to cope with different parameterizations, even during the training phase. In particular, our loss function is built upon a kernel-based metric over a representation of meshes using geometric measures
Repo: None
A systematic literature review on source code similarity measurement and clone detection: techniques, applications, and challenges
Authors: Morteza Zakeri-Nasrabadi, Saeed Parsa, Mohammad Ramezani, Chanchal Roy, Masoud Ekhtiarzadeh
Arxiv: https://arxiv.org/abs/2306.16171
TLDR: Measuring and evaluating source code similarity is a fundamental software engineering activity that embraces a broad range of applications, including but not limited to code recommendation, duplicate code, plagiarism, malware, and smell detection. This paper proposes a systematic literature review and meta-analysis on code similarity measurement and evaluation techniques to shed light on the existing approaches and their characteristics in different applications. We initially found over 10000 articles by querying four digital libraries and ended up with 136 primary studies in the field. The
Repo: None

Keyword: summarization

Leveraging GPT-4 for Food Effect Summarization to Enhance Product-Specific Guidance Development via Iterative Prompting

Authors: Yiwen Shi, Ping Ren, Jing Wang, Biao Han, Taha ValizadehAslani, Felix Agbavor, Yi Zhang, Meng Hu, Liang Zhao, Hualou Liang
Arxiv: https://arxiv.org/abs/2306.16275
TLDR: Food effect summarization from New Drug Application (NDA) is an essential component of product-specific guidance (PSG) development and assessment. However, manual summarization of food effect from extensive drug application review documents is time-consuming, which arouses a need to develop automated methods. Recent advances in large language models (LLMs) such as ChatGPT and GPT-4, have demonstrated great potential in improving the effectiveness of automated text summarization, but its ability regarding
Repo: None
Taqyim: Evaluating Arabic NLP Tasks Using ChatGPT Models
Authors: Zaid Alyafeai, Maged S. Alshaibani, Badr AlKhamissi, Hamzah Luqman, Ebrahim Alareqi, Ali Fadel
Arxiv: https://arxiv.org/abs/2306.16322
TLDR: Large language models (LLMs) have demonstrated impressive performance on various downstream tasks without requiring fine-tuning, including ChatGPT, a chat-based model built on top of LLMs such as GPT-3.5 and GPT, 4.5. Despite having a lower training proportion compared to English, these models also exhibit remarkable capabilities in other languages. In this study, we assess the performance of GPT - 3.5 (GPT-4) and G
Repo: None

Keyword: text generation

Most Language Models can be Poets too: An AI Writing Assistant and Constrained Text Generation Studio

Authors: Allen Roush, Sanjay Basu, Akshay Moorthy, Dmitry Dubovoy
Arxiv: https://arxiv.org/abs/2306.15926
TLDR: Despite rapid advancement in the field of Constrained Natural Language Generation, little time has been spent on exploring the potential of language models which have had their vocabularies lexically, semantically, and/or phonetically constrained. We find that most language models generate compelling text even under significant constraints. We present a simple and universally applicable technique for modifying the output of a language model by compositionally applying filter functions to the language models vocabulary before a unit of text is generated. This
Repo: None
You Can Generate It Again: Data-to-text Generation with Verification and Correction Prompting
Authors: Xuan Ren, Lingqiao Liu
Arxiv: https://arxiv.org/abs/2306.15933
TLDR: Despite significant advancements in existing models, generating text descriptions from structured data input, known as data-to-text generation, remains a challenging task. In this paper, we propose a novel approach that goes beyond traditional one-shot generation methods by introducing a multi-step process consisting of generation, verification, and correction stages. Our approach, VCP(Verification and Correction Prompting), begins with the model generating an initial output. We then proceed to verify the correctness of different aspects of
Repo: None
CBBQ: A Chinese Bias Benchmark Dataset Curated with Human-AI Collaboration for Large Language Models
Authors: Yufei Huang, Deyi Xiong
Arxiv: https://arxiv.org/abs/2306.16244
TLDR: Holistically measuring societal biases of large language models is crucial for detecting and reducing ethical risks in highly capable AI models. In this work, we present a Chinese Bias Benchmark dataset that consists of over 100K questions jointly constructed by human experts and generative language models, covering stereotypes and societal biases in 14 social dimensions related to Chinese culture and values. The curation process contains 4 essential steps: bias identification via extensive literature review, ambiguous context generation, AI-assisted disambiguous context
Repo: None

e-tornike self-assigned this Jun 29, 2023

New submissions for Thu, 29 Jun 23 #385

New submissions for Thu, 29 Jun 23 #385

Comments

e-tornike commented Jun 29, 2023

Keyword: contrastive

ConKI: Contrastive Knowledge Injection for Multimodal Sentiment Analysis

GraSS: Contrastive Learning with Gradient Guided Sampling Strategy for Remote Sensing Image Semantic Segmentation

Hybrid Distillation: Connecting Masked Autoencoders with Contrastive Learners

Subclass-balancing Contrastive Learning for Long-tailed Recognition

Multi-network Contrastive Learning Based on Global and Local Representations

Challenges of Zero-Shot Recognition with Vision-Language Models: Granularity and Correctness

Semantic Positive Pairs for Enhancing Contrastive Instance Discrimination

Keyword: data augmentation

Graph Interpolation via Fast Fused-Gromovization

Pseudo-Bag Mixup Augmentation for Multiple Instance Learning Based Whole Slide Image Classification

Keyword: knowledge discovery

Defining data science: a new field of inquiry

Keyword: knowledge graph

Knowledge-Enhanced Hierarchical Information Correlation Learning for Multi-Modal Rumor Detection

Enhancing Dialogue Generation via Dynamic Graph Knowledge Aggregation

Relevant Entity Selection: Knowledge Graph Bootstrapping via Zero-Shot Analogical Pruning

Keyword: legal

ChatLaw: Open-Source Legal Large Language Model with Integrated External Knowledge Bases

Keyword: mixup

Dimension Independent Mixup for Hard Negative Sample in Collaborative Filtering

Graph Interpolation via Fast Fused-Gromovization

Pseudo-Bag Mixup Augmentation for Multiple Instance Learning Based Whole Slide Image Classification

Keyword: multi-task

Sentence-to-Label Generation Framework for Multi-task Learning of Japanese Sentence Classification and Named Entity Recognition

An Adversarial Multi-Task Learning Method for Chinese Text Correction with Semantic Detection

Keyword: plagiarism

A systematic literature review on source code similarity measurement and clone detection: techniques, applications, and challenges

Keyword: robustness

Toward Mesh-Invariant 3D Generative Deep Learning with Geometric Measures

MAT: Mixed-Strategy Game of Adversarial Training in Fine-tuning

Hierarchical Graph Neural Networks for Proprioceptive 6D Pose Estimation of In-hand Objects

Human Sensing via Passive Spectrum Monitoring

Content-Aware Quantization Index Modulation:Leveraging Data Statistics for Enhanced Image Watermarking

Bridging the Gap: Neural Collapse Inspired Prompt Tuning for Generalization under Class Imbalance

Graph Interpolation via Fast Fused-Gromovization

Enrollment-stage Backdoor Attacks on Speaker Recognition Systems via Adversarial Ultrasound

Evaluating Similitude and Robustness of Deep Image Denoising Models via Adversarial Attack

Distributional Modeling for Location-Aware Adversarial Patches

Mitigating the Accuracy-Robustness Trade-off via Multi-Teacher Adversarial Distillation

FuzzyFlow: Leveraging Dataflow To Find and Squash Program Optimization Bugs

Pseudo-Bag Mixup Augmentation for Multiple Instance Learning Based Whole Slide Image Classification

SE-PQA: Personalized Community Question Answering

Seeing is Believing: Detecting Sybil Attack in FANET by Matching Visual and Auditory Domains

MultiZoo & MultiBench: A Standardized Toolkit for Multimodal Deep Learning

On Practical Aspects of Aggregation Defenses against Data Poisoning Attacks

Keyword: scholarly

OpenCitations Meta

Keyword: semantic similarity

Large Language Models as Annotators: Enhancing Generalization of NLP Models at Minimal Cost

Keyword: similarity measure

Toward Mesh-Invariant 3D Generative Deep Learning with Geometric Measures

A systematic literature review on source code similarity measurement and clone detection: techniques, applications, and challenges

Keyword: summarization

Leveraging GPT-4 for Food Effect Summarization to Enhance Product-Specific Guidance Development via Iterative Prompting

Taqyim: Evaluating Arabic NLP Tasks Using ChatGPT Models

Keyword: text generation

Most Language Models can be Poets too: An AI Writing Assistant and Constrained Text Generation Studio

You Can Generate It Again: Data-to-text Generation with Verification and Correction Prompting

CBBQ: A Chinese Bias Benchmark Dataset Curated with Human-AI Collaboration for Large Language Models