Cracks in The Stack: Hidden Vulnerabilities and Licensing Risks in LLM Pre-Training Datasets

Jahanshahi, M. & Mockus, A.
Accepted at the Second International Workshop on Large Language Models for Code (LLM4Code 2025)
Preprint: https://arxiv.org/abs/2501.02628