Skip to content

Commit

Permalink
Process metadata corrections for 2025.genaidetect-1.31 (closes #4544)
Browse files Browse the repository at this point in the history
  • Loading branch information
mjpost committed Feb 9, 2025
1 parent 884f776 commit c9372e6
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion data/xml/2025.genaidetect.xml
Original file line number Diff line number Diff line change
Expand Up @@ -373,7 +373,7 @@
</paper>
<paper id="31">
<title><fixed-case>I</fixed-case>ntegrity<fixed-case>AI</fixed-case> at <fixed-case>G</fixed-case>en<fixed-case>AI</fixed-case> Detection Task 2: Detecting Machine-Generated Academic Essays in <fixed-case>E</fixed-case>nglish and <fixed-case>A</fixed-case>rabic Using <fixed-case>ELECTRA</fixed-case> and Stylometry</title>
<author><first>Mohammad</first><last>ALSmadi</last></author>
<author><first>Mohammad</first><last>AL-Smadi</last></author>
<pages>284–289</pages>
<abstract>We present a robust system for detecting machine-generated academic essays, leveraging pre-trained, transformer-based models specifically tailored for both English and Arabic texts. Our primary approach utilizes ELECTRA-Small for English and AraELECTRA-Base for Arabic, fine-tuned to deliver high performance while balancing computational efficiency. By incorporating stylometric features, such as word count, sentence length, and vocabulary richness, our models excel at distinguishing between human-written and AI-generated content. Proposed models achieved excellent results with an F1- score of 99.7%, ranking second among of 26 teams in the English subtask, and 98.4%, finishing first out of 23 teams in the Arabic one. Main Contributions include: (1) We develop lightweight and efficient models using ELECTRA-Small and AraELECTRA-Base, achieving an impressive F1-score of 98.5% on the English dataset and 98.4% on the Arabic dataset. This demonstrates the power of combining transformer-based architectures with stylometric analysis. (2) We optimize our system to maintain high performance while being computationally efficient, making it suitable for deployment on GPUs with moderate memory capacity. (3) Additionally, we tested larger models, such as ELECTRA-Large, achieving an even higher F1-score of 99.7% on the English dataset, highlighting the potential for further accuracy gains when using more computationally intensive models.</abstract>
<url hash="dad5d53f">2025.genaidetect-1.31</url>
Expand Down

0 comments on commit c9372e6

Please sign in to comment.