
spike(scoring): Investigate Post Scoring System Limitations and Gaming Vulnerabilities #36

Closed
Tracked by #88
teslashibe opened this issue Dec 16, 2024 · 12 comments · May be fixed by #96
teslashibe commented Dec 16, 2024

[SPIKE] Investigate Post Scoring System Limitations and Gaming Vulnerabilities

Problem Statement

The current scoring system (@post_scorer.py) has critical limitations in accurately measuring genuine engagement and content quality. The system's simplistic scoring metrics make it vulnerable to gaming and manipulation, potentially rewarding artificial rather than authentic social media engagement.

Current Issues

Engagement Metric Vulnerabilities

  • Simple normalization factors (100 likes, 50 replies, 75 retweets, 1000 views) are easily gameable
  • Log transformation doesn't sufficiently address viral outliers or detect artificial inflation
  • No distinction between quality of engagement sources (bot vs real users)
  • Engagement scoring treats all interaction types with equal validity

Content Quality Limitations

  • Binary scoring for media presence (0.5 for any media) ignores quality and relevance
  • Text length scoring (60-200 chars optimal) doesn't measure content value
  • No measurement of content originality or uniqueness
  • Limited content type differentiation (Photos, Videos, GIFs, URLs all treated equally)

Interaction Scoring Flaws

  • Conversation scoring (1.0 for any conversation) doesn't measure quality
  • Mention system (optimal at 1-2 mentions) can be gamed through spam
  • Thread detection is binary (0.5 for self-threads) without quality assessment
  • No measurement of authentic community engagement

Structural Issues

  • Fixed weight distribution (40% engagement, 30% content, 30% interaction) may not reflect true value
  • No temporal analysis of posting patterns or engagement growth
  • Lacks mechanisms to detect coordinated behavior between agents
  • Cannot identify or penalize artificial boost networks
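To make the gameability concrete, here is an illustrative reconstruction of the scoring scheme as described in the bullets above. The normalization factors, 60-200 character window, binary media/thread/conversation bonuses, and 40/30/30 weights come from this ticket; function names and post field names are hypothetical, not taken from the actual `post_scorer.py`:

```python
import math

# Normalization factors named in this ticket (likes, replies, retweets, views).
NORMS = {"likes": 100, "replies": 50, "retweets": 75, "views": 1000}

def engagement_score(post: dict) -> float:
    # Log transform dampens viral outliers but cannot detect artificial inflation:
    # 100 purchased likes score identically to 100 organic ones.
    parts = [
        math.log1p(post.get(key, 0)) / math.log1p(norm)
        for key, norm in NORMS.items()
    ]
    return min(sum(parts) / len(parts), 1.0)

def content_score(post: dict) -> float:
    score = 0.5 if post.get("media") else 0.0            # binary media bonus
    n = len(post.get("text", ""))
    score += 0.5 if 60 <= n <= 200 else 0.25 if n > 0 else 0.0
    return min(score, 1.0)

def interaction_score(post: dict) -> float:
    score = 1.0 if post.get("in_conversation") else 0.0  # any conversation counts
    if post.get("is_self_thread"):
        score += 0.5                                     # binary thread bonus
    return min(score, 1.0)

def total_score(post: dict) -> float:
    # Fixed 40% / 30% / 30% weight distribution described above.
    return (0.4 * engagement_score(post)
            + 0.3 * content_score(post)
            + 0.3 * interaction_score(post))
```

Every branch here is mechanically optimizable: attach any media, pad text into the length window, reply to anything, and buy engagement up to the normalization caps.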

Impact

  • Agents can achieve high scores through mechanical optimization rather than quality
  • Risk of reward distribution being skewed by gaming rather than merit
  • No differentiation between organic and artificial engagement
  • System may incentivize spam and low-quality content optimization

Evidence of Gaming

  • Multiple agents showing identical engagement patterns
  • Suspicious rapid engagement growth on certain posts
  • Cross-promotion patterns between agent networks
  • Optimization around scoring thresholds rather than content quality

Questions to Address

  • How can we detect and measure authentic vs artificial engagement?
  • What metrics would better reflect genuine content value?
  • How can we incorporate engagement quality into scoring?
  • What mechanisms could prevent coordinated gaming?
  • How can we balance complexity vs gaming resistance?

Labels: scoring, security, investigation, high-priority

@teslashibe teslashibe changed the title spike(scoring): review current scoring from miner (posts.json) and understand and iterate on the current scoring model spike(scoring): Investigate Post Scoring System Limitations and Gaming Vulnerabilities Dec 27, 2024
(A comment by @ryssroad was marked as resolved.)

teslashibe replied:

@ryssroad I created a new issue for this suggestion #51

We will clean up this ticket and think about ways to use evaluation of conversations in scoring.

teslashibe commented Dec 30, 2024

Verification that a user is an agent on X

X requires that all automated accounts (agents) on the platform be labeled as such.

See X's policy on automated account labels: https://help.x.com/en/using-x/automated-account-labels

Therefore, we can use the X public API to verify that an X profile is indeed an automated account (agent).

This is the truncated response body we can GET from X for an X profile:

{
  "errors": [...],
  "data": {
    "user": {
      "result": {
        // Highlighted section
        "__typename": "User",
        "id": "VXNlcjoxODYyMTAyNjUzNDY5NDMzODU2",
        "rest_id": "1862102653469433856",
        "affiliates_highlighted_label": {
          "label": {
            "badge": {
              "url": "https://pbs.twimg.com/semantic_core_img/1428827730364096519/4ZXpTBhS?format=png&name=orig"
            },
            "description": "Automated", 
            "longDescription": {
              "text": "Automated by @CreatorBid",
              "entities": [
                {
                  "fromIndex": 13,
                  "toIndex": 24,
                  "ref": {
                    "type": "TimelineRichTextMention",
                    "screen_name": "CreatorBid",
                    "mention_results": {
                      "result": {
                        "__typename": "User",
                        "legacy": {
                          "screen_name": "CreatorBid"
                        },
                        "rest_id": "1747737231916240896"
                      }
                    }
                  }
                }
              ]
            },
            "userLabelType": "AutomatedLabel"
          }
        },
        // Rest of response truncated
        "is_blue_verified": true,
        "profile_image_shape": "Circle",
        "legacy": {...},
        ...
      }
    }
  }
}

We propose verifying that an X account is automated as an immediate improvement to miner/agent registration, making it a criterion not only for registration to the subnet but, implicitly, for scoring.

Here is a feature ticket for this improvement: #52
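The check itself is a straightforward walk of the response shape shown above. A minimal sketch (the function name is hypothetical; any missing key is treated as "not automated"):

```python
def is_automated_account(response: dict) -> bool:
    """Return True if the X user-lookup response carries the "Automated" label.

    Follows the path data.user.result.affiliates_highlighted_label.label
    from the truncated response above; a missing key at any level means
    the account is not labeled as automated.
    """
    label = (
        response.get("data", {})
        .get("user", {})
        .get("result", {})
        .get("affiliates_highlighted_label", {})
        .get("label", {})
    )
    return label.get("userLabelType") == "AutomatedLabel"
```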

teslashibe commented:

Community feedback

My opinion: Skip likes, retweets, and comments until they can be verified. Focus on content from the AI agent instead. The more posts a miner has, the harder and costlier it becomes to fake engagement. Reward miners who frequently post, especially on diverse topics. For example, if a miner posts when BTC hits $100,000 and mentions it explicitly (not just saying it's bullish on BTC), it shows effort. This could require web searches at times. Rank miners higher when they comment on news articles and add something meaningful to the discussion, for instance.
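The suggestion above (ignore unverifiable engagement, reward volume and topic diversity) could be sketched as follows. The 50-post cap and equal weighting are invented for illustration, and topic labels are assumed to come from some upstream classifier:

```python
def content_effort_score(post_topics: list[str]) -> float:
    """Score a miner by posting volume and share of distinct topics.

    post_topics: one topic label per post, e.g. ["btc-price", "ai-news", ...].
    Volume saturates at 50 posts (hypothetical cap); diversity is the
    fraction of posts with a distinct topic. Engagement is deliberately
    ignored, since likes/retweets cannot yet be verified.
    """
    if not post_topics:
        return 0.0
    volume = min(len(post_topics) / 50, 1.0)
    diversity = len(set(post_topics)) / len(post_topics)
    return 0.5 * volume + 0.5 * diversity
```

Under this shape, spamming one topic caps the score through the diversity term, while faking diversity still requires producing genuinely varied posts.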

teslashibe commented Dec 30, 2024

[SPIKE] Comment: LLM-Based Tweet Quality Assessment

After investigating the current scoring vulnerabilities outlined in the spike ticket, we propose leveraging Subnet 19's LLM capabilities to create a more sophisticated content quality assessment system.

Key Insights

  • Basic engagement metrics are easily gamed and don't reflect true content value
  • Semantic analysis via LLM could provide deeper quality assessment
  • Integration with Subnet 19 offers natural pathway for implementation

Proposed Feature

See full feature ticket: [FEATURE] Integrate Subnet 19 LLM for Enhanced Tweet Quality Assessment
Link: #55

The feature proposes:

  1. Using LLM to score multiple quality dimensions (coherence, value, originality)
  2. Analyzing conversational depth and engagement quality
  3. Integrating with existing scoring while maintaining gaming resistance
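The aggregation step in point 1 might look like the sketch below. The dimension names come from this comment; the weights are hypothetical, and the LLM judge is abstracted as a callable so the logic can be tested without a live Subnet 19 endpoint:

```python
from typing import Callable

# Hypothetical weights over the quality dimensions named above.
DIMENSION_WEIGHTS = {"coherence": 0.3, "value": 0.4, "originality": 0.3}

def llm_quality_score(
    tweet_text: str,
    judge: Callable[[str, str], float],  # (text, dimension) -> score in [0, 1]
) -> float:
    """Weighted average of per-dimension LLM scores, clamped to [0, 1].

    In production, `judge` would wrap a Subnet 19 inference call with a
    dimension-specific rubric prompt; here it is injected for testability.
    """
    total = 0.0
    for dimension, weight in DIMENSION_WEIGHTS.items():
        score = min(max(judge(tweet_text, dimension), 0.0), 1.0)
        total += weight * score
    return total
```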

Next Steps

  1. Review feature ticket proposal
  2. Technical feasibility assessment with Subnet 19 integration

/cc @hide-on-bush-x @theMultitude @grantdfoster @mudler @Luka-Loncar

Let's discuss the technical feasibility of this approach; it's low-hanging fruit, and we can easily spin up some notebooks to analyze the impact.

teslashibe commented Dec 30, 2024

Linking to feature ticket for Subnet 19 integration bonus: #55

This feature will add a scoring bonus for SN59 agents that use Subnet 19 for inference, requiring:

  1. Integration with the SN19 metrics API for usage verification:
GET "https://api.nineteen.com/v1/metrics/{nineteen-userid}"
  2. Implementation of a scoring bonus (e.g. 1.2x) for verified active users
  3. Caching and minimum thresholds for verification

This will help drive quality content generation and create stronger subnet synergies.

See full ticket for technical details and acceptance criteria.
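The bonus logic described above reduces to a small gate. This is a sketch only: the 1.2x multiplier is from this comment, while the minimum-usage threshold and the `request_count` field of the metrics payload are hypothetical, not a confirmed SN19 API contract:

```python
from typing import Optional

SN19_BONUS = 1.2    # multiplier proposed in this comment
MIN_REQUESTS = 100  # hypothetical minimum-usage threshold

def apply_sn19_bonus(base_score: float, metrics: Optional[dict]) -> float:
    """Apply the SN19 bonus only for verified, sufficiently active users.

    `metrics` is the (cached) payload from the SN19 metrics endpoint, or
    None when the agent has no verified SN19 usage.
    """
    if metrics and metrics.get("request_count", 0) >= MIN_REQUESTS:
        return base_score * SN19_BONUS
    return base_score
```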

cc @namoray for input on metrics API implementation

teslashibe commented:

Quick win updates and bug fixes here: #61

ryssroad commented Jan 4, 2025

Also, you can use post-interval checking to detect manual posting; it is quite difficult to keep a strict schedule manually.
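The statistic behind this check could be as simple as the spread of gaps between consecutive posts; what counts as a "suspicious" spread would come out of the analysis. A minimal sketch:

```python
from statistics import pstdev

def interval_stdev_seconds(timestamps: list[float]) -> float:
    """Standard deviation of gaps between consecutive post timestamps.

    A very low deviation over many posts indicates a strict schedule
    (automated posting), which humans rarely sustain; the threshold for
    flagging an account is left to empirical analysis.
    """
    if len(timestamps) < 3:
        return float("inf")  # too few posts to judge cadence
    ts = sorted(timestamps)
    gaps = [b - a for a, b in zip(ts, ts[1:])]
    return pstdev(gaps)
```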

teslashibe replied:

@ryssroad we will do an analysis here

Luka-Loncar commented:

We should create follow-up tasks for this spike; let's sync with the team at the next grooming session.

teslashibe commented:

This results in Scoring v4: #96
