Experiment with hqcases removal #35766
base: master
Conversation
DebugMetrics was updated to call `bucket_value()` for histogram metrics. This should have no effect on production metrics.
Force-pushed from `83de59a` to `3f3b9d2` (Compare)
@CodeRabbit please review
🧩 Analysis chain

@millerdev I'll review the changes. I'll pay special attention to:

🏁 Scripts executed (output elided).

@millerdev I've reviewed the changes. The implementation looks solid, with good safety measures in place. It looks good to proceed; the safety measures and gradual rollout capability make it a low-risk change.

✏️ Learnings added

✅ Actions performed: comments resolved; review triggered.
Walkthrough

The changes introduce a new experiments framework within the application. A new app is added, providing a framework for comparing old and new code paths, along with documentation and examples in the corresponding README files. Core components include an …

Sequence Diagram(s)

```mermaid
sequenceDiagram
    participant Caller
    participant ExperimentDecorator
    participant OldCodePath
    participant NewCodePath
    participant MetricsRecorder
    participant ExperimentEnabler
    Caller ->> ExperimentDecorator: Call decorated function
    ExperimentDecorator ->> ExperimentEnabler: Check experiment configuration
    alt New Code Path Enabled
        ExperimentDecorator ->> NewCodePath: Execute new code segment
        NewCodePath -->> ExperimentDecorator: Return result & timing
    else Old Code Path
        ExperimentDecorator ->> OldCodePath: Execute old code segment
        OldCodePath -->> ExperimentDecorator: Return result & timing
    end
    ExperimentDecorator ->> MetricsRecorder: Record metrics and errors
    ExperimentDecorator -->> Caller: Return final result
```
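The flow in the diagram can be sketched as a minimal decorator. This is a hypothetical simplification for illustration only: the real framework consults `ExperimentEnabler` records, records timings to Datadog, and logs differences to Sentry, none of which are reproduced here. The argument names mirror the README example.

```python
import time
from functools import wraps


def experiment(old_args, new_args, is_equal=lambda a, b: a == b, enabled=True):
    """Minimal sketch of the old/new code-path comparison flow."""
    def decorator(func):
        @wraps(func)
        def wrapper(*args, **kwargs):
            start = time.perf_counter()
            old_result = func(*args, **kwargs, **old_args)
            old_time = time.perf_counter() - start  # real code records this metric
            if enabled:  # real code checks ExperimentEnabler configuration
                start = time.perf_counter()
                new_result = func(*args, **kwargs, **new_args)
                new_time = time.perf_counter() - start  # real code records this too
                if not is_equal(old_result, new_result):
                    pass  # real code logs the result difference to Sentry
            return old_result  # the old path's result is always returned
        return wrapper
    return decorator


@experiment(old_args={"feature": "valley"}, new_args={"feature": "crater"})
def lookup(x, *, feature):
    return (x, feature)
```

Calling `lookup(1)` runs both paths but returns the old path's result, `(1, "valley")`.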
Actionable comments posted: 0
🧹 Nitpick comments (11)
corehq/apps/experiments/models.py (1)

`9-32`: Validate or document the meaning of special values in `enabled_percent`.

Your help text explains certain "magic" values (-1, 101, 102), which might surprise new contributors. Consider clarifying or validating them more strictly, for instance through an `Enum` or a custom validator that ensures valid usage.

corehq/apps/experiments/experiment.py (1)

`124-130`: Include `stacklevel` for `warnings.warn`.

Consider adding `stacklevel=2` or higher so that the warning references the calling site rather than the experiment code itself. This helps locate duplicate experiment definitions quickly.

```diff
-    warnings.warn(f"Duplicate experiment {key} will result in conflated ...")
+    warnings.warn(
+        f"Duplicate experiment {key} will result in conflated ...",
+        stacklevel=2,
+    )
```

🧰 Tools
🪛 Ruff (0.8.2)

127-127: No explicit `stacklevel` keyword argument found (B028)
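The earlier suggestion to formalize `enabled_percent`'s special values might look roughly like this. The names and exact semantics below are guesses derived from the help-text values mentioned in the review (-1, 101, 102), not the project's actual definitions:

```python
from enum import IntEnum


class EnabledPercent(IntEnum):
    """Hypothetical names for the special enabled_percent values."""
    DISABLED_NO_METRICS = -1   # disabled, and skip metrics recording
    DISABLED = 0               # disabled (metrics still recorded)
    FULLY_ENABLED = 100        # always run both code paths
    NEW_ONLY = 101             # use only the new code path
    NEW_ONLY_NO_METRICS = 102  # new path only, skip metrics


def validate_enabled_percent(value):
    """Reject values outside the documented -1..102 range."""
    if not (-1 <= value <= 102):
        raise ValueError(f"enabled_percent out of range: {value}")
    return value
```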
corehq/apps/experiments/tests/test_experiment.py (2)

`52-72`: Consider making timing tests more robust.

The current timing test uses fixed sleep durations, which could be flaky in CI environments. Consider:

- Using relative comparisons instead of exact durations
- Adding tolerance for timing variations

```diff
-    sleep2()
+    # Run multiple times to account for timing variations
+    results = [sleep2() for _ in range(5)]
+    # Assert that the average duration is within expected range
+    avg_duration = sum(results) / len(results)
+    assert 0.001 <= avg_duration <= 0.003
```
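One way to damp this kind of flakiness, as the comment suggests, is to average wall-clock time over several runs and assert a generous range rather than an exact duration. This helper is illustrative, not code from the PR:

```python
import time


def average_duration(func, runs=5):
    """Average wall-clock duration of func over several runs to damp jitter."""
    total = 0.0
    for _ in range(runs):
        start = time.perf_counter()
        func()
        total += time.perf_counter() - start
    return total / runs
```

A test would then assert a loose lower bound (e.g. the sleep duration) and a very generous upper bound, rather than a tight window.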
`179-188`: Add a docstring to the `gc_disabled` context manager.

The context manager's purpose and usage should be documented.

```diff
 @contextmanager
 def gc_disabled():
+    """Temporarily disable garbage collection to prevent interference
+    with timing measurements.
+
+    This context manager is particularly useful in timing-sensitive tests
+    where GC could introduce unpredictable delays.
+    """
     enabled = gc.isenabled()
     if enabled:
         gc.disable()
```

corehq/util/metrics/metrics.py (1)
`125-128`: Consider adding type hints and a docstring to the `histogram` method.

The method would benefit from explicit type hints and documentation of its parameters and behavior.

```diff
-    def histogram(self, name, value, bucket_tag, buckets, bucket_unit, tags, **kw):
-        bucket_tags = {bucket_tag: bucket_value(value, buckets, bucket_unit)}
-        self._histogram(name, 1, tags=(tags or {}) | bucket_tags, **kw)
+    def histogram(
+        self,
+        name: str,
+        value: float,
+        bucket_tag: str,
+        buckets: List[float] = DEFAULT_BUCKETS,
+        bucket_unit: str = '',
+        tags: Dict[str, str] = None,
+        **kw
+    ) -> None:
+        """Record a histogram metric with computed bucket tags.
+
+        Args:
+            name: The name of the metric
+            value: The value to record
+            bucket_tag: The tag name for the bucket
+            buckets: List of bucket boundaries
+            bucket_unit: Optional unit for bucket values
+            tags: Additional tags for the metric
+            **kw: Additional keyword arguments
+        """
+        bucket_tags = {bucket_tag: bucket_value(value, buckets, bucket_unit)}
+        self._histogram(name, 1, tags=(tags or {}) | bucket_tags, **kw)
```

corehq/apps/data_pipeline_audit/dbacessors.py (2)
`33-41`: Consider adding error handling for edge cases.

While the function handles basic validation through `is_equal`, consider adding error handling for:

- Empty or invalid date ranges
- Invalid doc_type values

```diff
 @experiment(is_equal=lambda a, b: abs(a - b) < 5)
 def get_es_case_counts(domain, doc_type, gte, lt, escls=es.CaseES):
+    if not doc_type:
+        raise ValueError("doc_type is required")
+    if gte and lt and gte >= lt:
+        raise ValueError("'gte' must be less than 'lt'")
     ex = es.cases.server_modified_range(gte=gte, lt=lt)
     return escls().domain(domain).filter(ex).filter(
         es.filters.OR(
             es.filters.doc_type(doc_type),
             es.filters.doc_type(doc_type.lower()),
         )
     ).count()
```
`44-56`: Consider caching the results for performance.

The function makes two separate ES queries. Consider caching the results to improve performance for repeated calls.

```diff
+from django.core.cache import cache
+from datetime import timedelta
+
 @experiment
 def get_es_case_range(domain, escls=es.CaseES):
+    cache_key = f"es_case_range_{domain}"
+    cached_result = cache.get(cache_key)
+    if cached_result:
+        return cached_result
+
     def descending_query(order):
         result = escls().domain(domain).sort(
             'server_modified_on', desc=order).size(1).run().raw_hits
         if len(result) == 0:
             return None
         else:
             return dateutil.parser.parse(
                 result[0]['_source']['server_modified_on']).date()
-    return (
+    result = (
         descending_query(order=False),
         descending_query(order=True)
     )
+    cache.set(cache_key, result, timeout=timedelta(hours=1).seconds)
+    return result
```

corehq/apps/experiments/tests/test_enabler.py (1)
`228-228`: Consider using an underscore for the unused loop variable.

The loop variable `x` is not used within the loop body.

```diff
-    for x in range(300):
+    for _ in range(300):
```

🧰 Tools
🪛 Ruff (0.8.2)

228-228: Loop control variable `x` not used within loop body. Rename unused `x` to `_x` (B007)
corehq/apps/experiments/README.md (2)

`11-25`: Consider adding caching duration details.

The features section comprehensively covers the key aspects of the framework. However, it would be helpful to specify the exact caching duration for experiment states (mentioned as "may not apply immediately" on line 25).

```diff
-  Django Admin may not apply immediately.
+  Django Admin may not apply immediately (changes can take up to 30 minutes
+  to take effect due to caching).
```
`32-44`: Enhance the usage example with result comparison.

The usage example demonstrates the basic setup but could be more comprehensive. Consider extending the example to show:

- How to customize result comparison using `is_equal`
- How to handle the case when results differ

```python
from corehq.apps.experiments import Experiment, MOON_SHOT

def compare_results(old_result, new_result):
    return old_result.get('key') == new_result.get('key')

experiment = Experiment(
    campaign=MOON_SHOT,
    old_args={"feature": "valley"},
    new_args={"feature": "crater"},
    is_equal=compare_results,  # Custom comparison function
)

@experiment
def function(other, args, *, feature):
    ...  # implementation
```

corehq/apps/README.rst (1)
`39-40`: Consider expanding the description and moving it to the "Engineering Apps" section.

The experiments framework appears to be more of a developer-facing tool than a primary application feature. Consider:

- Moving it to the "Engineering Apps" section (around line 154)
- Expanding the description to highlight its key features

```diff
-experiments - Framework for comparing old and new code paths.
+experiments
+    Developer tooling for comparing old and new code paths, featuring performance metrics
+    collection (Datadog), result comparison (Sentry), and configurable rollout through
+    Django Admin. Used for validating implementation changes before full adoption.
```
📜 Review details
Configuration used: .coderabbit.yaml
Review profile: CHILL
Plan: Pro
⛔ Files ignored due to path filters (1)

- `migrations.lock` is excluded by `!**/*.lock` and included by none
📒 Files selected for processing (14)

- `corehq/apps/README.rst` (1 hunks)
- `corehq/apps/data_pipeline_audit/dbacessors.py` (3 hunks)
- `corehq/apps/domain/tests/test_deletion_models.py` (1 hunks)
- `corehq/apps/dump_reload/tests/test_dump_models.py` (1 hunks)
- `corehq/apps/experiments/README.md` (1 hunks)
- `corehq/apps/experiments/__init__.py` (1 hunks)
- `corehq/apps/experiments/admin.py` (1 hunks)
- `corehq/apps/experiments/experiment.py` (1 hunks)
- `corehq/apps/experiments/migrations/0001_initial.py` (1 hunks)
- `corehq/apps/experiments/models.py` (1 hunks)
- `corehq/apps/experiments/tests/test_enabler.py` (1 hunks)
- `corehq/apps/experiments/tests/test_experiment.py` (1 hunks)
- `corehq/util/metrics/metrics.py` (3 hunks)
- `settings.py` (1 hunks)
✅ Files skipped from review due to trivial changes (1)
- `corehq/apps/experiments/__init__.py`
🧰 Additional context used
📓 Path-based instructions (1)
`**/*.py`: Review the code following best practices and standards
corehq/apps/experiments/admin.py
corehq/apps/dump_reload/tests/test_dump_models.py
corehq/apps/domain/tests/test_deletion_models.py
corehq/apps/experiments/experiment.py
corehq/util/metrics/metrics.py
corehq/apps/experiments/migrations/0001_initial.py
corehq/apps/experiments/tests/test_enabler.py
settings.py
corehq/apps/experiments/models.py
corehq/apps/data_pipeline_audit/dbacessors.py
corehq/apps/experiments/tests/test_experiment.py
🧠 Learnings (2)
📓 Common learnings
Learnt from: millerdev
PR: dimagi/commcare-hq#0
File: :0-0
Timestamp: 2025-02-12T22:11:12.510Z
Learning: The experiments framework in CommCare HQ allows comparison of old and new code paths with the following features:
1. Timing metrics sent to Datadog
2. Result differences logged to Sentry
3. Configurable through ExperimentEnabler records in Django Admin
4. Disabled by default with gradual rollout capability (1-100%)
corehq/apps/experiments/README.md (1)
Learnt from: millerdev
PR: dimagi/commcare-hq#0
File: :0-0
Timestamp: 2025-02-12T22:11:12.510Z
Learning: The experiments framework in CommCare HQ allows comparison of old and new code paths with the following features:
1. Timing metrics sent to Datadog
2. Result differences logged to Sentry
3. Configurable through ExperimentEnabler records in Django Admin
4. Disabled by default with gradual rollout capability (1-100%)
🪛 Ruff (0.8.2)
corehq/apps/experiments/experiment.py
127-127: No explicit `stacklevel` keyword argument found (B028)
corehq/apps/experiments/tests/test_enabler.py
228-228: Loop control variable `x` not used within loop body. Rename unused `x` to `_x` (B007)
🔇 Additional comments (22)
corehq/apps/experiments/models.py (4)
`34-44`: Correctness check for random-based logic.

`is_enabled` uses a random comparison for 0 < value < 100, returning True occasionally and False otherwise. This is correct for a partial rollout. Just ensure that other code paths handle the `None` case properly (for fully new code usage).
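The random partial-rollout check described here can be pictured as follows. This is a simplified illustrative stand-in, not the project's actual `is_enabled`:

```python
import random


def is_enabled(enabled_percent):
    """Simplified percent-based rollout check (illustrative only)."""
    if enabled_percent <= 0:
        return False
    if enabled_percent >= 100:
        return True
    # Enable roughly enabled_percent% of calls
    return random.random() * 100 < enabled_percent
```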
`46-52`: Check boundary conditions for metrics recording.

`should_record_metrics` disallows recording when `enabled_percent` is -1 or 102, aligning with your help text. This looks intentional. Just confirm that you want to fully skip metrics in both the negative and the "fully new" disable-metrics states.
`54-59`: Nice fallback mechanism.

The fallback logic correctly narrows the path by chopping off segments. This design is flexible and matches partial or full path-based enablers. No issues found.
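The "chopping off segments" fallback can be pictured as walking from the most specific dotted path up through its parents. This is an illustrative sketch, not the project's implementation:

```python
def path_candidates(path):
    """Yield a dotted path followed by each of its parent paths."""
    parts = path.split(".")
    while parts:
        yield ".".join(parts)
        parts.pop()  # drop the last segment to broaden the match
```

With this shape, an enabler configured for a parent package (say, `corehq.apps`) would match any more specific path beneath it once the more specific candidates find no record.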
`64-75`: Caching approach is efficient.

Leveraging `quickcache` with a 30-minute memoize timeout is appropriate here, ensuring minimal database load. No issues observed.

corehq/apps/experiments/experiment.py (5)
`13-38`: Fields are well structured with `attrs`.

Your usage of `attrs` for the `Experiment` class and default fields (like `time_buckets`, `percent_buckets`) is clean. This helps avoid boilerplate code and is easy to read.

`39-45`: Use of positional-only arguments.

`def __call__(self, func=None, /, **kw): ...` is valid modern Python syntax. Just confirm all environments support this. If there are Python 3.7 or older environments, consider adjusting.

`50-123`: Comprehensive logic in the wrapper.

Your code effectively captures and logs timing data for both old and new paths, handles partial rollouts, and deals with exceptions. The usage of `notify_exception` and `notify_error` is appropriate. The fallback to `over_the_top` prevents division by zero.

`132-138`: Helpful utility for debugging.

The `describe` function is a neat tool for summarizing arguments in logs. The truncation of long representations is a nice touch for preventing log noise.

`140-145`: Clarity in the `ENABLED_TAG` mapping.

The distinct tags for old/both/new states in `ENABLED_TAG` cleanly reflect the experiment path states.

corehq/apps/experiments/admin.py (1)
`6-9`: Well-organized admin configuration.

Displaying `campaign`, `path`, and `enabled_percent` is sufficient for quick management. Ordering is also a good user experience detail.

corehq/apps/experiments/migrations/0001_initial.py (1)
`1-28`: LGTM! Well-structured migration with clear field definitions and constraints.

The migration follows Django best practices with:
- Descriptive help text for each field
- Appropriate field types and lengths
- Proper validation constraints
- Unique constraint to prevent duplicates
corehq/apps/domain/tests/test_deletion_models.py (1)
`59-59`: LGTM! Appropriate addition to IGNORE_MODELS.

The ExperimentEnabler model is correctly excluded from domain deletion as it's a configuration model.
corehq/apps/data_pipeline_audit/dbacessors.py (3)
14-18
: LGTM! Well-structured experiment configuration.

The experiment setup is clear and follows best practices:
- Campaign name is descriptive
- Default args are empty for old path
- New args specify the new ES class
- Ignore duplicate flag is set for safety
`170-175`: LGTM! Clean implementation of `get_es_case_ids`.

The function maintains compatibility with both ES classes while adding experiment support.
`21-30`: Verify the impact of replacing CaseES with CaseSearchES.

The function correctly handles the ES class replacement in the indices list, but we should verify that CaseSearchES supports all the query features used by this function.
✅ Verification successful

🏁 Script executed:

```shell
#!/bin/bash
# Description: Check if CaseSearchES supports all query methods used with CaseES
ast-grep --pattern 'es.CaseES().$_($$$)'
```

Length of output: 42
Manual Verification Required: Verify Query Method Compatibility Between CaseES and CaseSearchES

The initial ast-grep query produced no output, so its result is inconclusive. To better establish that all query methods expected by the function (e.g., those used within `_get_index_counts_by_domain_doc_type`) work with CaseSearchES, please run some additional searches. For example, check for usage of both `es.CaseES(` and `CaseSearchES` in the codebase. These results will help confirm that the replacement is safe: if the same query methods are used with both classes, and CaseSearchES implements them compatibly with CaseES, the change should have no adverse effects. Please verify these findings manually and confirm that CaseSearchES fully supports the query features expected by the function.

🏁 Script executed:

```shell
#!/bin/bash
echo "***** Searching for usages of es.CaseES( *****"
rg "es\.CaseES\(" -n .
echo "***** Searching for usages of CaseSearchES *****"
rg "CaseSearchES" -n .
```

Length of output: 17168
Query Method Compatibility Verified: CaseSearchES Supports All Required Features

Based on the codebase search results:

- Inheritance confirmation: In `corehq/apps/es/case_search.py`, the class `CaseSearchES` is defined as a subclass of `CaseES`, inheriting its query methods.
- Extensive usage: Multiple tests and modules (e.g., in `corehq/apps/es/tests/test_case_search_es.py` and others) use `CaseSearchES` with typical query operations (like `.domain()`, `.filter()`, `.run()`, etc.) identical to those used with `CaseES`.
- Integrated replacement: The replacement in `corehq/apps/data_pipeline_audit/dbacessors.py` correctly handles the ES class list, and the widespread usage of `CaseSearchES` indicates that it supports all the necessary query features used by the function.

No discrepancies were identified that would impact query functionality when replacing `CaseES` with `CaseSearchES`.

corehq/apps/dump_reload/tests/test_dump_models.py (1)
`69-69`: LGTM! Appropriate addition to IGNORE_MODELS.

The ExperimentEnabler model is correctly added to the ignore list since it's a configuration model that shouldn't be included in domain dumps.
corehq/apps/experiments/tests/test_enabler.py (4)
`20-31`: LGTM! Well-structured test fixture.

The `enabled` fixture provides a clean way to manage experiment state during tests:
- Creates ExperimentEnabler with specified parameters
- Cleans up after tests using yield
- Clears cache to prevent test interference
`34-181`: LGTM! Comprehensive test coverage for experiment execution.

The TestExperimentEnabled class thoroughly tests:
- Disabled experiments with metrics disabled/enabled
- Error handling in both paths
- Metric collection and logging
- Different enabling percentages
`192-288`: LGTM! Thorough testing of experiment enabling logic.

The TestIsEnabled class effectively tests:
- Various enabling percentages (0%, 1%, 49%, 75%, 100%)
- Caching behavior
- Parent package configuration
- Path-specific enabling
🧰 Tools
🪛 Ruff (0.8.2)
228-228: Loop control variable `x` not used within loop body. Rename unused `x` to `_x` (B007)
`290-307`: LGTM! Well-designed test helper function.

The `make_func` helper:
- Creates a test function with configurable error injection
- Tracks function calls for verification
- Adds controlled delays for timing tests
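Based on that description, such a helper might look roughly like the following. The names, defaults, and return value here are assumptions for illustration, not the actual test code:

```python
import time


def make_func(error=None, calls=None, delay=0.0):
    """Create a test function with call tracking, an optional controlled
    delay, and configurable error injection."""
    def func(*args, **kwargs):
        if calls is not None:
            calls.append((args, kwargs))  # track calls for later verification
        if delay:
            time.sleep(delay)  # controlled delay for timing tests
        if error is not None:
            raise error  # inject an error into this code path
        return "result"
    return func
```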
settings.py (1)
275-275
: LGTM! Appropriate addition to HQ_APPS.

The experiments app is correctly added to the HQ_APPS tuple, making it available throughout the application.
corehq/apps/experiments/README.md (1)
1-8
: LGTM! Clear introduction and attribution.

The introduction effectively explains the purpose of the framework and properly attributes inspiration to GitHub Scientist.
Make it easy to see in Datadog if errors should be investigated in Sentry. The "notify" tag will have a value of "error" or "diff" if an error or result difference was logged to Sentry, otherwise "none".
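That tag logic can be sketched as a small helper. This is illustrative only; in the real code the tag is set inside the experiment wrapper:

```python
def notify_tag(error_logged, diff_logged):
    """Return the "notify" metric tag value for a completed experiment run."""
    if error_logged:
        return "error"  # an error was logged to Sentry
    if diff_logged:
        return "diff"   # a result difference was logged to Sentry
    return "none"       # nothing to investigate in Sentry
```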
Force-pushed from `adaa784` to `73dcecd` (Compare)
This PR adds a new `Experiment` decorator to compare timing and result differences between an old code path and a new one. Timing metrics are sent to the metrics backend (Datadog), and result differences are logged to Sentry.

Experiments can be dynamically enabled and disabled by adding `ExperimentEnabler` records in Django Admin. Changes in Django Admin may take up to 30 minutes to become effective due to caching.

See the README for more details.
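The up-to-30-minute propagation delay comes from caching of enabler lookups. A hedged sketch of that pattern (the real code uses `quickcache`; this simple TTL cache and its function names are illustrative stand-ins):

```python
import time

_cache = {}
TTL_SECONDS = 30 * 60  # matches the ~30-minute propagation window noted above


def cached_enabler_lookup(campaign, path, loader, now=time.monotonic):
    """Return loader(campaign, path), reusing a cached value for TTL_SECONDS."""
    key = (campaign, path)
    entry = _cache.get(key)
    if entry is None or now() - entry[1] > TTL_SECONDS:
        entry = (loader(campaign, path), now())  # refresh from the database
        _cache[key] = entry
    return entry[0]
```

Until the cached entry expires, edits made in Django Admin are not visible to callers, which is the behavior described above.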
The final commit in this PR adds experiments to evaluate switching from `CaseES` to `CaseSearchES` in the `data_pipeline_audit` app. These should be low-traffic experiments, and will be used to test integrations with Datadog and Sentry.

🐟 Review by commit.
Safety Assurance
Safety story
All experiments are disabled by default, but timing metrics will be recorded for the old code path immediately after the code is deployed.
Experiments have been applied to a few dbaccessor functions in the `data_pipeline_audit` app, which are only used by management commands:

- `get_es_counts_by_doc_type`: `print_domain_stats` and `compare_doc_counts`
- `get_es_case_counts`: `compare_doc_ids`
- `get_es_case_range`: `compare_doc_ids`
- `get_es_case_ids`: `compare_doc_ids`
Automated test coverage
Yes.
Migrations
Rollback instructions
If this PR is rolled back, the migrations should be unapplied on all environments where it was deployed.