
Add tests for TFLite models #5

Open
ScottTodd opened this issue Aug 9, 2024 · 5 comments
Labels: enhancement (New feature or request)

@ScottTodd (Member)

See nod-ai/SHARK-TestSuite#291

Search around for upstream test suites (a corpus of .tflite files).

We could also test TOSA operators, perhaps using https://git.mlplatform.org/tosa/conformance_tests.git/ (see https://www.mlplatform.org/tosa/software.html).
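If we find such a corpus, a minimal pytest sketch for sweeping it could look like this (the corpus directory and test body are hypothetical, not existing code):

```python
from pathlib import Path

import pytest

# Hypothetical location of a vendored/downloaded corpus of .tflite files.
CORPUS_DIR = Path(__file__).parent / "corpus"


@pytest.mark.parametrize(
    "tflite_file",
    sorted(CORPUS_DIR.glob("**/*.tflite")),
    ids=lambda path: path.stem,
)
def test_tflite_model(tflite_file: Path):
    # Import, compile, and run the model with IREE here.
    ...
```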

@ScottTodd (Member, Author)

We can also test across different TensorFlow package versions to show when compatibility breaks (e.g. when TOSA ops change; see https://discord.com/channels/689900678990135345/689900680009482386/1276633806643662868 and https://discord.com/channels/689900678990135345/689900680009482386/1255019974867550208).
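
As a sketch, a version-gated xfail mark could make those breaks visible across a test matrix (the version boundary and reason here are assumptions for illustration):

```python
import pytest
import tensorflow as tf
from packaging.version import Version

TF_VERSION = Version(tf.__version__)

# Hypothetical boundary: mark tests that break once the TFLite-to-TOSA
# lowering changes in a newer TensorFlow release.
xfail_on_new_tosa_ops = pytest.mark.xfail(
    TF_VERSION >= Version("2.17"),
    reason="TOSA op definitions changed in this TensorFlow release",
)
```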

ScottTodd added a commit that referenced this issue Sep 19, 2024
Progress on #6.

A sample test report HTML file is available here:
https://scotttodd.github.io/iree-test-suites/onnx_models/report_2024_09_17.html

These new tests (a rough code sketch follows this list):

* Download models from https://github.com/onnx/models
* Extract metadata from the models to determine which functions to call
with random data
* Run the models through [ONNX Runtime](https://onnxruntime.ai/) as a
reference implementation
* Import the models using `iree-import-onnx` (until we have a better
API: iree-org/iree#18289)
* Compile the models using `iree-compile` (currently just for `llvm-cpu`
but this could be parameterized later)
* Run the models using `iree-run-module`, checking outputs using
`--expected_output` and the reference data
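
Per model, the flow is roughly the following (paths and flags are illustrative, not the exact helper code):

```python
import subprocess

# Hypothetical paths; the real suite derives these from downloaded models.
onnx_path = "model.onnx"
mlir_path = "model.mlir"
vmfb_path = "model_cpu.vmfb"

# Import ONNX to MLIR (until a better API exists: iree-org/iree#18289).
subprocess.run(["iree-import-onnx", onnx_path, "-o", mlir_path], check=True)

# Compile for llvm-cpu (could be parameterized over backends later).
subprocess.run(
    ["iree-compile", mlir_path,
     "--iree-hal-target-backends=llvm-cpu", "-o", vmfb_path],
    check=True,
)

# Run, checking outputs against the ONNX Runtime reference data.
subprocess.run(
    ["iree-run-module", f"--module={vmfb_path}",
     "--input=@input_0.npy", "--expected_output=@output_0.npy"],
    check=True,
)
```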

Tests are written in Python using a set of pytest helper functions. As
the tests run, they can log details about what commands they are
running. When run locally, the `artifacts/` directory will contain all
the relevant files. More can be done in follow-up PRs to improve the
ergonomics there (like generating flagfiles).

Each test case can use XFAIL like
`@pytest.mark.xfail(raises=IreeRunException)`. As we test across
multiple backends or want to configure the test suite from another repo
(e.g. [iree-org/iree](https://github.com/iree-org/iree)), we can explore
more expressive marks.
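
For instance (model and helper names are hypothetical):

```python
import pytest

# Hypothetical helpers mirroring what the suite's utils module provides.
from .utils import IreeRunException, compile_and_run


@pytest.mark.xfail(raises=IreeRunException)
def test_resnet50():
    compile_and_run("resnet50.onnx")  # expected to fail at runtime for now
```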

Note that unlike the ONNX _operator_ tests, these tests use
`onnxruntime` and `iree-import-onnx` at test time. The operator tests
handle that as an infrequently run offline step. We could do something
similar here, but the test inputs and outputs can be rather large for
real models and that gets into Git LFS or cloud storage territory.

If this test authoring model works well enough, we can do something
similar for other ML frameworks like TFLite
(#5).
ScottTodd self-assigned this Dec 9, 2024
@ScottTodd (Member, Author)

I may start on this soon, given some recent regressions in TFLite/TOSA program compilation.

ScottTodd added the enhancement label Dec 11, 2024
@ScottTodd (Member, Author)

https://pypi.org/project/ai-edge-litert/ has no wheels published for Windows, and the same is true for the original https://pypi.org/project/tflite-runtime/. That limits our testing options. We might be able to test compilation without execution, or generate golden test inputs/outputs on Linux and check those files in.

@ScottTodd (Member, Author)

Ah! This code from https://github.com/iree-org/iree/blob/main/integrations/tensorflow/test/python/iree_tfl_tests/test_util.py still works on Windows:

```python
import tensorflow.compat.v2 as tf

self.tflite_interpreter = tf.lite.Interpreter(model_path=self.tflite_file)
```
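
A minimal self-contained sketch of using that interpreter to produce golden outputs (the model path and random inputs are illustrative):

```python
import numpy as np
import tensorflow.compat.v2 as tf

interpreter = tf.lite.Interpreter(model_path="model.tflite")
interpreter.allocate_tensors()

input_details = interpreter.get_input_details()
output_details = interpreter.get_output_details()

# Feed random data matching the model's input signature.
input_data = np.random.rand(*input_details[0]["shape"]).astype(
    input_details[0]["dtype"])
interpreter.set_tensor(input_details[0]["index"], input_data)
interpreter.invoke()

# This output could be checked in as a golden reference for other platforms.
golden_output = interpreter.get_tensor(output_details[0]["index"])
np.save("expected_output_0.npy", golden_output)
```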

ScottTodd added a commit that referenced this issue Jan 15, 2025
Progress on #5.

This contains two simple test cases for demonstration purposes, one of
which is currently failing due to a regression:
iree-org/iree#19402.

The test suite follows the same structure as the onnx_models test suite
in this repository. Opportunities for cleanup and refactoring will become
more evident as this grows. We could, for example, share the
`compile_mlir_with_iree` helper function between both test suites.
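
A rough sketch of what a shared helper could look like (the signature is an assumption, not the current code):

```python
import subprocess
from pathlib import Path


def compile_mlir_with_iree(mlir_path: Path, target: str = "llvm-cpu") -> Path:
    """Compiles an MLIR file with iree-compile and returns the .vmfb path."""
    vmfb_path = mlir_path.with_suffix(f".{target}.vmfb")
    subprocess.run(
        ["iree-compile", str(mlir_path),
         f"--iree-hal-target-backends={target}", "-o", str(vmfb_path)],
        check=True,
    )
    return vmfb_path
```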
@ScottTodd (Member, Author)

Landed a test suite with two tests so far, running nightly: https://github.com/iree-org/iree-test-suites/actions/workflows/test_litert_models.yml?query=branch%3Amain

The new tests also show how test_mobilenet_v1_0_25_224 is newly failing after iree-org/iree#19683, as expected:
https://github.com/iree-org/iree-test-suites/actions/runs/12845103100/job/35818831797#step:6:21

```
ERROR    litert_models.utils:utils.py:96 Compilation of '/home/runner/.cache/kagglehub/models/tensorflow/mobilenet-v1/tfLite/0-25-224/1/1_cpu.vmfb' failed
ERROR    litert_models.utils:utils.py:97 iree-compile stdout:
ERROR    litert_models.utils:utils.py:98 
ERROR    litert_models.utils:utils.py:99 iree-compile stderr:
ERROR    litert_models.utils:utils.py:100 <unknown>:0: error: loc("MobilenetV1/MobilenetV1/Conv2d_0/Relu6"): 'tosa.conv2d' op requires attribute 'acc_type'
```
