
How to improve model efficiency in real-world applications? #2252

Open
mzgcz opened this issue Nov 29, 2024 · 0 comments
Labels
question Further information is requested

Comments


mzgcz commented Nov 29, 2024

Notice: In order to resolve issues more efficiently, please raise issues following the template.

❓ Questions and Help

How can I improve the efficiency of the online (streaming) model in real-world applications?
For language models, inference efficiency can be improved by batching requests (increasing the batch size); multiple instances can be deployed to handle concurrent inference requests; and TensorRT can be used to speed up inference.
For FunASR online models, which of these measures are feasible, and are there any better recommendations?
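For illustration, the multi-instance idea mentioned above can be sketched as a fixed pool of model instances shared by concurrent request handlers. The names below (`DummyStreamingModel`, `ModelPool`, `serve_stream`) are hypothetical and the model is a stub; this only demonstrates the concurrency pattern, not FunASR's actual API.

```python
import queue
import threading

class DummyStreamingModel:
    """Stand-in for one streaming ASR model instance (hypothetical stub)."""
    def infer_chunk(self, chunk):
        # A real model would return a partial transcript for this audio chunk.
        return f"text({chunk})"

class ModelPool:
    """Fixed pool of model instances; callers block until one is free."""
    def __init__(self, num_instances):
        self._pool = queue.Queue()
        for _ in range(num_instances):
            self._pool.put(DummyStreamingModel())

    def serve_stream(self, chunks):
        model = self._pool.get()          # block until an instance is available
        try:
            return [model.infer_chunk(c) for c in chunks]
        finally:
            self._pool.put(model)         # return the instance to the pool

def handle_requests(pool, requests):
    """Run each request (a list of chunks) on its own thread against the pool."""
    results = {}
    lock = threading.Lock()

    def worker(req_id, chunks):
        out = pool.serve_stream(chunks)
        with lock:
            results[req_id] = out

    threads = [threading.Thread(target=worker, args=(i, r))
               for i, r in enumerate(requests)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return results
```

With, say, `ModelPool(2)` and four concurrent requests, at most two inferences run at once while the other callers wait; the pool size bounds memory use while still allowing concurrency.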

Before asking:

  1. search the issues.
  2. search the docs.

What is your question?

Code

What have you tried?

What's your environment?

  • OS (e.g., Linux):
  • FunASR Version (e.g., 1.0.0):
  • ModelScope Version (e.g., 1.11.0):
  • PyTorch Version (e.g., 2.0.0):
  • How you installed funasr (pip, source):
  • Python version:
  • GPU (e.g., V100M32):
  • CUDA/cuDNN version (e.g., cuda11.7):
  • Docker version (e.g., funasr-runtime-sdk-cpu-0.4.1):
  • Any other relevant information:
@mzgcz mzgcz added the question Further information is requested label Nov 29, 2024