aclnet

Use Case and High-Level Description

The AclNet model is designed to perform sound classification. The AclNet model is trained on an internal dataset of environmental sounds. For details about the model, see this paper.

The model input is a segment of PCM audio samples in [N, C, 1, L] format.

The model output for AclNet is the sound classifier output for the 53 different environmental sound classes from the internal sound database.

Specification

Metric	Value
Type	Classification
GFLOPs	1.4
MParams	2.7
Source framework	PyTorch*

Accuracy

See this publication and this paper.

Input

Original Model

Audio, name - 0, shape - 1,1,1,L, format is N,C,1,L where:

N - batch size
C - channel
L - number of PCM samples (minimum value is 16000)

Converted Model

Audio, name - 0, shape - 1,1,1,L, format is N,C,1,L where:

N - batch size
C - channel
L - number of PCM samples (minimum value is 16000)

Output

Original Model

Sound classifier (see labels), name - 203, shape - 1,53, output data format is N,C where:

N - batch size
C - Predicted softmax scores for each class in [0, 1] range

Converted Model

Sound classifier (see labels), name - 203, shape - 1,53, output data format is N,C where:

N - batch size
C - Predicted softmax scores for each class in [0, 1] range

Legal Information

The original model is distributed under Apache License, Version 2.0.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

aclnet.md

aclnet.md

aclnet

Use Case and High-Level Description

Specification

Accuracy

Input

Original Model

Converted Model

Output

Original Model

Converted Model

Legal Information

Files

aclnet.md

Latest commit

History

aclnet.md

File metadata and controls

aclnet

Use Case and High-Level Description

Specification

Accuracy

Input

Original Model

Converted Model

Output

Original Model

Converted Model

Legal Information