OLDA's "Audio file too short!" warning is misleading #72

andimarafioti · 2017-10-10T14:13:15Z

In OLDA's segmenter.py, the same warning for short files appears in two places (lines 277-290 and 340-345):

        try:
            # Load and apply transform
            W = load_transform(self.config["transform"])
            F = W.dot(F)

            # Get Segments
            kmin, kmax = get_num_segs(dur)
            est_idxs = get_segments(F, kmin=kmin, kmax=kmax)
        except:
            # The audio file is too short, only beginning and end
            logging.warning("Audio file too short! "
                            "Only start and end boundaries.")
            est_idxs = [0, F.shape[1] - 1]

and

        try:
            # Load and apply transform
            W = load_transform(self.config["transform"])
            F = W.dot(F)

            # Get Segments
            kmin, kmax = get_num_segs(dur)

            # Run algorithm layer by layer
            est_idxs = []
            est_labels = []
            for k in range(kmin, kmax):
                S, cost = get_k_segments(F, k)
                est_idxs.append(S)
                est_labels.append(np.ones(len(S) - 1) * -1)

                # Make sure that the first and last boundaries are included
                assert est_idxs[-1][0] == 0 and \
                    est_idxs[-1][-1] == F.shape[1] - 1, "Layer %d does not " \
                    "start or end in the right frame(s)." % k

                # Post process layer
                est_idxs[-1], est_labels[-1] = \
                        self._postprocess(est_idxs[-1], est_labels[-1])
        except:
            # The audio file is too short, only beginning and end
            logging.warning("Audio file too short! "
                            "Only start and end boundaries.")
            est_idxs = [np.array([0, F.shape[1] - 1])]
            est_labels = [np.ones(1) * -1]

I found that at the moment there is an issue between librosa and sklearn that makes the OLDA algorithm fail (it should be fixed soon, though), and this warning reports something completely unrelated to the actual problem. Could someone familiar with the OLDA algorithm estimate how long the file needs to be for it to work?
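To illustrate the masking, here is a minimal sketch rather than the actual segmenter code. `broken_dependency` is a hypothetical stand-in that simulates the library failure with a `RuntimeError`; the real exception type is not stated in this thread. The first function reproduces the catch-all pattern, and the second keeps the same fallback but logs the traceback so the real cause stays visible.

```python
import logging


def broken_dependency():
    """Hypothetical stand-in for a call that fails for reasons unrelated to file length."""
    raise RuntimeError("simulated library incompatibility")


def segment_catch_all(n_frames):
    """Catch-all pattern: any failure is reported as a short file."""
    try:
        broken_dependency()
        return list(range(0, n_frames, 10))
    except:  # noqa: E722 (bare except kept on purpose to show the problem)
        logging.warning("Audio file too short! Only start and end boundaries.")
        return [0, n_frames - 1]


def segment_with_traceback(n_frames):
    """Same fallback, but the log records what actually went wrong."""
    try:
        broken_dependency()
        return list(range(0, n_frames, 10))
    except Exception:
        # logging.exception logs at ERROR level and appends the traceback.
        logging.exception("Segmentation failed; using start and end boundaries.")
        return [0, n_frames - 1]
```

Both functions return the two-boundary fallback for a 1000-frame input, but only the second one leaves a log entry that points at the failing call.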

urinieto · 2017-10-11T00:16:51Z

Yeah, this should be fixed as it is terrible coding practice to not catch specific exceptions. Will work on this in the next release. Thanks for pointing it out.

andimarafioti · 2017-10-11T09:40:01Z

Great! I just wanted to point out where this problem with the OLDA algorithm is coming from in case anyone else runs into it.
About the exception, it's not that bad. But if you're going to work on it, I suggest not only catching specific exceptions but also using smaller try blocks, and never inside a for loop. There's even an assertion in there that gets caught by the try block, so you will get a "file too short" message instead of the intended "Layer %d does not start or end in the right frame(s).".
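A rough sketch of what that restructuring could look like, under stated assumptions: `TooShortError`, `check_length`, and `k_boundaries` are hypothetical names that do not exist in the library, `k_boundaries` only imitates `get_k_segments` with evenly spaced boundaries, and the threshold `n_frames < kmax` is a placeholder, since the minimum length OLDA needs is still an open question here.

```python
import logging

import numpy as np


class TooShortError(ValueError):
    """Hypothetical exception for inputs with too few frames."""


def check_length(n_frames, kmax):
    # Assumed threshold for illustration only; the real minimum is unknown.
    if n_frames < kmax:
        raise TooShortError("%d frames, need at least %d" % (n_frames, kmax))


def k_boundaries(n_frames, k):
    """Hypothetical stand-in for get_k_segments: k evenly spaced boundaries."""
    return np.linspace(0, n_frames - 1, k).astype(int)


def segment_layers(F, kmin, kmax):
    n_frames = F.shape[1]

    # Small try block, outside the loop, catching one specific exception.
    try:
        check_length(n_frames, kmax)
    except TooShortError as exc:
        logging.warning("Audio file too short (%s). "
                        "Only start and end boundaries.", exc)
        return [np.array([0, n_frames - 1])]

    est_idxs = []
    for k in range(kmin, kmax):
        S = k_boundaries(n_frames, k)
        # Not wrapped in a try block, so a failure surfaces with its own message.
        assert S[0] == 0 and S[-1] == n_frames - 1, \
            "Layer %d does not start or end in the right frame(s)." % k
        est_idxs.append(S)
    return est_idxs
```

With this layout, only the length check can produce the "too short" warning; an assertion failure or an unrelated library error propagates unchanged.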

PaulMcInnis · 2017-12-07T15:19:38Z

Scikit-learn's 0.19.1 release has now fixed this issue. It looks like all that needs to be done now is to update the setup.py 👍

urinieto · 2017-12-07T18:38:22Z

Awesome, thanks for sharing this, @PaulMcInnis. I will leave this open so that I remember to remove the general exception catching in the next release.

andimarafioti changed the title ~~OLDA "Audio file too short!" warning is misleading~~ OLDA's "Audio file too short!" warning is misleading Oct 10, 2017

urinieto added the bug label Oct 11, 2017

urinieto self-assigned this Oct 11, 2017
