diff --git a/LICENSE.md b/LICENSE.md
new file mode 100644
index 0000000..47cd78a
--- /dev/null
+++ b/LICENSE.md
@@ -0,0 +1,141 @@
+## Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International Public License
+
+By exercising the Licensed Rights (defined below), You accept and agree to be bound by the terms and conditions of this Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International Public License ("Public License"). To the extent this Public License may be interpreted as a contract, You are granted the Licensed Rights in consideration of Your acceptance of these terms and conditions, and the Licensor grants You such rights in consideration of benefits the Licensor receives from making the Licensed Material available under these terms and conditions.
+
+### Section 1 – Definitions.
+
+a. __Adapted Material__ means material subject to Copyright and Similar Rights that is derived from or based upon the Licensed Material and in which the Licensed Material is translated, altered, arranged, transformed, or otherwise modified in a manner requiring permission under the Copyright and Similar Rights held by the Licensor. For purposes of this Public License, where the Licensed Material is a musical work, performance, or sound recording, Adapted Material is always produced where the Licensed Material is synched in timed relation with a moving image.
+
+b. __Copyright and Similar Rights__ means copyright and/or similar rights closely related to copyright including, without limitation, performance, broadcast, sound recording, and Sui Generis Database Rights, without regard to how the rights are labeled or categorized. For purposes of this Public License, the rights specified in Section 2(b)(1)-(2) are not Copyright and Similar Rights.
+
+c. __Effective Technological Measures__ means those measures that, in the absence of proper authority, may not be circumvented under laws fulfilling obligations under Article 11 of the WIPO Copyright Treaty adopted on December 20, 1996, and/or similar international agreements.
+
+d. __Exceptions and Limitations__ means fair use, fair dealing, and/or any other exception or limitation to Copyright and Similar Rights that applies to Your use of the Licensed Material.
+
+e. __Licensed Material__ means the artistic or literary work, database, or other material to which the Licensor applied this Public License.
+
+f. __Licensed Rights__ means the rights granted to You subject to the terms and conditions of this Public License, which are limited to all Copyright and Similar Rights that apply to Your use of the Licensed Material and that the Licensor has authority to license.
+
+g. __Licensor__ means the individual(s) or entity(ies) granting rights under this Public License.
+
+h. __NonCommercial__ means not primarily intended for or directed towards commercial advantage or monetary compensation. For purposes of this Public License, the exchange of the Licensed Material for other material subject to Copyright and Similar Rights by digital file-sharing or similar means is NonCommercial provided there is no payment of monetary compensation in connection with the exchange.
+
+i. __Share__ means to provide material to the public by any means or process that requires permission under the Licensed Rights, such as reproduction, public display, public performance, distribution, dissemination, communication, or importation, and to make material available to the public including in ways that members of the public may access the material from a place and at a time individually chosen by them.
+
+j. __Sui Generis Database Rights__ means rights other than copyright resulting from Directive 96/9/EC of the European Parliament and of the Council of 11 March 1996 on the legal protection of databases, as amended and/or succeeded, as well as other essentially equivalent rights anywhere in the world.
+
+k. __You__ means the individual or entity exercising the Licensed Rights under this Public License. Your has a corresponding meaning.
+
+### Section 2 – Scope.
+
+a. ___License grant.___
+
+ 1. Subject to the terms and conditions of this Public License, the Licensor hereby grants You a worldwide, royalty-free, non-sublicensable, non-exclusive, irrevocable license to exercise the Licensed Rights in the Licensed Material to:
+
+ A. reproduce and Share the Licensed Material, in whole or in part, for NonCommercial purposes only; and
+
+ B. produce and reproduce, but not Share, Adapted Material for NonCommercial purposes only.
+
+ 2. __Exceptions and Limitations.__ For the avoidance of doubt, where Exceptions and Limitations apply to Your use, this Public License does not apply, and You do not need to comply with its terms and conditions.
+
+ 3. __Term.__ The term of this Public License is specified in Section 6(a).
+
+ 4. __Media and formats; technical modifications allowed.__ The Licensor authorizes You to exercise the Licensed Rights in all media and formats whether now known or hereafter created, and to make technical modifications necessary to do so. The Licensor waives and/or agrees not to assert any right or authority to forbid You from making technical modifications necessary to exercise the Licensed Rights, including technical modifications necessary to circumvent Effective Technological Measures. For purposes of this Public License, simply making modifications authorized by this Section 2(a)(4) never produces Adapted Material.
+
+ 5. __Downstream recipients.__
+
+ A. __Offer from the Licensor – Licensed Material.__ Every recipient of the Licensed Material automatically receives an offer from the Licensor to exercise the Licensed Rights under the terms and conditions of this Public License.
+
+ B. __No downstream restrictions.__ You may not offer or impose any additional or different terms or conditions on, or apply any Effective Technological Measures to, the Licensed Material if doing so restricts exercise of the Licensed Rights by any recipient of the Licensed Material.
+
+ 6. __No endorsement.__ Nothing in this Public License constitutes or may be construed as permission to assert or imply that You are, or that Your use of the Licensed Material is, connected with, or sponsored, endorsed, or granted official status by, the Licensor or others designated to receive attribution as provided in Section 3(a)(1)(A)(i).
+
+b. ___Other rights.___
+
+ 1. Moral rights, such as the right of integrity, are not licensed under this Public License, nor are publicity, privacy, and/or other similar personality rights; however, to the extent possible, the Licensor waives and/or agrees not to assert any such rights held by the Licensor to the limited extent necessary to allow You to exercise the Licensed Rights, but not otherwise.
+
+ 2. Patent and trademark rights are not licensed under this Public License.
+
+ 3. To the extent possible, the Licensor waives any right to collect royalties from You for the exercise of the Licensed Rights, whether directly or through a collecting society under any voluntary or waivable statutory or compulsory licensing scheme. In all other cases the Licensor expressly reserves any right to collect such royalties, including when the Licensed Material is used other than for NonCommercial purposes.
+
+### Section 3 – License Conditions.
+
+Your exercise of the Licensed Rights is expressly made subject to the following conditions.
+
+a. ___Attribution.___
+
+ 1. If You Share the Licensed Material, You must:
+
+ A. retain the following if it is supplied by the Licensor with the Licensed Material:
+
+ i. identification of the creator(s) of the Licensed Material and any others designated to receive attribution, in any reasonable manner requested by the Licensor (including by pseudonym if designated);
+
+ ii. a copyright notice;
+
+ iii. a notice that refers to this Public License;
+
+ iv. a notice that refers to the disclaimer of warranties;
+
+ v. a URI or hyperlink to the Licensed Material to the extent reasonably practicable;
+
+ B. indicate if You modified the Licensed Material and retain an indication of any previous modifications; and
+
+ C. indicate the Licensed Material is licensed under this Public License, and include the text of, or the URI or hyperlink to, this Public License.
+
+ For the avoidance of doubt, You do not have permission under this Public License to Share Adapted Material.
+
+ 2. You may satisfy the conditions in Section 3(a)(1) in any reasonable manner based on the medium, means, and context in which You Share the Licensed Material. For example, it may be reasonable to satisfy the conditions by providing a URI or hyperlink to a resource that includes the required information.
+
+ 3. If requested by the Licensor, You must remove any of the information required by Section 3(a)(1)(A) to the extent reasonably practicable.
+
+### Section 4 – Sui Generis Database Rights.
+
+Where the Licensed Rights include Sui Generis Database Rights that apply to Your use of the Licensed Material:
+
+a. for the avoidance of doubt, Section 2(a)(1) grants You the right to extract, reuse, reproduce, and Share all or a substantial portion of the contents of the database for NonCommercial purposes only and provided You do not Share Adapted Material;
+
+b. if You include all or a substantial portion of the database contents in a database in which You have Sui Generis Database Rights, then the database in which You have Sui Generis Database Rights (but not its individual contents) is Adapted Material; and
+
+c. You must comply with the conditions in Section 3(a) if You Share all or a substantial portion of the contents of the database.
+
+For the avoidance of doubt, this Section 4 supplements and does not replace Your obligations under this Public License where the Licensed Rights include other Copyright and Similar Rights.
+
+### Section 5 – Disclaimer of Warranties and Limitation of Liability.
+
+a. __Unless otherwise separately undertaken by the Licensor, to the extent possible, the Licensor offers the Licensed Material as-is and as-available, and makes no representations or warranties of any kind concerning the Licensed Material, whether express, implied, statutory, or other. This includes, without limitation, warranties of title, merchantability, fitness for a particular purpose, non-infringement, absence of latent or other defects, accuracy, or the presence or absence of errors, whether or not known or discoverable. Where disclaimers of warranties are not allowed in full or in part, this disclaimer may not apply to You.__
+
+b. __To the extent possible, in no event will the Licensor be liable to You on any legal theory (including, without limitation, negligence) or otherwise for any direct, special, indirect, incidental, consequential, punitive, exemplary, or other losses, costs, expenses, or damages arising out of this Public License or use of the Licensed Material, even if the Licensor has been advised of the possibility of such losses, costs, expenses, or damages. Where a limitation of liability is not allowed in full or in part, this limitation may not apply to You.__
+
+c. The disclaimer of warranties and limitation of liability provided above shall be interpreted in a manner that, to the extent possible, most closely approximates an absolute disclaimer and waiver of all liability.
+
+### Section 6 – Term and Termination.
+
+a. This Public License applies for the term of the Copyright and Similar Rights licensed here. However, if You fail to comply with this Public License, then Your rights under this Public License terminate automatically.
+
+b. Where Your right to use the Licensed Material has terminated under Section 6(a), it reinstates:
+
+ 1. automatically as of the date the violation is cured, provided it is cured within 30 days of Your discovery of the violation; or
+
+ 2. upon express reinstatement by the Licensor.
+
+ For the avoidance of doubt, this Section 6(b) does not affect any right the Licensor may have to seek remedies for Your violations of this Public License.
+
+c. For the avoidance of doubt, the Licensor may also offer the Licensed Material under separate terms or conditions or stop distributing the Licensed Material at any time; however, doing so will not terminate this Public License.
+
+d. Sections 1, 5, 6, 7, and 8 survive termination of this Public License.
+
+### Section 7 – Other Terms and Conditions.
+
+a. The Licensor shall not be bound by any additional or different terms or conditions communicated by You unless expressly agreed.
+
+b. Any arrangements, understandings, or agreements regarding the Licensed Material not stated herein are separate from and independent of the terms and conditions of this Public License.
+
+### Section 8 – Interpretation.
+
+a. For the avoidance of doubt, this Public License does not, and shall not be interpreted to, reduce, limit, restrict, or impose conditions on any use of the Licensed Material that could lawfully be made without permission under this Public License.
+
+b. To the extent possible, if any provision of this Public License is deemed unenforceable, it shall be automatically reformed to the minimum extent necessary to make it enforceable. If the provision cannot be reformed, it shall be severed from this Public License without affecting the enforceability of the remaining terms and conditions.
+
+c. No term or condition of this Public License will be waived and no failure to comply consented to unless expressly agreed to by the Licensor.
+
+d. Nothing in this Public License constitutes or may be interpreted as a limitation upon, or waiver of, any privileges and immunities that apply to the Licensor or You, including from the legal processes of any jurisdiction or authority.
diff --git a/README.md b/README.md
new file mode 100644
index 0000000..5863402
--- /dev/null
+++ b/README.md
@@ -0,0 +1,5 @@
+## Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation
+
+This is the official website for the paper *Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation* by Yatong Bai, Trung Dang, Dung Tran, Kazuhito Koishida, and Somayeh Sojoudi.
+
+The website has two parts: a demo page and an example of the human evaluation form.
diff --git a/audio/0.wav b/audio/0.wav
new file mode 100644
index 0000000..cf7ddc0
Binary files /dev/null and b/audio/0.wav differ
diff --git a/audio/1.wav b/audio/1.wav
new file mode 100644
index 0000000..f4dcc4e
Binary files /dev/null and b/audio/1.wav differ
diff --git a/audio/10.wav b/audio/10.wav
new file mode 100644
index 0000000..70315a4
Binary files /dev/null and b/audio/10.wav differ
diff --git a/audio/11.wav b/audio/11.wav
new file mode 100644
index 0000000..2572d32
Binary files /dev/null and b/audio/11.wav differ
diff --git a/audio/12.wav b/audio/12.wav
new file mode 100644
index 0000000..c8b0a70
Binary files /dev/null and b/audio/12.wav differ
diff --git a/audio/13.wav b/audio/13.wav
new file mode 100644
index 0000000..2c3b686
Binary files /dev/null and b/audio/13.wav differ
diff --git a/audio/14.wav b/audio/14.wav
new file mode 100644
index 0000000..d296637
Binary files /dev/null and b/audio/14.wav differ
diff --git a/audio/15.wav b/audio/15.wav
new file mode 100644
index 0000000..0b53468
Binary files /dev/null and b/audio/15.wav differ
diff --git a/audio/16.wav b/audio/16.wav
new file mode 100644
index 0000000..c8d6c99
Binary files /dev/null and b/audio/16.wav differ
diff --git a/audio/17.wav b/audio/17.wav
new file mode 100644
index 0000000..918008f
Binary files /dev/null and b/audio/17.wav differ
diff --git a/audio/18.wav b/audio/18.wav
new file mode 100644
index 0000000..337c46a
Binary files /dev/null and b/audio/18.wav differ
diff --git a/audio/19.wav b/audio/19.wav
new file mode 100644
index 0000000..e841027
Binary files /dev/null and b/audio/19.wav differ
diff --git a/audio/2.wav b/audio/2.wav
new file mode 100644
index 0000000..d4e0122
Binary files /dev/null and b/audio/2.wav differ
diff --git a/audio/20.wav b/audio/20.wav
new file mode 100644
index 0000000..5545e21
Binary files /dev/null and b/audio/20.wav differ
diff --git a/audio/21.wav b/audio/21.wav
new file mode 100644
index 0000000..e1dab36
Binary files /dev/null and b/audio/21.wav differ
diff --git a/audio/22.wav b/audio/22.wav
new file mode 100644
index 0000000..92016e5
Binary files /dev/null and b/audio/22.wav differ
diff --git a/audio/23.wav b/audio/23.wav
new file mode 100644
index 0000000..8787b84
Binary files /dev/null and b/audio/23.wav differ
diff --git a/audio/24.wav b/audio/24.wav
new file mode 100644
index 0000000..e8caf5d
Binary files /dev/null and b/audio/24.wav differ
diff --git a/audio/25.wav b/audio/25.wav
new file mode 100644
index 0000000..72bb7f6
Binary files /dev/null and b/audio/25.wav differ
diff --git a/audio/26.wav b/audio/26.wav
new file mode 100644
index 0000000..dbffdca
Binary files /dev/null and b/audio/26.wav differ
diff --git a/audio/27.wav b/audio/27.wav
new file mode 100644
index 0000000..181a93e
Binary files /dev/null and b/audio/27.wav differ
diff --git a/audio/28.wav b/audio/28.wav
new file mode 100644
index 0000000..efbad86
Binary files /dev/null and b/audio/28.wav differ
diff --git a/audio/29.wav b/audio/29.wav
new file mode 100644
index 0000000..93e19d1
Binary files /dev/null and b/audio/29.wav differ
diff --git a/audio/3.wav b/audio/3.wav
new file mode 100644
index 0000000..9eea519
Binary files /dev/null and b/audio/3.wav differ
diff --git a/audio/30.wav b/audio/30.wav
new file mode 100644
index 0000000..3f2380d
Binary files /dev/null and b/audio/30.wav differ
diff --git a/audio/31.wav b/audio/31.wav
new file mode 100644
index 0000000..d7a9abd
Binary files /dev/null and b/audio/31.wav differ
diff --git a/audio/32.wav b/audio/32.wav
new file mode 100644
index 0000000..d851dae
Binary files /dev/null and b/audio/32.wav differ
diff --git a/audio/33.wav b/audio/33.wav
new file mode 100644
index 0000000..a5dc421
Binary files /dev/null and b/audio/33.wav differ
diff --git a/audio/34.wav b/audio/34.wav
new file mode 100644
index 0000000..a6be5dd
Binary files /dev/null and b/audio/34.wav differ
diff --git a/audio/35.wav b/audio/35.wav
new file mode 100644
index 0000000..4d274f6
Binary files /dev/null and b/audio/35.wav differ
diff --git a/audio/36.wav b/audio/36.wav
new file mode 100644
index 0000000..354b122
Binary files /dev/null and b/audio/36.wav differ
diff --git a/audio/37.wav b/audio/37.wav
new file mode 100644
index 0000000..a715799
Binary files /dev/null and b/audio/37.wav differ
diff --git a/audio/38.wav b/audio/38.wav
new file mode 100644
index 0000000..bf6b2a7
Binary files /dev/null and b/audio/38.wav differ
diff --git a/audio/39.wav b/audio/39.wav
new file mode 100644
index 0000000..92a7833
Binary files /dev/null and b/audio/39.wav differ
diff --git a/audio/4.wav b/audio/4.wav
new file mode 100644
index 0000000..64812ff
Binary files /dev/null and b/audio/4.wav differ
diff --git a/audio/40.wav b/audio/40.wav
new file mode 100644
index 0000000..9736fce
Binary files /dev/null and b/audio/40.wav differ
diff --git a/audio/41.wav b/audio/41.wav
new file mode 100644
index 0000000..1260bfa
Binary files /dev/null and b/audio/41.wav differ
diff --git a/audio/42.wav b/audio/42.wav
new file mode 100644
index 0000000..fd870f5
Binary files /dev/null and b/audio/42.wav differ
diff --git a/audio/43.wav b/audio/43.wav
new file mode 100644
index 0000000..57d852d
Binary files /dev/null and b/audio/43.wav differ
diff --git a/audio/44.wav b/audio/44.wav
new file mode 100644
index 0000000..5535b0f
Binary files /dev/null and b/audio/44.wav differ
diff --git a/audio/45.wav b/audio/45.wav
new file mode 100644
index 0000000..e1e4a2f
Binary files /dev/null and b/audio/45.wav differ
diff --git a/audio/46.wav b/audio/46.wav
new file mode 100644
index 0000000..da6edff
Binary files /dev/null and b/audio/46.wav differ
diff --git a/audio/47.wav b/audio/47.wav
new file mode 100644
index 0000000..14db4a5
Binary files /dev/null and b/audio/47.wav differ
diff --git a/audio/48.wav b/audio/48.wav
new file mode 100644
index 0000000..339427d
Binary files /dev/null and b/audio/48.wav differ
diff --git a/audio/49.wav b/audio/49.wav
new file mode 100644
index 0000000..396531a
Binary files /dev/null and b/audio/49.wav differ
diff --git a/audio/5.wav b/audio/5.wav
new file mode 100644
index 0000000..9600c50
Binary files /dev/null and b/audio/5.wav differ
diff --git a/audio/50.wav b/audio/50.wav
new file mode 100644
index 0000000..e0e90e4
Binary files /dev/null and b/audio/50.wav differ
diff --git a/audio/51.wav b/audio/51.wav
new file mode 100644
index 0000000..48a1beb
Binary files /dev/null and b/audio/51.wav differ
diff --git a/audio/52.wav b/audio/52.wav
new file mode 100644
index 0000000..fbe533b
Binary files /dev/null and b/audio/52.wav differ
diff --git a/audio/53.wav b/audio/53.wav
new file mode 100644
index 0000000..64d4a4e
Binary files /dev/null and b/audio/53.wav differ
diff --git a/audio/54.wav b/audio/54.wav
new file mode 100644
index 0000000..d45c90f
Binary files /dev/null and b/audio/54.wav differ
diff --git a/audio/55.wav b/audio/55.wav
new file mode 100644
index 0000000..0a94c98
Binary files /dev/null and b/audio/55.wav differ
diff --git a/audio/56.wav b/audio/56.wav
new file mode 100644
index 0000000..30bfc5f
Binary files /dev/null and b/audio/56.wav differ
diff --git a/audio/57.wav b/audio/57.wav
new file mode 100644
index 0000000..df09567
Binary files /dev/null and b/audio/57.wav differ
diff --git a/audio/58.wav b/audio/58.wav
new file mode 100644
index 0000000..43e7fc4
Binary files /dev/null and b/audio/58.wav differ
diff --git a/audio/59.wav b/audio/59.wav
new file mode 100644
index 0000000..e2d1008
Binary files /dev/null and b/audio/59.wav differ
diff --git a/audio/6.wav b/audio/6.wav
new file mode 100644
index 0000000..d23db58
Binary files /dev/null and b/audio/6.wav differ
diff --git a/audio/60.wav b/audio/60.wav
new file mode 100644
index 0000000..3d570c4
Binary files /dev/null and b/audio/60.wav differ
diff --git a/audio/61.wav b/audio/61.wav
new file mode 100644
index 0000000..94f2431
Binary files /dev/null and b/audio/61.wav differ
diff --git a/audio/62.wav b/audio/62.wav
new file mode 100644
index 0000000..64dc3d9
Binary files /dev/null and b/audio/62.wav differ
diff --git a/audio/63.wav b/audio/63.wav
new file mode 100644
index 0000000..46fecf1
Binary files /dev/null and b/audio/63.wav differ
diff --git a/audio/64.wav b/audio/64.wav
new file mode 100644
index 0000000..cb0fbb4
Binary files /dev/null and b/audio/64.wav differ
diff --git a/audio/65.wav b/audio/65.wav
new file mode 100644
index 0000000..9737f95
Binary files /dev/null and b/audio/65.wav differ
diff --git a/audio/66.wav b/audio/66.wav
new file mode 100644
index 0000000..9e72170
Binary files /dev/null and b/audio/66.wav differ
diff --git a/audio/67.wav b/audio/67.wav
new file mode 100644
index 0000000..f5286f9
Binary files /dev/null and b/audio/67.wav differ
diff --git a/audio/68.wav b/audio/68.wav
new file mode 100644
index 0000000..8208cf6
Binary files /dev/null and b/audio/68.wav differ
diff --git a/audio/69.wav b/audio/69.wav
new file mode 100644
index 0000000..0ec2de3
Binary files /dev/null and b/audio/69.wav differ
diff --git a/audio/7.wav b/audio/7.wav
new file mode 100644
index 0000000..98312cf
Binary files /dev/null and b/audio/7.wav differ
diff --git a/audio/70.wav b/audio/70.wav
new file mode 100644
index 0000000..da9efe5
Binary files /dev/null and b/audio/70.wav differ
diff --git a/audio/71.wav b/audio/71.wav
new file mode 100644
index 0000000..8e3b7b3
Binary files /dev/null and b/audio/71.wav differ
diff --git a/audio/72.wav b/audio/72.wav
new file mode 100644
index 0000000..768264b
Binary files /dev/null and b/audio/72.wav differ
diff --git a/audio/73.wav b/audio/73.wav
new file mode 100644
index 0000000..1b75c6b
Binary files /dev/null and b/audio/73.wav differ
diff --git a/audio/74.wav b/audio/74.wav
new file mode 100644
index 0000000..6fb44de
Binary files /dev/null and b/audio/74.wav differ
diff --git a/audio/75.wav b/audio/75.wav
new file mode 100644
index 0000000..9f5dd55
Binary files /dev/null and b/audio/75.wav differ
diff --git a/audio/76.wav b/audio/76.wav
new file mode 100644
index 0000000..202f404
Binary files /dev/null and b/audio/76.wav differ
diff --git a/audio/77.wav b/audio/77.wav
new file mode 100644
index 0000000..1da15c9
Binary files /dev/null and b/audio/77.wav differ
diff --git a/audio/78.wav b/audio/78.wav
new file mode 100644
index 0000000..f8cc5f3
Binary files /dev/null and b/audio/78.wav differ
diff --git a/audio/79.wav b/audio/79.wav
new file mode 100644
index 0000000..fe452ef
Binary files /dev/null and b/audio/79.wav differ
diff --git a/audio/8.wav b/audio/8.wav
new file mode 100644
index 0000000..ba31fd5
Binary files /dev/null and b/audio/8.wav differ
diff --git a/audio/80.wav b/audio/80.wav
new file mode 100644
index 0000000..d91b2ab
Binary files /dev/null and b/audio/80.wav differ
diff --git a/audio/81.wav b/audio/81.wav
new file mode 100644
index 0000000..7d9aebb
Binary files /dev/null and b/audio/81.wav differ
diff --git a/audio/82.wav b/audio/82.wav
new file mode 100644
index 0000000..c17c8e8
Binary files /dev/null and b/audio/82.wav differ
diff --git a/audio/83.wav b/audio/83.wav
new file mode 100644
index 0000000..c6b5d19
Binary files /dev/null and b/audio/83.wav differ
diff --git a/audio/84.wav b/audio/84.wav
new file mode 100644
index 0000000..f003014
Binary files /dev/null and b/audio/84.wav differ
diff --git a/audio/85.wav b/audio/85.wav
new file mode 100644
index 0000000..242c763
Binary files /dev/null and b/audio/85.wav differ
diff --git a/audio/86.wav b/audio/86.wav
new file mode 100644
index 0000000..f0e8981
Binary files /dev/null and b/audio/86.wav differ
diff --git a/audio/87.wav b/audio/87.wav
new file mode 100644
index 0000000..351aa52
Binary files /dev/null and b/audio/87.wav differ
diff --git a/audio/88.wav b/audio/88.wav
new file mode 100644
index 0000000..e330401
Binary files /dev/null and b/audio/88.wav differ
diff --git a/audio/89.wav b/audio/89.wav
new file mode 100644
index 0000000..e27a250
Binary files /dev/null and b/audio/89.wav differ
diff --git a/audio/9.wav b/audio/9.wav
new file mode 100644
index 0000000..ed4b198
Binary files /dev/null and b/audio/9.wav differ
diff --git a/audio/90.wav b/audio/90.wav
new file mode 100644
index 0000000..18094e6
Binary files /dev/null and b/audio/90.wav differ
diff --git a/audio/91.wav b/audio/91.wav
new file mode 100644
index 0000000..a9e3119
Binary files /dev/null and b/audio/91.wav differ
diff --git a/audio/92.wav b/audio/92.wav
new file mode 100644
index 0000000..0f444b2
Binary files /dev/null and b/audio/92.wav differ
diff --git a/audio/93.wav b/audio/93.wav
new file mode 100644
index 0000000..86e1adc
Binary files /dev/null and b/audio/93.wav differ
diff --git a/audio/94.wav b/audio/94.wav
new file mode 100644
index 0000000..67bfa97
Binary files /dev/null and b/audio/94.wav differ
diff --git a/audio/95.wav b/audio/95.wav
new file mode 100644
index 0000000..572c76a
Binary files /dev/null and b/audio/95.wav differ
diff --git a/audio/96.wav b/audio/96.wav
new file mode 100644
index 0000000..6c1c7a3
Binary files /dev/null and b/audio/96.wav differ
diff --git a/audio/97.wav b/audio/97.wav
new file mode 100644
index 0000000..5ba673a
Binary files /dev/null and b/audio/97.wav differ
diff --git a/audio/98.wav b/audio/98.wav
new file mode 100644
index 0000000..7404a9b
Binary files /dev/null and b/audio/98.wav differ
diff --git a/audio/99.wav b/audio/99.wav
new file mode 100644
index 0000000..36d5fd1
Binary files /dev/null and b/audio/99.wav differ
diff --git a/demo.html b/demo.html
new file mode 100644
index 0000000..62aac79
--- /dev/null
+++ b/demo.html
@@ -0,0 +1,1744 @@
+
+
+
+
+
+
+
+ Consistency TTA Demo Page
+
+
+
+
+
Demo Page
+
Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation
+ Yatong Bai,
+ Trung Dang, Dung Tran, Kazuhito Koishida, Somayeh Sojoudi
+
This demonstration page presents the generations from 50 randomly selected prompts from the AudioCaps test set.
+
We present four audio sources: the consistency model without CLAP fine-tuning,
+ the consistency model fine-tuned with CLAP, the diffusion baseline model (TANGO), and the ground truth.
+
The diffusion baseline queries the neural network 400 times per audio clip,
+ while the consistency models query a network of the same size only once.
+
+
+
Prompt 0
+
Whistling followed by a child giggling and then Moe whistling.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 1
+
Some clanking and banging and a man speaking.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 2
+
A man speaking on a microphone as a crowd of people laugh followed by dinner plates clacking.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 3
+
Steam hissing followed by a train whistle blowing and a group of people talking in the background.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 4
+
A vehicle revving and accelerating as tires skid and squeak on a road.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 5
+
Steam escapes with a hissing noise.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 6
+
A man speaking continuously.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 7
+
Knocking sounds as race cars pass by.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 8
+
A man talking followed by plastic clacking then a power tool drilling.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 9
+
Humming of an engine with a woman speaking over a loudspeaker.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 10
+
A telephone ringing with loud echo.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 11
+
Released air hissing followed by a popping explosion then a metal ding persists as a person is laughing and a man is talking.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 12
+
Constant hissing with mean having conversation.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 13
+
A missile launching followed by an explosion and metal screeching as a motor hums in the background.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 14
+
An adult female speaks as a cat meows three times, and an electronic device plays in the background.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 15
+
Food and oil sizzling.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 16
+
Some light tapping on a computer keyboard and a baby crying.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 17
+
An electronic beep followed by a man talking.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 18
+
Sanding and filing then a man speaks.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 19
+
An aircraft engine humming followed by plastic clanking then an aircraft engine slowing down.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 20
+
Footsteps and scuffing occur, after which a door grinds, squeaks and clicks, an adult male speaks, and the door grinds, squeaks and clicks shut with a thump.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 21
+
A train horn blowing multiple times as a train runs on railroad tracks while a man and a young kid talk in the background alongside birds cooing in the distance.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 22
+
Strong gusts of wind are followed by cheers and shouts from several people plus the chatter of a girl.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 23
+
Compressed air and steam releasing with a man faintly talking in the background.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 24
+
A man talking followed by a goat baaing then a metal gate sliding shut as ducks quack and wind blows into a microphone.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 25
+
A cat is meowing.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 26
+
A toilet is flushing followed by a cat meowing.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 27
+
A person speaks with distant humming and nearby clinking.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 28
+
A dog whimpering followed by laughing and barking.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 29
+
A vehicle driving by with tires briefly skidding and accelerating then slowing down.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 30
+
A horn and then an engine revving.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 31
+
Several people cheer and scream and speak as water flows hard.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 32
+
A person whistles to music.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 33
+
Laughing and speech in a slowed speed.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 34
+
A man speaking as insects are buzzing and wind is blowing into a microphone.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 35
+
Wind followed by splashing of water.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 36
+
A person whistling.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 37
+
Wood being scraped along with mechanical sounds.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 38
+
A woman gives a speech.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 39
+
A cat is meowing in a quiet environment.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 40
+
Wind blowing and a siren rings.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 41
+
Static and beeping.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 42
+
Musical whistling with wind blowing.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 43
+
An idle motorbike engine running.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 44
+
A jackhammer drilling and vibrating continuously.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 45
+
A train is passing by and sounds its whistle.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 46
+
A motorboat engine running as water splashes and a man shouts followed by birds chirping in the background.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 47
+
A high frequency motor hums loudly and splashes water.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 48
+
A series of sharp, squeaky snoring noises.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
Prompt 49
+
A bus horn honking as wind is blowing into a microphone before a bus drives by.
+
+
+
Consistency model
+
+
+
+
Consistency model + CLAP-FT
+
+
+
+
Diffusion baseline (TANGO)
+
+
+
+
Ground truth
+
+
+
+
+
+
+
+
+
+
diff --git a/demo_audio/0_Consistency model + CLAP-FT.wav b/demo_audio/0_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..5b6ec55
Binary files /dev/null and b/demo_audio/0_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/0_Consistency model.wav b/demo_audio/0_Consistency model.wav
new file mode 100644
index 0000000..85b15f6
Binary files /dev/null and b/demo_audio/0_Consistency model.wav differ
diff --git a/demo_audio/0_Diffusion baseline (TANGO).wav b/demo_audio/0_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..093f871
Binary files /dev/null and b/demo_audio/0_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/0_Ground truth.wav b/demo_audio/0_Ground truth.wav
new file mode 100644
index 0000000..efc760b
Binary files /dev/null and b/demo_audio/0_Ground truth.wav differ
diff --git a/demo_audio/10_Consistency model + CLAP-FT.wav b/demo_audio/10_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..692d7d3
Binary files /dev/null and b/demo_audio/10_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/10_Consistency model.wav b/demo_audio/10_Consistency model.wav
new file mode 100644
index 0000000..679e0ef
Binary files /dev/null and b/demo_audio/10_Consistency model.wav differ
diff --git a/demo_audio/10_Diffusion baseline (TANGO).wav b/demo_audio/10_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..1b3bc4a
Binary files /dev/null and b/demo_audio/10_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/10_Ground truth.wav b/demo_audio/10_Ground truth.wav
new file mode 100644
index 0000000..51f88ed
Binary files /dev/null and b/demo_audio/10_Ground truth.wav differ
diff --git a/demo_audio/11_Consistency model + CLAP-FT.wav b/demo_audio/11_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..12c0d38
Binary files /dev/null and b/demo_audio/11_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/11_Consistency model.wav b/demo_audio/11_Consistency model.wav
new file mode 100644
index 0000000..b687669
Binary files /dev/null and b/demo_audio/11_Consistency model.wav differ
diff --git a/demo_audio/11_Diffusion baseline (TANGO).wav b/demo_audio/11_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..0c1ffa0
Binary files /dev/null and b/demo_audio/11_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/11_Ground truth.wav b/demo_audio/11_Ground truth.wav
new file mode 100644
index 0000000..415eb8c
Binary files /dev/null and b/demo_audio/11_Ground truth.wav differ
diff --git a/demo_audio/12_Consistency model + CLAP-FT.wav b/demo_audio/12_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..3fdc9a3
Binary files /dev/null and b/demo_audio/12_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/12_Consistency model.wav b/demo_audio/12_Consistency model.wav
new file mode 100644
index 0000000..9cf8250
Binary files /dev/null and b/demo_audio/12_Consistency model.wav differ
diff --git a/demo_audio/12_Diffusion baseline (TANGO).wav b/demo_audio/12_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..0443c76
Binary files /dev/null and b/demo_audio/12_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/12_Ground truth.wav b/demo_audio/12_Ground truth.wav
new file mode 100644
index 0000000..691c7c5
Binary files /dev/null and b/demo_audio/12_Ground truth.wav differ
diff --git a/demo_audio/13_Consistency model + CLAP-FT.wav b/demo_audio/13_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..f41f10f
Binary files /dev/null and b/demo_audio/13_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/13_Consistency model.wav b/demo_audio/13_Consistency model.wav
new file mode 100644
index 0000000..1c58a1d
Binary files /dev/null and b/demo_audio/13_Consistency model.wav differ
diff --git a/demo_audio/13_Diffusion baseline (TANGO).wav b/demo_audio/13_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..6e988ce
Binary files /dev/null and b/demo_audio/13_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/13_Ground truth.wav b/demo_audio/13_Ground truth.wav
new file mode 100644
index 0000000..926aa8a
Binary files /dev/null and b/demo_audio/13_Ground truth.wav differ
diff --git a/demo_audio/14_Consistency model + CLAP-FT.wav b/demo_audio/14_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..4789070
Binary files /dev/null and b/demo_audio/14_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/14_Consistency model.wav b/demo_audio/14_Consistency model.wav
new file mode 100644
index 0000000..75e939b
Binary files /dev/null and b/demo_audio/14_Consistency model.wav differ
diff --git a/demo_audio/14_Diffusion baseline (TANGO).wav b/demo_audio/14_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..a6959da
Binary files /dev/null and b/demo_audio/14_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/14_Ground truth.wav b/demo_audio/14_Ground truth.wav
new file mode 100644
index 0000000..189bbdc
Binary files /dev/null and b/demo_audio/14_Ground truth.wav differ
diff --git a/demo_audio/15_Consistency model + CLAP-FT.wav b/demo_audio/15_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..4645df7
Binary files /dev/null and b/demo_audio/15_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/15_Consistency model.wav b/demo_audio/15_Consistency model.wav
new file mode 100644
index 0000000..1672b5d
Binary files /dev/null and b/demo_audio/15_Consistency model.wav differ
diff --git a/demo_audio/15_Diffusion baseline (TANGO).wav b/demo_audio/15_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..739c03d
Binary files /dev/null and b/demo_audio/15_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/15_Ground truth.wav b/demo_audio/15_Ground truth.wav
new file mode 100644
index 0000000..e72510d
Binary files /dev/null and b/demo_audio/15_Ground truth.wav differ
diff --git a/demo_audio/16_Consistency model + CLAP-FT.wav b/demo_audio/16_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..bc5487b
Binary files /dev/null and b/demo_audio/16_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/16_Consistency model.wav b/demo_audio/16_Consistency model.wav
new file mode 100644
index 0000000..3449fb3
Binary files /dev/null and b/demo_audio/16_Consistency model.wav differ
diff --git a/demo_audio/16_Diffusion baseline (TANGO).wav b/demo_audio/16_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..00e8d5d
Binary files /dev/null and b/demo_audio/16_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/16_Ground truth.wav b/demo_audio/16_Ground truth.wav
new file mode 100644
index 0000000..7246b9e
Binary files /dev/null and b/demo_audio/16_Ground truth.wav differ
diff --git a/demo_audio/17_Consistency model + CLAP-FT.wav b/demo_audio/17_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..b18afbc
Binary files /dev/null and b/demo_audio/17_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/17_Consistency model.wav b/demo_audio/17_Consistency model.wav
new file mode 100644
index 0000000..3a3159c
Binary files /dev/null and b/demo_audio/17_Consistency model.wav differ
diff --git a/demo_audio/17_Diffusion baseline (TANGO).wav b/demo_audio/17_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..4012f08
Binary files /dev/null and b/demo_audio/17_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/17_Ground truth.wav b/demo_audio/17_Ground truth.wav
new file mode 100644
index 0000000..8c61d62
Binary files /dev/null and b/demo_audio/17_Ground truth.wav differ
diff --git a/demo_audio/18_Consistency model + CLAP-FT.wav b/demo_audio/18_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..9891155
Binary files /dev/null and b/demo_audio/18_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/18_Consistency model.wav b/demo_audio/18_Consistency model.wav
new file mode 100644
index 0000000..be5ce8e
Binary files /dev/null and b/demo_audio/18_Consistency model.wav differ
diff --git a/demo_audio/18_Diffusion baseline (TANGO).wav b/demo_audio/18_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..e591866
Binary files /dev/null and b/demo_audio/18_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/18_Ground truth.wav b/demo_audio/18_Ground truth.wav
new file mode 100644
index 0000000..40b28e7
Binary files /dev/null and b/demo_audio/18_Ground truth.wav differ
diff --git a/demo_audio/19_Consistency model + CLAP-FT.wav b/demo_audio/19_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..55870d9
Binary files /dev/null and b/demo_audio/19_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/19_Consistency model.wav b/demo_audio/19_Consistency model.wav
new file mode 100644
index 0000000..8eba916
Binary files /dev/null and b/demo_audio/19_Consistency model.wav differ
diff --git a/demo_audio/19_Diffusion baseline (TANGO).wav b/demo_audio/19_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..79db6e5
Binary files /dev/null and b/demo_audio/19_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/19_Ground truth.wav b/demo_audio/19_Ground truth.wav
new file mode 100644
index 0000000..7b3574b
Binary files /dev/null and b/demo_audio/19_Ground truth.wav differ
diff --git a/demo_audio/1_Consistency model + CLAP-FT.wav b/demo_audio/1_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..daf736d
Binary files /dev/null and b/demo_audio/1_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/1_Consistency model.wav b/demo_audio/1_Consistency model.wav
new file mode 100644
index 0000000..7eb8f33
Binary files /dev/null and b/demo_audio/1_Consistency model.wav differ
diff --git a/demo_audio/1_Diffusion baseline (TANGO).wav b/demo_audio/1_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..5687eb4
Binary files /dev/null and b/demo_audio/1_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/1_Ground truth.wav b/demo_audio/1_Ground truth.wav
new file mode 100644
index 0000000..2625f4f
Binary files /dev/null and b/demo_audio/1_Ground truth.wav differ
diff --git a/demo_audio/20_Consistency model + CLAP-FT.wav b/demo_audio/20_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..14567be
Binary files /dev/null and b/demo_audio/20_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/20_Consistency model.wav b/demo_audio/20_Consistency model.wav
new file mode 100644
index 0000000..38d8c09
Binary files /dev/null and b/demo_audio/20_Consistency model.wav differ
diff --git a/demo_audio/20_Diffusion baseline (TANGO).wav b/demo_audio/20_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..ed01e51
Binary files /dev/null and b/demo_audio/20_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/20_Ground truth.wav b/demo_audio/20_Ground truth.wav
new file mode 100644
index 0000000..66d61bf
Binary files /dev/null and b/demo_audio/20_Ground truth.wav differ
diff --git a/demo_audio/21_Consistency model + CLAP-FT.wav b/demo_audio/21_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..939ff62
Binary files /dev/null and b/demo_audio/21_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/21_Consistency model.wav b/demo_audio/21_Consistency model.wav
new file mode 100644
index 0000000..5e123cb
Binary files /dev/null and b/demo_audio/21_Consistency model.wav differ
diff --git a/demo_audio/21_Diffusion baseline (TANGO).wav b/demo_audio/21_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..3e42734
Binary files /dev/null and b/demo_audio/21_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/21_Ground truth.wav b/demo_audio/21_Ground truth.wav
new file mode 100644
index 0000000..694b19f
Binary files /dev/null and b/demo_audio/21_Ground truth.wav differ
diff --git a/demo_audio/22_Consistency model + CLAP-FT.wav b/demo_audio/22_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..87fb687
Binary files /dev/null and b/demo_audio/22_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/22_Consistency model.wav b/demo_audio/22_Consistency model.wav
new file mode 100644
index 0000000..8b96845
Binary files /dev/null and b/demo_audio/22_Consistency model.wav differ
diff --git a/demo_audio/22_Diffusion baseline (TANGO).wav b/demo_audio/22_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..8c307b9
Binary files /dev/null and b/demo_audio/22_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/22_Ground truth.wav b/demo_audio/22_Ground truth.wav
new file mode 100644
index 0000000..1a4ef41
Binary files /dev/null and b/demo_audio/22_Ground truth.wav differ
diff --git a/demo_audio/23_Consistency model + CLAP-FT.wav b/demo_audio/23_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..b21bc3e
Binary files /dev/null and b/demo_audio/23_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/23_Consistency model.wav b/demo_audio/23_Consistency model.wav
new file mode 100644
index 0000000..50a342b
Binary files /dev/null and b/demo_audio/23_Consistency model.wav differ
diff --git a/demo_audio/23_Diffusion baseline (TANGO).wav b/demo_audio/23_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..64cecfb
Binary files /dev/null and b/demo_audio/23_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/23_Ground truth.wav b/demo_audio/23_Ground truth.wav
new file mode 100644
index 0000000..91d6ac8
Binary files /dev/null and b/demo_audio/23_Ground truth.wav differ
diff --git a/demo_audio/24_Consistency model + CLAP-FT.wav b/demo_audio/24_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..bfabbf3
Binary files /dev/null and b/demo_audio/24_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/24_Consistency model.wav b/demo_audio/24_Consistency model.wav
new file mode 100644
index 0000000..edc7d3a
Binary files /dev/null and b/demo_audio/24_Consistency model.wav differ
diff --git a/demo_audio/24_Diffusion baseline (TANGO).wav b/demo_audio/24_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..ede7a8d
Binary files /dev/null and b/demo_audio/24_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/24_Ground truth.wav b/demo_audio/24_Ground truth.wav
new file mode 100644
index 0000000..bed1809
Binary files /dev/null and b/demo_audio/24_Ground truth.wav differ
diff --git a/demo_audio/25_Consistency model + CLAP-FT.wav b/demo_audio/25_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..6c2e533
Binary files /dev/null and b/demo_audio/25_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/25_Consistency model.wav b/demo_audio/25_Consistency model.wav
new file mode 100644
index 0000000..02f30b1
Binary files /dev/null and b/demo_audio/25_Consistency model.wav differ
diff --git a/demo_audio/25_Diffusion baseline (TANGO).wav b/demo_audio/25_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..08c2ab5
Binary files /dev/null and b/demo_audio/25_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/25_Ground truth.wav b/demo_audio/25_Ground truth.wav
new file mode 100644
index 0000000..fe2e0cd
Binary files /dev/null and b/demo_audio/25_Ground truth.wav differ
diff --git a/demo_audio/26_Consistency model + CLAP-FT.wav b/demo_audio/26_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..5474dae
Binary files /dev/null and b/demo_audio/26_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/26_Consistency model.wav b/demo_audio/26_Consistency model.wav
new file mode 100644
index 0000000..c558701
Binary files /dev/null and b/demo_audio/26_Consistency model.wav differ
diff --git a/demo_audio/26_Diffusion baseline (TANGO).wav b/demo_audio/26_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..d91d23d
Binary files /dev/null and b/demo_audio/26_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/26_Ground truth.wav b/demo_audio/26_Ground truth.wav
new file mode 100644
index 0000000..8837931
Binary files /dev/null and b/demo_audio/26_Ground truth.wav differ
diff --git a/demo_audio/27_Consistency model + CLAP-FT.wav b/demo_audio/27_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..4cb027c
Binary files /dev/null and b/demo_audio/27_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/27_Consistency model.wav b/demo_audio/27_Consistency model.wav
new file mode 100644
index 0000000..ea85ac2
Binary files /dev/null and b/demo_audio/27_Consistency model.wav differ
diff --git a/demo_audio/27_Diffusion baseline (TANGO).wav b/demo_audio/27_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..472f8d8
Binary files /dev/null and b/demo_audio/27_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/27_Ground truth.wav b/demo_audio/27_Ground truth.wav
new file mode 100644
index 0000000..69a9cfd
Binary files /dev/null and b/demo_audio/27_Ground truth.wav differ
diff --git a/demo_audio/28_Consistency model + CLAP-FT.wav b/demo_audio/28_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..1b145f3
Binary files /dev/null and b/demo_audio/28_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/28_Consistency model.wav b/demo_audio/28_Consistency model.wav
new file mode 100644
index 0000000..2d98419
Binary files /dev/null and b/demo_audio/28_Consistency model.wav differ
diff --git a/demo_audio/28_Diffusion baseline (TANGO).wav b/demo_audio/28_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..e5c8508
Binary files /dev/null and b/demo_audio/28_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/28_Ground truth.wav b/demo_audio/28_Ground truth.wav
new file mode 100644
index 0000000..332800b
Binary files /dev/null and b/demo_audio/28_Ground truth.wav differ
diff --git a/demo_audio/29_Consistency model + CLAP-FT.wav b/demo_audio/29_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..22c6296
Binary files /dev/null and b/demo_audio/29_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/29_Consistency model.wav b/demo_audio/29_Consistency model.wav
new file mode 100644
index 0000000..981162d
Binary files /dev/null and b/demo_audio/29_Consistency model.wav differ
diff --git a/demo_audio/29_Diffusion baseline (TANGO).wav b/demo_audio/29_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..1e88a1c
Binary files /dev/null and b/demo_audio/29_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/29_Ground truth.wav b/demo_audio/29_Ground truth.wav
new file mode 100644
index 0000000..a1acf34
Binary files /dev/null and b/demo_audio/29_Ground truth.wav differ
diff --git a/demo_audio/2_Consistency model + CLAP-FT.wav b/demo_audio/2_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..a1172e8
Binary files /dev/null and b/demo_audio/2_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/2_Consistency model.wav b/demo_audio/2_Consistency model.wav
new file mode 100644
index 0000000..6891824
Binary files /dev/null and b/demo_audio/2_Consistency model.wav differ
diff --git a/demo_audio/2_Diffusion baseline (TANGO).wav b/demo_audio/2_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..1fd2cb4
Binary files /dev/null and b/demo_audio/2_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/2_Ground truth.wav b/demo_audio/2_Ground truth.wav
new file mode 100644
index 0000000..c0172f0
Binary files /dev/null and b/demo_audio/2_Ground truth.wav differ
diff --git a/demo_audio/30_Consistency model + CLAP-FT.wav b/demo_audio/30_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..8aa3060
Binary files /dev/null and b/demo_audio/30_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/30_Consistency model.wav b/demo_audio/30_Consistency model.wav
new file mode 100644
index 0000000..7644369
Binary files /dev/null and b/demo_audio/30_Consistency model.wav differ
diff --git a/demo_audio/30_Diffusion baseline (TANGO).wav b/demo_audio/30_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..6f75742
Binary files /dev/null and b/demo_audio/30_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/30_Ground truth.wav b/demo_audio/30_Ground truth.wav
new file mode 100644
index 0000000..0a16513
Binary files /dev/null and b/demo_audio/30_Ground truth.wav differ
diff --git a/demo_audio/31_Consistency model + CLAP-FT.wav b/demo_audio/31_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..c56e09b
Binary files /dev/null and b/demo_audio/31_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/31_Consistency model.wav b/demo_audio/31_Consistency model.wav
new file mode 100644
index 0000000..7fec5f1
Binary files /dev/null and b/demo_audio/31_Consistency model.wav differ
diff --git a/demo_audio/31_Diffusion baseline (TANGO).wav b/demo_audio/31_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..75028b1
Binary files /dev/null and b/demo_audio/31_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/31_Ground truth.wav b/demo_audio/31_Ground truth.wav
new file mode 100644
index 0000000..6b0fd1d
Binary files /dev/null and b/demo_audio/31_Ground truth.wav differ
diff --git a/demo_audio/32_Consistency model + CLAP-FT.wav b/demo_audio/32_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..cd2b719
Binary files /dev/null and b/demo_audio/32_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/32_Consistency model.wav b/demo_audio/32_Consistency model.wav
new file mode 100644
index 0000000..ae4319f
Binary files /dev/null and b/demo_audio/32_Consistency model.wav differ
diff --git a/demo_audio/32_Diffusion baseline (TANGO).wav b/demo_audio/32_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..9b43327
Binary files /dev/null and b/demo_audio/32_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/32_Ground truth.wav b/demo_audio/32_Ground truth.wav
new file mode 100644
index 0000000..5bc34c2
Binary files /dev/null and b/demo_audio/32_Ground truth.wav differ
diff --git a/demo_audio/33_Consistency model + CLAP-FT.wav b/demo_audio/33_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..7f90214
Binary files /dev/null and b/demo_audio/33_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/33_Consistency model.wav b/demo_audio/33_Consistency model.wav
new file mode 100644
index 0000000..847f935
Binary files /dev/null and b/demo_audio/33_Consistency model.wav differ
diff --git a/demo_audio/33_Diffusion baseline (TANGO).wav b/demo_audio/33_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..d697499
Binary files /dev/null and b/demo_audio/33_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/33_Ground truth.wav b/demo_audio/33_Ground truth.wav
new file mode 100644
index 0000000..fe7fda2
Binary files /dev/null and b/demo_audio/33_Ground truth.wav differ
diff --git a/demo_audio/34_Consistency model + CLAP-FT.wav b/demo_audio/34_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..5f93d33
Binary files /dev/null and b/demo_audio/34_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/34_Consistency model.wav b/demo_audio/34_Consistency model.wav
new file mode 100644
index 0000000..13a78d9
Binary files /dev/null and b/demo_audio/34_Consistency model.wav differ
diff --git a/demo_audio/34_Diffusion baseline (TANGO).wav b/demo_audio/34_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..9d55191
Binary files /dev/null and b/demo_audio/34_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/34_Ground truth.wav b/demo_audio/34_Ground truth.wav
new file mode 100644
index 0000000..a9cf350
Binary files /dev/null and b/demo_audio/34_Ground truth.wav differ
diff --git a/demo_audio/35_Consistency model + CLAP-FT.wav b/demo_audio/35_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..8ed7a6e
Binary files /dev/null and b/demo_audio/35_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/35_Consistency model.wav b/demo_audio/35_Consistency model.wav
new file mode 100644
index 0000000..c490981
Binary files /dev/null and b/demo_audio/35_Consistency model.wav differ
diff --git a/demo_audio/35_Diffusion baseline (TANGO).wav b/demo_audio/35_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..d3d73e0
Binary files /dev/null and b/demo_audio/35_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/35_Ground truth.wav b/demo_audio/35_Ground truth.wav
new file mode 100644
index 0000000..bf40ed4
Binary files /dev/null and b/demo_audio/35_Ground truth.wav differ
diff --git a/demo_audio/36_Consistency model + CLAP-FT.wav b/demo_audio/36_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..72817a2
Binary files /dev/null and b/demo_audio/36_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/36_Consistency model.wav b/demo_audio/36_Consistency model.wav
new file mode 100644
index 0000000..1a86295
Binary files /dev/null and b/demo_audio/36_Consistency model.wav differ
diff --git a/demo_audio/36_Diffusion baseline (TANGO).wav b/demo_audio/36_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..e90063c
Binary files /dev/null and b/demo_audio/36_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/36_Ground truth.wav b/demo_audio/36_Ground truth.wav
new file mode 100644
index 0000000..6db4101
Binary files /dev/null and b/demo_audio/36_Ground truth.wav differ
diff --git a/demo_audio/37_Consistency model + CLAP-FT.wav b/demo_audio/37_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..242ded1
Binary files /dev/null and b/demo_audio/37_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/37_Consistency model.wav b/demo_audio/37_Consistency model.wav
new file mode 100644
index 0000000..c4117f2
Binary files /dev/null and b/demo_audio/37_Consistency model.wav differ
diff --git a/demo_audio/37_Diffusion baseline (TANGO).wav b/demo_audio/37_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..985617b
Binary files /dev/null and b/demo_audio/37_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/37_Ground truth.wav b/demo_audio/37_Ground truth.wav
new file mode 100644
index 0000000..3d7527d
Binary files /dev/null and b/demo_audio/37_Ground truth.wav differ
diff --git a/demo_audio/38_Consistency model + CLAP-FT.wav b/demo_audio/38_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..0321d13
Binary files /dev/null and b/demo_audio/38_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/38_Consistency model.wav b/demo_audio/38_Consistency model.wav
new file mode 100644
index 0000000..40a4600
Binary files /dev/null and b/demo_audio/38_Consistency model.wav differ
diff --git a/demo_audio/38_Diffusion baseline (TANGO).wav b/demo_audio/38_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..1450d06
Binary files /dev/null and b/demo_audio/38_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/38_Ground truth.wav b/demo_audio/38_Ground truth.wav
new file mode 100644
index 0000000..1846f8b
Binary files /dev/null and b/demo_audio/38_Ground truth.wav differ
diff --git a/demo_audio/39_Consistency model + CLAP-FT.wav b/demo_audio/39_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..4b21e3e
Binary files /dev/null and b/demo_audio/39_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/39_Consistency model.wav b/demo_audio/39_Consistency model.wav
new file mode 100644
index 0000000..16eb4ed
Binary files /dev/null and b/demo_audio/39_Consistency model.wav differ
diff --git a/demo_audio/39_Diffusion baseline (TANGO).wav b/demo_audio/39_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..b0a2ed2
Binary files /dev/null and b/demo_audio/39_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/39_Ground truth.wav b/demo_audio/39_Ground truth.wav
new file mode 100644
index 0000000..36857e2
Binary files /dev/null and b/demo_audio/39_Ground truth.wav differ
diff --git a/demo_audio/3_Consistency model + CLAP-FT.wav b/demo_audio/3_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..19c250e
Binary files /dev/null and b/demo_audio/3_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/3_Consistency model.wav b/demo_audio/3_Consistency model.wav
new file mode 100644
index 0000000..09fd487
Binary files /dev/null and b/demo_audio/3_Consistency model.wav differ
diff --git a/demo_audio/3_Diffusion baseline (TANGO).wav b/demo_audio/3_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..81ada3d
Binary files /dev/null and b/demo_audio/3_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/3_Ground truth.wav b/demo_audio/3_Ground truth.wav
new file mode 100644
index 0000000..688da5c
Binary files /dev/null and b/demo_audio/3_Ground truth.wav differ
diff --git a/demo_audio/40_Consistency model + CLAP-FT.wav b/demo_audio/40_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..9088b87
Binary files /dev/null and b/demo_audio/40_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/40_Consistency model.wav b/demo_audio/40_Consistency model.wav
new file mode 100644
index 0000000..f742c56
Binary files /dev/null and b/demo_audio/40_Consistency model.wav differ
diff --git a/demo_audio/40_Diffusion baseline (TANGO).wav b/demo_audio/40_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..5061f2b
Binary files /dev/null and b/demo_audio/40_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/40_Ground truth.wav b/demo_audio/40_Ground truth.wav
new file mode 100644
index 0000000..d7784cd
Binary files /dev/null and b/demo_audio/40_Ground truth.wav differ
diff --git a/demo_audio/41_Consistency model + CLAP-FT.wav b/demo_audio/41_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..dd6da20
Binary files /dev/null and b/demo_audio/41_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/41_Consistency model.wav b/demo_audio/41_Consistency model.wav
new file mode 100644
index 0000000..36f56c0
Binary files /dev/null and b/demo_audio/41_Consistency model.wav differ
diff --git a/demo_audio/41_Diffusion baseline (TANGO).wav b/demo_audio/41_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..018dc7b
Binary files /dev/null and b/demo_audio/41_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/41_Ground truth.wav b/demo_audio/41_Ground truth.wav
new file mode 100644
index 0000000..639d01c
Binary files /dev/null and b/demo_audio/41_Ground truth.wav differ
diff --git a/demo_audio/42_Consistency model + CLAP-FT.wav b/demo_audio/42_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..712ae0e
Binary files /dev/null and b/demo_audio/42_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/42_Consistency model.wav b/demo_audio/42_Consistency model.wav
new file mode 100644
index 0000000..9476512
Binary files /dev/null and b/demo_audio/42_Consistency model.wav differ
diff --git a/demo_audio/42_Diffusion baseline (TANGO).wav b/demo_audio/42_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..4795072
Binary files /dev/null and b/demo_audio/42_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/42_Ground truth.wav b/demo_audio/42_Ground truth.wav
new file mode 100644
index 0000000..60e397d
Binary files /dev/null and b/demo_audio/42_Ground truth.wav differ
diff --git a/demo_audio/43_Consistency model + CLAP-FT.wav b/demo_audio/43_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..ccfd597
Binary files /dev/null and b/demo_audio/43_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/43_Consistency model.wav b/demo_audio/43_Consistency model.wav
new file mode 100644
index 0000000..52df071
Binary files /dev/null and b/demo_audio/43_Consistency model.wav differ
diff --git a/demo_audio/43_Diffusion baseline (TANGO).wav b/demo_audio/43_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..645d425
Binary files /dev/null and b/demo_audio/43_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/43_Ground truth.wav b/demo_audio/43_Ground truth.wav
new file mode 100644
index 0000000..325e92b
Binary files /dev/null and b/demo_audio/43_Ground truth.wav differ
diff --git a/demo_audio/44_Consistency model + CLAP-FT.wav b/demo_audio/44_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..f8971ce
Binary files /dev/null and b/demo_audio/44_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/44_Consistency model.wav b/demo_audio/44_Consistency model.wav
new file mode 100644
index 0000000..6252ff8
Binary files /dev/null and b/demo_audio/44_Consistency model.wav differ
diff --git a/demo_audio/44_Diffusion baseline (TANGO).wav b/demo_audio/44_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..9e3a21b
Binary files /dev/null and b/demo_audio/44_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/44_Ground truth.wav b/demo_audio/44_Ground truth.wav
new file mode 100644
index 0000000..cb00543
Binary files /dev/null and b/demo_audio/44_Ground truth.wav differ
diff --git a/demo_audio/45_Consistency model + CLAP-FT.wav b/demo_audio/45_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..87ca2f6
Binary files /dev/null and b/demo_audio/45_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/45_Consistency model.wav b/demo_audio/45_Consistency model.wav
new file mode 100644
index 0000000..4755be9
Binary files /dev/null and b/demo_audio/45_Consistency model.wav differ
diff --git a/demo_audio/45_Diffusion baseline (TANGO).wav b/demo_audio/45_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..8f9ad7d
Binary files /dev/null and b/demo_audio/45_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/45_Ground truth.wav b/demo_audio/45_Ground truth.wav
new file mode 100644
index 0000000..c53fecd
Binary files /dev/null and b/demo_audio/45_Ground truth.wav differ
diff --git a/demo_audio/46_Consistency model + CLAP-FT.wav b/demo_audio/46_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..f7f56ae
Binary files /dev/null and b/demo_audio/46_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/46_Consistency model.wav b/demo_audio/46_Consistency model.wav
new file mode 100644
index 0000000..a1f847c
Binary files /dev/null and b/demo_audio/46_Consistency model.wav differ
diff --git a/demo_audio/46_Diffusion baseline (TANGO).wav b/demo_audio/46_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..bf76cb2
Binary files /dev/null and b/demo_audio/46_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/46_Ground truth.wav b/demo_audio/46_Ground truth.wav
new file mode 100644
index 0000000..6cc01dc
Binary files /dev/null and b/demo_audio/46_Ground truth.wav differ
diff --git a/demo_audio/47_Consistency model + CLAP-FT.wav b/demo_audio/47_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..e4aafa8
Binary files /dev/null and b/demo_audio/47_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/47_Consistency model.wav b/demo_audio/47_Consistency model.wav
new file mode 100644
index 0000000..c439c01
Binary files /dev/null and b/demo_audio/47_Consistency model.wav differ
diff --git a/demo_audio/47_Diffusion baseline (TANGO).wav b/demo_audio/47_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..dea27a9
Binary files /dev/null and b/demo_audio/47_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/47_Ground truth.wav b/demo_audio/47_Ground truth.wav
new file mode 100644
index 0000000..acdec15
Binary files /dev/null and b/demo_audio/47_Ground truth.wav differ
diff --git a/demo_audio/48_Consistency model + CLAP-FT.wav b/demo_audio/48_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..5104727
Binary files /dev/null and b/demo_audio/48_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/48_Consistency model.wav b/demo_audio/48_Consistency model.wav
new file mode 100644
index 0000000..25612f7
Binary files /dev/null and b/demo_audio/48_Consistency model.wav differ
diff --git a/demo_audio/48_Diffusion baseline (TANGO).wav b/demo_audio/48_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..4809d39
Binary files /dev/null and b/demo_audio/48_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/48_Ground truth.wav b/demo_audio/48_Ground truth.wav
new file mode 100644
index 0000000..5cea701
Binary files /dev/null and b/demo_audio/48_Ground truth.wav differ
diff --git a/demo_audio/49_Consistency model + CLAP-FT.wav b/demo_audio/49_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..1f5e74d
Binary files /dev/null and b/demo_audio/49_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/49_Consistency model.wav b/demo_audio/49_Consistency model.wav
new file mode 100644
index 0000000..b412b61
Binary files /dev/null and b/demo_audio/49_Consistency model.wav differ
diff --git a/demo_audio/49_Diffusion baseline (TANGO).wav b/demo_audio/49_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..2e18032
Binary files /dev/null and b/demo_audio/49_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/49_Ground truth.wav b/demo_audio/49_Ground truth.wav
new file mode 100644
index 0000000..798cdaf
Binary files /dev/null and b/demo_audio/49_Ground truth.wav differ
diff --git a/demo_audio/4_Consistency model + CLAP-FT.wav b/demo_audio/4_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..482c4b9
Binary files /dev/null and b/demo_audio/4_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/4_Consistency model.wav b/demo_audio/4_Consistency model.wav
new file mode 100644
index 0000000..d1d2f25
Binary files /dev/null and b/demo_audio/4_Consistency model.wav differ
diff --git a/demo_audio/4_Diffusion baseline (TANGO).wav b/demo_audio/4_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..1aa6759
Binary files /dev/null and b/demo_audio/4_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/4_Ground truth.wav b/demo_audio/4_Ground truth.wav
new file mode 100644
index 0000000..fff6b62
Binary files /dev/null and b/demo_audio/4_Ground truth.wav differ
diff --git a/demo_audio/5_Consistency model + CLAP-FT.wav b/demo_audio/5_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..57a17cf
Binary files /dev/null and b/demo_audio/5_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/5_Consistency model.wav b/demo_audio/5_Consistency model.wav
new file mode 100644
index 0000000..fbfa3e9
Binary files /dev/null and b/demo_audio/5_Consistency model.wav differ
diff --git a/demo_audio/5_Diffusion baseline (TANGO).wav b/demo_audio/5_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..4840793
Binary files /dev/null and b/demo_audio/5_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/5_Ground truth.wav b/demo_audio/5_Ground truth.wav
new file mode 100644
index 0000000..6a825fb
Binary files /dev/null and b/demo_audio/5_Ground truth.wav differ
diff --git a/demo_audio/6_Consistency model + CLAP-FT.wav b/demo_audio/6_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..590f2bf
Binary files /dev/null and b/demo_audio/6_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/6_Consistency model.wav b/demo_audio/6_Consistency model.wav
new file mode 100644
index 0000000..e8220d9
Binary files /dev/null and b/demo_audio/6_Consistency model.wav differ
diff --git a/demo_audio/6_Diffusion baseline (TANGO).wav b/demo_audio/6_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..30a8cdf
Binary files /dev/null and b/demo_audio/6_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/6_Ground truth.wav b/demo_audio/6_Ground truth.wav
new file mode 100644
index 0000000..bf8e7c9
Binary files /dev/null and b/demo_audio/6_Ground truth.wav differ
diff --git a/demo_audio/7_Consistency model + CLAP-FT.wav b/demo_audio/7_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..316d0fb
Binary files /dev/null and b/demo_audio/7_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/7_Consistency model.wav b/demo_audio/7_Consistency model.wav
new file mode 100644
index 0000000..f73eded
Binary files /dev/null and b/demo_audio/7_Consistency model.wav differ
diff --git a/demo_audio/7_Diffusion baseline (TANGO).wav b/demo_audio/7_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..6a4fea9
Binary files /dev/null and b/demo_audio/7_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/7_Ground truth.wav b/demo_audio/7_Ground truth.wav
new file mode 100644
index 0000000..6c23530
Binary files /dev/null and b/demo_audio/7_Ground truth.wav differ
diff --git a/demo_audio/8_Consistency model + CLAP-FT.wav b/demo_audio/8_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..3f71cbf
Binary files /dev/null and b/demo_audio/8_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/8_Consistency model.wav b/demo_audio/8_Consistency model.wav
new file mode 100644
index 0000000..f045688
Binary files /dev/null and b/demo_audio/8_Consistency model.wav differ
diff --git a/demo_audio/8_Diffusion baseline (TANGO).wav b/demo_audio/8_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..bef9200
Binary files /dev/null and b/demo_audio/8_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/8_Ground truth.wav b/demo_audio/8_Ground truth.wav
new file mode 100644
index 0000000..78aafba
Binary files /dev/null and b/demo_audio/8_Ground truth.wav differ
diff --git a/demo_audio/9_Consistency model + CLAP-FT.wav b/demo_audio/9_Consistency model + CLAP-FT.wav
new file mode 100644
index 0000000..4c7c089
Binary files /dev/null and b/demo_audio/9_Consistency model + CLAP-FT.wav differ
diff --git a/demo_audio/9_Consistency model.wav b/demo_audio/9_Consistency model.wav
new file mode 100644
index 0000000..d8506ec
Binary files /dev/null and b/demo_audio/9_Consistency model.wav differ
diff --git a/demo_audio/9_Diffusion baseline (TANGO).wav b/demo_audio/9_Diffusion baseline (TANGO).wav
new file mode 100644
index 0000000..3f48a1d
Binary files /dev/null and b/demo_audio/9_Diffusion baseline (TANGO).wav differ
diff --git a/demo_audio/9_Ground truth.wav b/demo_audio/9_Ground truth.wav
new file mode 100644
index 0000000..99aea05
Binary files /dev/null and b/demo_audio/9_Ground truth.wav differ
diff --git a/evaluation.html b/evaluation.html
new file mode 100644
index 0000000..0e15eee
--- /dev/null
+++ b/evaluation.html
@@ -0,0 +1,2760 @@
+
+
+
+
+ Consistency TTA Model Human Eval
+
+
+
+
+
+
+
+
Example Human Evaluation Form
+ Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation
+
+ Since the generative models were not trained on speech data,
+ they are expected to generate unintelligible speech.
+ Therefore, please DO NOT consider the intelligibility of speech as a part of the criteria
+ (the voice quality can be taken into consideration).
+
+
Criteria for audio-text correspondence
+
+ The ratings are defined as follows:
+
+ 5 - Excellent.
+ 4 - Temporal mismatch or other slight mismatches.
+ E.g., the prompt says one sound after another, but the audio has them simultaneously.
+ 3 - One of the sound components missing/redundant/incorrect.
+ E.g., the prompt requests four sound components, but the audio only has three, or vice versa;
+ the prompt asks for one person speaking but there are two people in the audio.
+ 2 - More than one sound component missing/redundant/incorrect.
+ 1 - Totally incorrect.
+
+
Before starting the rating, clear the browser local storage using the following button.
+
+
+
After completing the ratings, click the following button to download the data into a CSV.
+ There is also a copy of this button at the bottom of the page.
+
+
+
+
Prompt 0
+
Rain and thunder
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 1
+
A loud bang followed by an engine idling loudly
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 2
+
A man speaking while water runs in the background
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 3
+
An electric motor runs then a person speaks
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 4
+
A helicopter engine operating while wind blows heavily into a microphone
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 5
+
A sewing machine sews followed by a man talking
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 6
+
A woman talks briefly as several goats bleat including one that has high pitched bleats. A crunch is followed by a man speaking
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 7
+
High pressure liquid spraying as a radio plays in the background
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 8
+
Male speech and then scraping
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 9
+
Mechanical rotation and then a loud click occurs
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 10
+
A loud bang followed by an engine idling loudly
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 11
+
Humming from a large engine
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 12
+
A motor vehicle engine is revving
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 13
+
A bus engine driving in the distance then nearby followed by compressed air releasing while a woman and a child talk in the distance
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 14
+
A woman speaks, and a motor vehicle revs its engine
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 15
+
A vehicle accelerating then driving by as gusts of wind blow and leaves rustle in the distance
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 16
+
A car engine idling then starts to rev shortly after
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 17
+
Rain and thunder
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 18
+
A man talking followed by a camera muffling and footsteps shuffling then wood lightly clanking
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 19
+
An electric motor runs then a person speaks
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 20
+
A helicopter engine operating while wind blows heavily into a microphone
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 21
+
Mechanical rotation and then a loud click occurs
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 22
+
A machine motor running as a man is speaking followed by rapid buzzing
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 23
+
A vehicle accelerating then driving by as gusts of wind blow and leaves rustle in the distance
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 24
+
Train passing followed by short honk
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 25
+
A woman speaks, and a motor vehicle revs its engine
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 26
+
Several puppies yapping
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 27
+
A person gulping followed by glass tapping then liquid shaking in a container proceeded by liquid pouring before plastic thumps on paper
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 28
+
A nearby insect buzzes with nearby vibrations
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 29
+
A bus engine driving in the distance then nearby followed by compressed air releasing while a woman and a child talk in the distance
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 30
+
A bus engine driving in the distance then nearby followed by compressed air releasing while a woman and a child talk in the distance
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 31
+
High pressure liquid spraying as a radio plays in the background
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 32
+
A loud bang followed by an engine idling loudly
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 33
+
Mechanical rotation and then a loud click occurs
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 34
+
A motor vehicle engine is revving
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 35
+
A woman speaks, and a motor vehicle revs its engine
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 36
+
An electric motor runs then a person speaks
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 37
+
A man speaking while water runs in the background
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 38
+
Man talking in the wind and someone yells in the background while an engine makes squealing and air puffing sounds
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 39
+
A person gulping followed by glass tapping then liquid shaking in a container proceeded by liquid pouring before plastic thumps on paper
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 40
+
Male speech and then scraping
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 41
+
Mechanical rotation and then a loud click occurs
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 42
+
Several puppies yapping
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 43
+
Train passing followed by short honk
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 44
+
An baby laughing
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 45
+
Humming from a large engine
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 46
+
An baby laughing
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 47
+
A man speaking while water runs in the background
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 48
+
A man talking followed by a camera muffling and footsteps shuffling then wood lightly clanking
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 49
+
A horse gallops then trot on grass as gusts of wind blow and thunderclaps in the distance
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 50
+
A sewing machine sews followed by a man talking
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 51
+
An baby laughing
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 52
+
A horse gallops then trot on grass as gusts of wind blow and thunderclaps in the distance
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 53
+
Train passing followed by short honk
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 54
+
A man speaking while water runs in the background
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 55
+
Several puppies yapping
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 56
+
Several puppies yapping
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 57
+
A person gulping followed by glass tapping then liquid shaking in a container proceeded by liquid pouring before plastic thumps on paper
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 58
+
A woman talks briefly as several goats bleat including one that has high pitched bleats. A crunch is followed by a man speaking
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 59
+
Rain and thunder
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 60
+
Humming from a large engine
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 61
+
A car engine idling then starts to rev shortly after
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 62
+
High pressure liquid spraying as a radio plays in the background
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 63
+
A woman speaks, and a motor vehicle revs its engine
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 64
+
A nearby insect buzzes with nearby vibrations
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 65
+
Train passing followed by short honk
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 66
+
Rain and thunder
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 67
+
A bus engine driving in the distance then nearby followed by compressed air releasing while a woman and a child talk in the distance
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 68
+
Male speech and then scraping
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 69
+
An electric motor runs then a person speaks
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 70
+
A machine motor running as a man is speaking followed by rapid buzzing
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 71
+
A vehicle accelerating then driving by as gusts of wind blow and leaves rustle in the distance
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 72
+
A machine motor running as a man is speaking followed by rapid buzzing
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 73
+
A car engine idling then starts to rev shortly after
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 74
+
A helicopter engine operating while wind blows heavily into a microphone
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 75
+
A man talking followed by a camera muffling and footsteps shuffling then wood lightly clanking
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 76
+
A vehicle accelerating then driving by as gusts of wind blow and leaves rustle in the distance
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 77
+
A motor vehicle engine is revving
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 78
+
High pressure liquid spraying as a radio plays in the background
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 79
+
Man talking in the wind and someone yells in the background while an engine makes squealing and air puffing sounds
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 80
+
A woman talks briefly as several goats bleat including one that has high pitched bleats. A crunch is followed by a man speaking
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 81
+
A sewing machine sews followed by a man talking
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 82
+
A machine motor running as a man is speaking followed by rapid buzzing
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 83
+
A loud bang followed by an engine idling loudly
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 84
+
Man talking in the wind and someone yells in the background while an engine makes squealing and air puffing sounds
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 85
+
Male speech and then scraping
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 86
+
A baby laughing
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 87
+
A nearby insect buzzes with nearby vibrations
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 88
+
A horse gallops then trot on grass as gusts of wind blow and thunderclaps in the distance
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 89
+
Humming from a large engine
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 90
+
A nearby insect buzzes with nearby vibrations
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 91
+
A motor vehicle engine is revving
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 92
+
A car engine idling then starts to rev shortly after
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 93
+
A helicopter engine operating while wind blows heavily into a microphone
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 94
+
A horse gallops then trot on grass as gusts of wind blow and thunderclaps in the distance
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 95
+
A man talking followed by a camera muffling and footsteps shuffling then wood lightly clanking
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 96
+
A sewing machine sews followed by a man talking
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 97
+
A person gulping followed by glass tapping then liquid shaking in a container proceeded by liquid pouring before plastic thumps on paper
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 98
+
Man talking in the wind and someone yells in the background while an engine makes squealing and air puffing sounds
+
+
+
Rate on overall audio quality.
+
+
+
+
+
+
+
+
+
+
Rate on audio-text correspondence.
+
+
+
+
+
+
+
+
+
+
+
Prompt 99
+
A woman talks briefly as several goats bleat including one that has high pitched bleats. A crunch is followed by a man speaking
+ Diffusion models power the vast majority of text-to-audio generation methods.
+ Unfortunately, diffusion models suffer from slow inference because they iteratively query the
+ underlying denoising network, making them unsuitable for applications with time or computational constraints.
+ This work modifies the recently proposed "consistency distillation" framework to train text-to-audio
+ models that require only a single neural network query, accelerating generation by hundreds of times.
+
+
+ By incorporating classifier-free guidance into the distillation framework, our models retain
+ diffusion models' impressive generation quality and diversity. Furthermore, the non-recurrent
+ differentiable structure resulting from the distillation allows fine-tuning with novel loss functions.
+ We use the CLAP loss as an example, confirming that end-to-end fine-tuning
+ further boosts the generation quality.
+
+
+
+
+
BibTeX
+
+
+
@article{bai2023accelerating,
+ author = {Yatong Bai and Trung Dang and Dung Tran and Kazuhito Koishida and Somayeh Sojoudi},
+ title = {Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation},
+ journal={arXiv preprint arXiv:2309.10740},
+ year = {2023}
+}
+
+
+
+
+
Contact
+
+ For any questions regarding our work, please email
+ yatong_bai@berkeley.edu.
+ We are more than happy to help with implementing our method and verifying our results.
+
+
+
+
+
+
+
+
diff --git a/js_script.js b/js_script.js
new file mode 100644
index 0000000..2c7ce81
--- /dev/null
+++ b/js_script.js
@@ -0,0 +1,111 @@
+function saveRating(rating, thing, aspect) {
+ // Construct the key for localStorage
+ let key = thing + '-' + aspect;
+
+ // Save the rating to localStorage
+ localStorage.setItem(key, rating);
+
+ // Reset color of all buttons for the current thing and aspect
+ let buttons = document.querySelectorAll('#' + key + ' .eval_rating_button');
+ buttons.forEach(button => button.classList.remove("clicked"));
+
+ // Change color of clicked button
+ let button = buttons[rating - 1];
+ button.classList.add("clicked");
+
+ // Display message
+ let messageElement = document.getElementById("message-" + key);
+ messageElement.textContent = "The rating of " + rating + " has been received.";
+}
+
+
+function clearRatings() {
+ // Confirm with the user before clearing ratings
+ if (confirm("Are you sure you want to clear all ratings?")) {
+ // Clear all data from localStorage
+ localStorage.clear();
+
+ // Reset all button colors
+ let buttons = document.querySelectorAll('.eval_rating_button');
+ buttons.forEach(button => button.classList.remove("clicked"));
+
+ // Clear all messages
+ let messages = document.querySelectorAll('[id^="message-"]');
+ messages.forEach(messageElement => messageElement.textContent = "");
+
+ alert("All ratings have been cleared.");
+ }
+}
+
+
+// When the page loads, retrieve ratings from localStorage (if any) and update button colors
+document.addEventListener('DOMContentLoaded', function() {
+ let things = ['thing1']; // Add other things to this array
+ let aspects = ['aspect1']; // Add other aspects to this array
+
+ things.forEach(thing => {
+ aspects.forEach(aspect => {
+ let rating = localStorage.getItem(thing + '-' + aspect);
+ if (rating) {
+ let buttons = document.querySelectorAll(
+ '#' + thing + '-' + aspect + ' .eval_rating_button'
+ );
+ buttons[rating - 1]?.classList.add("clicked"); // Skip stored ratings that have no matching button
+ }
+ });
+ });
+});
+
+
+function downloadLocalStorageData(name) {
+ // Create a CSV string
+ let csvContent = "Index,Aspect,Rating\n";
+
+ for (let i = 0; i < localStorage.length; i++) {
+ let key = localStorage.key(i);
+ let value = localStorage.getItem(key);
+
+ let [thing, aspect] = key.split('-');
+ csvContent += thing + "," + aspect + "," + value + "\n";
+ }
+
+ // Create a blob from the CSV string
+ let blob = new Blob([csvContent], { type: "text/csv;charset=utf-8" });
+
+ // Create a download link and trigger it
+ let link = document.createElement("a");
+ let url = URL.createObjectURL(blob);
+ link.setAttribute("href", url);
+ link.setAttribute("download", name + ".csv");
+ document.body.appendChild(link);
+ link.click();
+ document.body.removeChild(link);
+}
+
+
+// Function to set button colors based on localStorage values
+function setButtonColorsFromLocalStorage() {
+ for (let i = 0; i < localStorage.length; i++) {
+ let key = localStorage.key(i);
+ let rating = localStorage.getItem(key);
+
+ let buttons = document.querySelectorAll('#' + CSS.escape(key) + ' .eval_rating_button'); // Escape keys that are not valid selectors
+ buttons.forEach(button => button.classList.remove("clicked")); // Reset all button colors
+ buttons[rating - 1]?.classList.add("clicked"); // Set the color of the rated button, if it exists on this page
+ }
+}
+
+// Event listener to execute the function when the content is loaded
+document.addEventListener('DOMContentLoaded', setButtonColorsFromLocalStorage);
+
+function copyToClipboard(elementId) {
+ let element = document.getElementById(elementId);
+ let range = document.createRange();
+ range.selectNode(element);
+ window.getSelection().removeAllRanges();
+ window.getSelection().addRange(range);
+ document.execCommand('copy');
+ window.getSelection().removeAllRanges();
+
+ alert("BibTeX copied to clipboard!");
+}
diff --git a/report.pdf b/report.pdf
new file mode 100644
index 0000000..2b22f4a
Binary files /dev/null and b/report.pdf differ
diff --git a/styles.css b/styles.css
new file mode 100644
index 0000000..4b80e78
--- /dev/null
+++ b/styles.css
@@ -0,0 +1,196 @@
+@import url(
+ 'https://fonts.googleapis.com/css2?family=Lato:wght@300;400;700&display=swap'
+);
+@import url(
+ 'https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.1/css/all.min.css'
+);
+
+body {
+ font-family: 'Lato', sans-serif;
+ margin: 0;
+ padding: 0;
+ background-color: #f9f9f9;
+ color: #333;
+}
+
+header {
+ background-color: #264653; /* Deep Blue */
+ color: #fff;
+ text-align: center;
+ padding: 2.4rem 0;
+}
+
+header h3 {
+ font-weight: 300;
+ font-size: 1.2em;
+ margin-top: 0.8em;
+}
+
+.section {
+ font-weight: 400;
+ max-width: 850px;
+ margin: 20px auto;
+ padding: 20px;
+ padding-top: 10px;
+ box-shadow: 0 0 15px rgba(0, 0, 0, 0.1);
+ background-color: #fff;
+}
+
+a {
+ text-decoration: none;
+ margin-left: 15px; /* Add spacing between the two buttons */
+ margin-right: 15px; /* Add spacing between the two buttons */
+ color: #ffffff;
+}
+
+a:hover {
+ color: #d0d0d0;
+ border-bottom: 3px solid #c0c0c0; /* A border style is required for the border to render */
+}
+
+a:last-child {
+ margin-right: 0;
+}
+
+a[href^="mailto:"] {
+ /* styles for email links */
+ color: #264653;
+ text-decoration: underline;
+ margin-left: 0px;
+}
+
+button {
+ font-family: 'Lato', sans-serif;
+ color: #fff;
+ border: none;
+ padding: 12px 5px;
+ cursor: pointer;
+ transition: background-color 0.3s;
+ font-size: 1.2em;
+ width: 220px;
+ border-radius: 4px; /* Rounded button edges */
+}
+
+.eval_rating_button {
+ background-color: #4CAF50;
+ border: none;
+ color: white;
+ padding: 8px 20px;
+ text-align: center;
+ text-decoration: none;
+ display: inline-block;
+ font-size: 16px;
+ margin: 4px 2px;
+ cursor: pointer;
+ width: auto;
+}
+
+.clicked {
+ background-color: #008CBA;
+}
+
+.home-button {
+ background-color: #597a47;
+ box-shadow: 0 0 15px rgba(0, 0, 0, 0.1);
+}
+
+.demo-button {
+ background-color: #7a5947;
+ box-shadow: 0 0 15px rgba(0, 0, 0, 0.1);
+}
+
+.paper-button {
+ background-color: #1db6c7;
+ font-weight: bold;
+ text-shadow: 3px 3px 1.5px rgba(0, 0, 0, 0.15);
+ box-shadow: 0 0 15px rgba(0, 0, 0, 0.1);
+}
+
+.eval-button {
+ background-color: #5d5d5d;
+}
+
+.eval-button-small {
+ background-color: #5d5d5d;
+ font-size: 1em;
+ width: 160px;
+ margin-left: 20px;
+ margin-bottom: 10px;
+ padding: 8px 5px;
+}
+
+button:hover {
+ opacity: 0.75; /* A generic hover effect for all buttons */
+}
+
+footer {
+ background-color: #264653; /* Dark Desaturated Blue */
+ color: #e9e9e9;
+ text-align: center;
+ padding: 1rem 0;
+ margin-top: 40px;
+}
+
+table {
+ width: 100%;
+ max-width: 600px; /* Adjusted for vertical layout */
+ margin: 0 auto;
+ border-collapse: collapse;
+}
+
+td {
+ padding: 2px;
+ padding-left: 16px;
+ text-align: left;
+ vertical-align: middle;
+ border-right: 5px solid #eeeeee; /* Add border to the right side of each cell */
+}
+
+/* Remove the right border from the last cell of each row */
+tr td:last-child {
+ border-right: none;
+}
+
+audio.scaled {
+ height: 36px; /* Adjust this value as needed */
+}
+
+.bibtex {
+ font-family: "Courier New", monospace;
+ background-color: #f4f4f4;
+ padding: 10px;
+ margin: 10px 0;
+ border: 1px solid #ddd;
+ border-radius: 5px;
+ position: relative;
+ cursor: pointer; /* makes the area look clickable */
+ overflow: hidden; /* Ensures the copy icon is contained within the div */
+}
+
+.bibtex pre {
+ margin: 0; /* Removes default margin from <pre> */
+ font-family: inherit; /* Ensures font consistency */
+ font-size: .9em;
+ white-space: pre-wrap; /* CSS3 */
+ white-space: -moz-pre-wrap; /* Firefox */
+ white-space: -pre-wrap; /* Opera <7 */
+ white-space: -o-pre-wrap; /* Opera 7 */
+ word-wrap: break-word; /* IE */
+}
+
+.copy-icon {
+ position: absolute;
+ top: 10px;
+ right: 10px;
+ width: 20px; /* or adjust as needed */
+ color: #000; /* Color of the icon */
+ background-color: transparent; /* No fill */
+}
+
+.bibtex:hover {
+ background-color: #e8e8e8; /* subtle hover effect */
+}
+
+.bibtex:hover .copy-icon {
+ color: #007BFF; /* Change color on hover for better user feedback */
+}