Lab II. Basic dialogue management

Getting started with development environment

Preflight step 1. Create Azure account and enable speech services

Apply for free student credits https://azure.microsoft.com/en-us/free/students/. You should be able to login with your GU account.
Make sure that you are logged into the Azure portal (https://portal.azure.com/).
Create a Resource group (you can use search field):
- Subscription: Azure for students
- Resource group: any name
- Region: (Europe) North Europe
Create a Speech service:
- Name: any name
- Subscription: Azure for students
- Location: (Europe) North Europe
- Pricing tier: Free (F0)
- Resource group: group name from the previous step
Within your Speech Service go to: Resourse management → Keys and Endpoint and save your KEY 1.

Preflight step 2. Run the example project

Install NodeJS (LTS version) and Yarn
Fork the example project (this project): https://github.com/GU-CLASP/dialogue-systems-1-2024. Clone your fork to your machine.
Go to the Code/ folder.
Install all dependencies:
```
yarn
    
```
Create a file called azure.js with the following contents:
```
export const KEY = "paste your KEY 1 here";
    
```
(Note: Don’t add this file to version control.)
Run the project:
```
yarn dev
    
```
Open the link that was shown in your browser, e.g. http://localhost:5173/
Allow access to you microphone.
When you unblock the pop-up window, and reload the page you will see the state inspector; this can be useful during development.

Assignment

Task 1 (“appointment”): Implementation of the finite state machine.
Task 2 (“features”): Add some extra features to your app.
Task 3 (“improvements”): Describe the limitations of your app and try to fix them.

Task 1. “Appointment”

In this task you will need to implement the flowchart above.

I have created a starting point for you, so you basically can use the project that you forked before:

You will need to edit the state chart defined by dmMachine in ./dm.js:
You will need to extend the entities in the grammar (const grammar) to understand more names, times and dates.
There are some helper functions that you can use. Feel free to add your own functions.
You will also need to create a similar grammar to understand “yes” and “no”, but also “of course”, “no way” etc.

Task 2. Additional features

Implement an auto-start of your application after 10 seconds of inactivity.
If the user is not talking, use ASR_NOINPUT event to tell the user “I didn’t hear you” and re-raise the question.

Define an action in setup() section of your machine and use it whenever you want to say something. See XState documentation and example below. You can also do the same for listening.

Example:

say: ({ context }, params) =>
  context.ssRef.send({
    type: "SPEAK",
    value: {
      utterance: params,
    },
  }),

That’s how you execute this action:

actions: [{
  type: "say",
  params: `Hello world!`,
}]

Task 3. Improvements

Write a report (max 1 page) which describes errors and limitation of your app.
Try to fix some of them and briefly describe your solution in the report. You don’t have to fix all the limitations.
Add your report to the repository in PDF format (report-lab2.pdf)

Resources

XState documentation
SpeechState documentation

Git docs:

Getting started with git
Working with forks

Submission

Commit your changes and push them to your repository (your fork of this repository)
On GitHub page of your repository, click Contribute -> Open pull request. Then click on Create pull request. Change the title to “Lab 2 submission” (if you want to ask a question about your code, use the title “Lab 2 work in progress”). Click on Create pull request.
On Canvas, submit URL to the pull request that you just created.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

lab2.org

lab2.org

Lab II. Basic dialogue management

Getting started with development environment

Preflight step 1. Create Azure account and enable speech services

Preflight step 2. Run the example project

Assignment

Task 1. “Appointment”

Task 2. Additional features

Task 3. Improvements

Resources

Submission

Files

lab2.org

Latest commit

History

lab2.org

File metadata and controls

Lab II. Basic dialogue management

Getting started with development environment

Preflight step 1. Create Azure account and enable speech services

Preflight step 2. Run the example project

Assignment

Task 1. “Appointment”

Task 2. Additional features

Task 3. Improvements

Resources

Submission