https://www.youtube.com/watch?v=Lr_HoA2XRQQ
- A site for viewing the same Youtube videos at the same time with friends.
- Main features of the party (virtual room):
- Watching the same video synchronously.
- Adding mulitple videos to a playlist that will autoplay all the videos currently in the playlist
- Host can can choose to play a specific video in the playlist, reorder it, and also delete videos from it.
- Chatting with other users within the room through voice or text.
- Sending emotes and displaying them on top of the video as a way to react to the video content.
- There is only one host (initially the party owner) at any point in time that possesses a virtual remote, and that virtual remote can be passed around between the users of the room.
The general user flow is as follows:
- A user signs up on the site.
- The user then can update their profile picture, and any other information.
- The user can create a room and set a password for it.
- The user then shares the room link to his friends who will be able to join once they put in the room password.
- If any of the invited users is not currently signed in, he will be redirected to sign in to either sign in or create an account.
- Only the host user (initially the party owner/creator) can add youtube videos to a playlist and set the current video and video list to be played.
- This video queue can be modified as needed.
- The host can pass the remote control to other users.
- The original party owner/creator can retrieve the remote at any time.
- The current video playing is synchronized with the progress of the host.
- The host can pause, play, skip ahead, and other users will be brought to that timestamp at the same time.
- Other users will not be able to modify the current progress.
- All users can interact through the chat box by sending messages, or react by sending emojis on the video itself.
- If a user doesn't want to be disturbed by the emotes, there is a toggle for it.
- All users can join a voice call.
- There is an option to mute their microphone or leave call and then rejoin.
- Users can leave or rejoin the room or call whenever they feel.
- The room will be deleted once all users leave.
Task: Leaving deployment aside, explain how the app is built. Please describe the overall code design and be specific about the programming languages, framework, libraries and third-party api that you have used.
The app is made up of three components.
The frontend is built with the React framework, using javascript, some html and scss (which is similar to css but is easier to nest classes
- It uses axios to fetch user data asynchronously.
- It uses socketio for server communication with the client.
- It handles chat synchronization, video synchronization, voice call joining synchronization, emote synchronization, and playlist updates.
- Peerjs is used in the frontend for peer to peer webrtc connections in order to start the call between peers.
In terms of the technical overview of the frontend the app is built in the following way.
-
Pages are routed through the BrowserRouter component of the "react-router-dom" library, with various paths.
-
Certain pages will redirect to the sign in page through another component that checks if the user is currently signed in. Sources were used to figure out how to implement our own PrivateRoute component to redirect to the sign in page from the following article https://medium.com/@thanhbinh.tran93/private-route-public-route-and-restricted-route-with-react-router-d50b27c15f5e.
-
A navbar is included onto every page with logic to display the current user or a sign in button otherwise.
-
For the main page, sign in, sign up, room creation, room join, and edit profile pages. They are rendered mostly statically with form fields and buttons to submit async rest requests to the server for updating user information or creating new rooms. Redirection logic is also included with the button clicks.
-
For authentication, a jwt token is included in cookie with samesite,secure, and httponly flags included. This cookie is also sent to the socket on connection, to ensure that the socket client has an authenticated user. When user logs out, the token is removed.
For the dynamic aspects and the main feature of our application, it is found in the '/party' page.
- The overall logic is to use socketio to receive and emit events which will render components when needed.
- When a user joins, the socket would be initialized, and send a connection request to the server along with a join-room event.
- The server will send the appropriate information through the socket such as chat logs, video information, users and other room information to be rendered.
- The client will also continue to listen for any updates to the video progress, chat, video playlist, user joining updates through the socket and handle them accordingly.
- The party page will have several components such as chat, sidepanel, connected users, video playlist, and emotes.
- The chat includes an input that will emit a socket event to send the message to the server.
- When the user receives the message from the server, it will update the messages.
- The chat includes an input that will emit a socket event to send the message to the server.
- The video uses the React-Player library (https://www.npmjs.com/package/react-player) which provides an interface for playing youtube videos inside an iframe and allowing controls and video progress updates.
- Proper validation is done on the video url before playing it to ensure that it's a valid youtube link.
- The video progress updates will be emitted to the server where it can tell clients (other users) to play, pause, or update their progress.
- The user can click on emotes to display a specific emote on the video as a way to react to the video content.
- To sync the current user's emote to all the other users, a signal is sent to the server, that will then send an emote signal similar to all other users, similar to how we implemented instant messaging.
- Coordinates x,y of the emotes are also sent so that all the users will have the emotes appear at the same location on the video.
- Emotes were obtained from the EmojiAPI (https://emojiapi.dev/emojis).
- For the sidepanel part, it is implemented as a single component that takes either type 'playlist' or 'users'. Depending on the type, it will display the corresponding content.
- For voice calls, socketio is used to signal the client to establish a peer to peer connection with other clients.
- First the client gets an peer id from the server.
- Then the client tells the server its id, which the server can then map the user to the id, since the user has been authenticated through the socket.
- When another user attempts to join call, the server will send everyone in that call an event that tells the other users to call that joiner with its peer id.
- that way multiway calls can be established.
- mediaStreams are acquired through the browser interface, and similarly with muting the stream.
- The AnalyserNode html component (https://developer.mozilla.org/en-US/docs/Web/API/AnalyserNode) is also used to recognize voice activity and highlight the icon of any user who is currently talking in the voice call.
The backend is written in javascript.
The following frameworks/apis are used:
- mongoose which is an interface for querying mongodb using the mongodb drivers.
- Schemas are built on mongodb objects.
- Querying users, room information, and messages are done through mongodb.
- graphql
- These endpoints were used to handle requests for user information, and room information from the database.
- RESTAPI
- This was used to send cookie information and user information when the user signs in/up and for updating user profiles.
- express server
- The express server had middleware handling many other parts of the server.
- Socket io is listening on top of this express server.
- It handles all the coordinations of videos, voice calls, chat, and other information in order to send to all users in the room for updates.
- peerjs express server is listening to /peerjs path.
- peerjs is a peer to peer webrtc api to coordinate the sending of audio streams to peers.
- This server allows users to send the correct information to perform peer web rtc connections.
- The server is a library from peer.
Otherwise, handling was done through utility functions such as checking if user was authenticated, and then executing certain procedures.
bcryptjs was used to salt and hash the passwords stored in the database. json web tokens were used to authenticate users. validator and mongo-sanitize were used to ensure client data were correct and valid.
- This is a mongodb instance running from a docker container.
We are using the DigitalOcean Droplet VM for deployment. We build our application as a Docker image and push it to GHCR (GitHub Container Registry). We then pull the latest image in the VM and run it as a container using Docker. The whole deployment process can be described with the following steps:
- The
release
workflow is a GitHub action that automatically builds a Docker image whenever we pushed to themain
branch or merge a pull request into themain
branch. - After the image is built, we login to the VM by SSH and pull the latest image using
docker compose pull
. - And lastly, we use
docker compose up -d
to restart the containers.
The Nginx configuration file and the docker-compose.yml file to start the containers can be found here.
We use Nginx as a reverse proxy for load balancing. Another reason why we choose Nginx is it integrates well with Certbot and we do not need extra configurations for setting up HTTPS.
We use Certbot (Let's Encrypt) to certify our application.
- DigitalOcean Droplet VM
- Docker
- GHCR (GitHub Container Registry)
- GitHub Action
- Nginx
- Certbot
We use the graphs provided by DigitalOcean to monitor our server. We monitor data such as the CPU %, memory usage, and bandwidth. The following are some data that we found interesting and we investigated the reasons behind.
- The CPU usage is steady most of the time, around 10% to 20%. However, sometimes it spiked to 80%. After investigation, our assumption is those spikes were caused by the Docker service. The CPU usage spiked when we pulled the latest image and restart the containers.
- The memory usage is high all the time, around 50% to 65%. It is because we choose the 1GB memory Droplet and the Docker service consumes a lot of memory since the containers are VMs.
Task: What is the top 3 most challenging things that you have learned/developed for you app? Please restrict your answer to only three items.
- Using PEER-WEBRTC to connect users in a call and provide calling functionality both ways, for n users.
- Using socketio (webrtc) to synchronize events happening in the room, such as video progress, user joins, disconnects, sending emotes, updating video playlists, providing authentication.
- Setting up github container registry to pull and build from main, then deploy onto Digital Ocean, and setting up the production configurations for it.
Task: Describe the contribution of each team member to the project. Please provide the full name of each team member (but no student number).
Victor Ko: Added functionality for features such as voice call, messaging, and video synchronization.
JunXing Xu: Setup the framework for the application and implemented APIs needed for the frontend. Setup deployment and GitHub action for CI/CD. Server maintenance and monitoring the data on DigitalOcean.
Emile Li Tim Cheong: Was in charge of anything UI/UX related, i.e. placing and designing the different page components and sub-components (video player, sidepanel, chatbox, side buttons, etc.) in such a way that will give the best user experience. Also implemented dark mode.
Task: Any additional comment you want to share with the course staff?
- For testing purposes as a single user, using a normal browser window and an incognito one should enable the tester to sign in as two different users in order to test the virtual room features.
- Darkmode is implemented for this site for easy viewing and good cinematic experience when watching videos.