The Hearing Room

Landing page
Select from a number of public figures
Choose a recent discussion topic they participated in
Watch discussion on the source video

Description

The Hearing Room combines Symbl.ai and Machine Learning to create a catalogue of speaking points from videos of public figures.

Each person has a list of topics and links to time-stamped videos that go directly to where they spoke.

Progress

For the scope of this project, we focused on the Senate Floor audio recordings archived at TVW.org - a website that offers "unedited coverage of Washington state government, politics and public policy".

Using a combination of Symbl.ai async audio API and facial identifier algorithm, we were able to accomplish the following:

Retrieve the transcript and topics from each media file through the Symbl.ai async audio API
Run a facial identifier algorithm to identify the people present in the media.
Run our stitching algorithm to analyze which faces belong to which senators.
Using the output from our stitch, we identified which speaker spoke about which topics per media file
Display the catalogued data on our website for users to browse. For example, if someone was interested in what topics Senator Rolfes had discussed, a user can navigate to her thumbnail, click on it, and it will show the list of topics she has talked about in the Senate Floor. Each topic has a list of link(s), which navigates the user to the moment of where the speaker talks about the topic in the video.

Concept

Our project aims to provide further resources for voters by listing what topics the incumbent candidates have discussed during their terms and link it back to the unedited coverage.

The voter's pamphlet is always sent out near the time of elections, which has some small summaries of the candidates. Sometimes we'd like to know if the topics we care about were discussed by the incumbent official, which may not be captured in the voter's pamphlet. Through our project, we conducted analyses on topics that were discussed, as well as facial recognition to connect the speakers to the topics, from the media files.

The Hearing Room is designed so that we can look up incumbents, find the topics they've discussed, and allow users to find a time-stamped video(s) of where they can hear the official talk about the topic.

Challenges

Here are the limitations we encountered in our project:

While the facial recognition and stitching algorithm works in identifying people who has complete facial image (e.g. Zoom calls), it has a more difficult time identifying people who had their faces covered (e.g. Mask)
The async audio API requires an input of the number of people speaking in the media file prior to the start of the media file processing to separate out the speakers in the transcripts. In this project, we relied on the facial recognition algorithm to estimate the number of people who may be speaking in the media file. The estimated number of people may be inaccurate, because it may include people who may not be speaking or can't identify due to face covering (e.g. mask).

Feasibility

This project has ample rooms to grow in terms of information gathering to provide voters with easier access to the unedited coverage. Part of its growth involves, but not limited to, the following:

working with Symbl.ai to increase job processing as the trial we were using could only accommodate a max of 2 jobs at a time
expand coverage to other parts of the state government whose members are elected officials
identifying other sources from other states that provide unedited coverage of the government business that are open to the public for viewing
expanding it to the federal government level coverage (e.g. working with c-span)

Because a large part of the project's potential relies on automation, we estimate that over time the overhead operating cost to operate The Hearing Room will be significantly more efficient and productive compared to the initial setup.

Built With

Submitted to

API World Hackathon 2021
- Winner Symbl.ai Challenge - Build an AI App that Understands Natural Human Conversations at Scale

Created by

I gathered the transcripts and topics data using symbl.ai, as well as being one of the quality checkers of the facial stitching output. Also wrote up the project details.

Teresia Djunaedi
I was the tech lead, and developer for the data transformations and image recognition portion.

Fernando Arnez
Matthew Parker
Natalie Kiner

Updates

Teresia Djunaedi started this project — Oct 27, 2021 03:06 PM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.