Guy Uses Machine Learning To Help Him Count How Many Times Noku From Wadiwa Wepa Moyo Says “Hesi”

Farai Mudzingwa

1 May 2020

Stumbling upon this pointless but intriguing video made my day yesterday. Usually when we talk about machine learning and programming – it’s rarely this light-hearted.

A student from HIT – Tatenda Mushaya made use of Machine Learning techniques to attempt to figure out how many times Noku from Wadiwa Wepa Moyo says “Hesi”.

If you’ve watched the show, you’ll know why the “Hesi” has become equally iconic and infamous resulting in some of funniest social media reactions;

When Noku says "Hesi" #Wadiwawepamoyo pic.twitter.com/Kiikeb2825
— Sandy Muleya (@MuleyasandyA) April 24, 2020

These folks at #WadiwaWepaMoyo should jump on this and creat HESI tees inspired by Noku they would sell fast😂
— Taku Splits (@SplitsLoui) April 25, 2020

Lol each time someone types “hesi” all I can hear is Noku saying hesi 😂💀
— Hazel 🦅 (@muvahaze) April 21, 2020

Social media reactions aside, Tatenda followed the steps below;

Collect images from the internet which can be done using code.
Resize the images
Detect faces use script
Crop the detected face
Pick only Noku’s face
Make Noku encodings
Detect Noku’s face.
find the `Hie` subtitle

You can see the script he created to do this on Github

In the video above, Tatenda explained that the process was complicated because the script was trying to read text on varying backgrounds and as a result he could only pick up 4 Hesi’s.

Fellow programmers, comment below on how Tatenda can make his script better at identifying Noku’s “Hesi’s”.

Artificial Intelligence, Social Media, Software Development

9 comments

What’s your take?

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Leslie

1 May 2020 11:54

Thanks for open sourcing your code bro, by the way this isn’t a “stupid mission”

Reply
Imi Vanhu Musadaro

1 May 2020 12:21

I would frame this as an audio extraction and detection problem. “Hesi” could possibly be said whilst the subject isn’t within frame or not facing the camera. I’ve only watched one episode, so I can’t speak to the distribution of such events.

Audio can be trickier to work with though, as most AI tutorials, examples e.t.c focus on computer vision. One option would be to use text recognition to identify locations of the subtitle “Hie” (regardless of the speaker), then extract the audio within the neighbouring areas and process that accordingly, to identify the speaker.

Reply
Sam Manokore

1 May 2020 15:04

Dude, this is impressive. This is not stupid at all!
I can see something like this taking off and being useful some day. I home I will remember to hit you up when I find an alternative and more effective way of doing it.

Reply
Togara

1 May 2020 15:34

He should just use audio to use the algorithm to pick ‘hesi’ from audo.

Reply
1. Nyahwa
  
  3 May 2020 10:09
  
  Audio algorithm is a very complex but achievable
  
  Reply
DC

1 May 2020 18:23

Even if it’s “stupid”, it’s stupidly funny. I quite enjoyed. Things line these lead to interesting use cases. Remember mould gave us penicillin. It was a botched experiment. Kinda stupid I’d say 😁

Reply
1. Farai Mudzingwa
  
  2 May 2020 12:25
  
  Very true
  
  Reply
evermoreg

2 May 2020 20:16

im inspired by this young man. Great work ineed

Reply
Maps

3 May 2020 09:56

Dude you are a genious🔥

Reply

Connect with us

Home

Guy Uses Machine Learning To Help Him Count How Many Times Noku From Wadiwa Wepa Moyo Says “Hesi”

9 comments

What’s your take? Cancel reply

What’s your take?