Creating a Movie Recommender

edwinq · October 6, 2021, 7:28pm

Hi @ravimashru and team, not sure if I am late to the party but this seems like a very cool project. I’d love to be involved.

edwinq · October 8, 2021, 12:47am

@ravimashru @bhutanisanyam1 Is there another call scheduled?

ravimashru · October 8, 2021, 1:57am

Hey @edwinq… Welcome to the community!

You’re not late at all. We’ve just had one call so far and then I guess we got busy with other things. We’d love to have you on board.

I was planning to resume working on this over the weekend, but I haven’t scheduled a call yet.

I’m not sure what timezone you’re in. Does Saturday 2:00 p.m. GMT work for you?

@kurianbenoy-aot @vrc0503 will you be joining as well?

edwinq · October 8, 2021, 4:57pm

Sure, I’m in EST so I should be free for about an hour or so then. Still meeting here :

Video call link: https://meet.google.com/xtv-cgir-kbs ?

ravimashru · October 9, 2021, 5:43am

Awesome!

I think that’s the old one. I’ve set up a new meeting here.

hello34 · October 9, 2021, 5:55am

It would also love to join the meeting. I am still excited to work on this project. @ravimashru, I am not able to find the meeting link you shared

ravimashru · October 9, 2021, 6:45am

Oh, that’s strange. That was a link to a calendar invite. If that’s not working then here’s a link to the call directly: https://meet.google.com/heu-heqi-grq

bhutanisanyam1 · October 10, 2021, 5:04am

After fighting the errors for over 12 hours, I decided to submit a bug.

edwinq · October 13, 2021, 1:00am

Apologies, I was not able to attend the last call, are there any action items that we can get started on?

ravimashru · October 13, 2021, 11:12am

No worries @edwinq! Here’s a summary of what we did:

We created the fast-recsys org on GitHub to store everything we work on. I can add you to the org as well if you give me your GitHub username.
We created skeleton repositories for the UI (using VueJS) and the backend API server (using FastAPI).
We also started looking into NVIDIA Merlin since it has really cool tools for feature engineering (NVTabular), training (HugeCTR) and even an inference server (Triton). However, we pretty much hit a roadblack with installing the required dependencies when trying out the examples and @bhutanisanyam1 created the GitHub issue he mentions above.

We’re currently experimenting with two approaches:

Creating an API server to serve the model trained using fastai.
Use the NVIDIA Merlin ecosystem to train and deploy the recommendation system.

Here’s a few things we can start doing:

Create a web UI that a user can use to rate movies they have already watched (some inspiration: Book Recommender: Collaborative Filtering, Shiny | Kaggle) and view recommendations.
Convert the contents of this blog post into Python files and create REST API endpoints that we can use from the UI.
The maintainers of NVTabular have responded to the issue that Sanyam created. We can try to set up an environment with the required dependencies like they recommend and try to run the example notebooks (they already have notebooks that use the MovieLens25 dataset that we also plan to use )

Apart from this, I don’t think we have any other planned/concrete action items. We’re just planning to learn and figuring things out as we go!

We’ve planned to meet again at the same time this Saturday as well (Calendar invite). We’d love to see you there if you can make it.

edwinq · October 13, 2021, 5:20pm

that sounds awesome @ravimashru . My github username is halloffame0793, that would be great if you could add me to the org. And yes I am planning on attending on Saturday.

talha_darrxscale · October 13, 2021, 5:36pm

hey @ravimashru, I’m developing a Fashion Recommender System, I have created object detection models using yolov4, done image classification using FastiAI, I don’t know how to make a web app and create an API of it. i would love to join , it is my need though, kindly add me to your project.i want to learn.

ravimashru · October 14, 2021, 3:21am

Hey @talha_darrxscale… welcome the community!

I just read your project description and it looks very interesting! I see you plan to use VueJS for your UI and Django for your backend so there is some similarity there.

Please feel free to join us in our next call (Calendar invite). We’d be delighted if anything we do turns out to be useful for your project.

talha_darrxscale · October 14, 2021, 4:26am

Thanks for your invitation hope it would help me .

ravimashru · October 16, 2021, 7:20pm

Here’s a summary of what we did today

NVTabular and Merlin

Ran the first two example notebooks on movielens without any errors by replacing cuDF with pandas.

Installed PyTorch from pip instead of conda to get most of the third notebook to run. The last cell (actually training the model) fails with the following error:

RuntimeError: merge_sort: failed to synchronize: cudaErrorIllegalAddress: an illegal memory access was encountered

This needs to be looked into further.

Frontend

The VueJS frontend was set up using tailwind and vuetify.

Backend

Data from the MovieLens25 dataset was added to the repository.

A few more backend APIs were created - to fetch random movies for the user to rate.

Action item for @ravimashru: create endpoint to fetch details of a single movie from the OMDB API.

edwinq · October 16, 2021, 7:28pm

Ahh, I was just getting ready to join but must have gotten the timezones mixed up :frowning : (
Great work though!

ravimashru · October 17, 2021, 11:37am

Ah… If I had a dollar for everytime I got the timezone for a meeting wrong I’d give Jeff Bezos a real run for his money.

Hopefully we’ll catch you next saturday.

Until then we’ll be working on this asynchronously. Feel free to reach out to me if you want to try out anything and have any questions.

ravimashru · October 18, 2021, 2:07pm

FYI: Since we’ve got quite a few people interested in this topic, I’ve created a Discord channel to allow us to have more free-flowing conversations without spamming this thread. Feel free to join if you’re interested: mashruravi's server.

I will update this thread from time to time, especially after any calls/coding sessions.

bhutanisanyam1 · October 19, 2021, 7:30am

@ravimashru We really want to create our discourse as the goto place for such discussions.

Is there any feature you find missing on discourse? I can look into integrations or even upgrading our plan if it helps

ravimashru · October 19, 2021, 10:32am

@edwinq and I were just talking about having more frequent day-to-day conversations on Discord so that we don’t spam this thread.

But keeping all discussions in one place makes sense as well. We’ll continue using this thread for all communication.

Topic		Replies	Views
#7 PyTorch Book Thread: Sunday, 10th Oct 8AM PT PyTorch Book Reading Group	34	2744	October 17, 2021
Week 12 Discussion Thread Fastbook Reading Group	51	2853	September 10, 2021
Week 11 Discussion Thread Fastbook Reading Group	18	2327	September 9, 2021
Week 15 Discussion Thread Fastbook Reading Group	29	1670	September 24, 2021
Creating a Fashion Recommender System Show the Community!	5	1962	December 27, 2023

Creating a Movie Recommender

NVTabular and Merlin

Frontend

Backend

Related topics