OpenVideoCrawler

2005-12-15

I've completed a crawler for open-video.org video repository. It's written in Python, and it works. It generates descriptors for H-DOSE semantic search engine, or a less complicated, non-xml-style information file. Then it computes some keyword statistics about the movies in the collection, tells you whether you indexed fictitious movies and which where the fictitious ones. It has good performances, but they largely depend on your connection to the internet. I publish this info here and in english for laziness sake. This is a subproject for my degree thesis in computer science. Related files:

All the above code is released under the terms of GPL.