Monday, August 8, 2011

Confirmation

With confirmation from Sir Jach, Sir Ludwig and my inner gut, I have finally decided on that extension that Sir Jach approved of last week: a webpage classifier.

So right now, my (tentative) title for my SP is:
Squidler - A Squid Log Parser with Webpage Classifier

.. Okay, Squidler may be kind of lame. It's subject to further scrutinization.
It's from the words "Squid" and "crawler" (web crawler).
As for the implementation, I am leaning more on a web-based one.

What I visualize for this SP is that it shows the different IP addresses connected at the proxy server, and it shows the different information about their connections (the websites they have visited and when they accessed them, how long they have accessed the internet, bandwidth consumed, etc) .

Afterwards, it classifies the webpages they have visited, and stores that information for later arrangement.
I have found a lot of papers that deals with webpage classification, and Sir Jach advised me to find something "unique" for my implementation.

Well, here goes nothing. I will pursue this one problem.

No comments:

Post a Comment