Topic Web Content Quality has various and often subjective aspects. In this year's Discovery Challenge we will try to explore various properties that may determine the overall rank, quality and importance of a Web site, with the task of developing automatic methods that can be used to estimate web content quality. Our main target is to help organizations, both commercial (such as commercial search engines) and non-commercial (such as non-commercial web archives), in their efforts to prioritize their procedures to gather, store and organize their collection of Web pages. The objectives of these entities may vary from institution to institution, e.g. an Archive may even want to include even Web spam but with lower priority, while others may prefer frequent refresh with extensive resources allocated to news sites. As another example, content generated by amateurs individually or in informal organizations may be considered either as an important part of our culture to be preserved, or as something that needs to be handled separate from content generated by professionals in formal organizations. Sponsorship, prizes This year's competition will have cash prizes sponsored by Google and travel grants sponsored by Yahoo!
Travel grants : USD 2500 in travel grants for up to 5 students. In the case that a participant wins 2 prizes they will get only the higher one and leave the other for the next participant in the ranked list. Data preparation and assessment is supported by the EU FP7 Project LIWA (Living Web Archives) and by the Hungarian national grant OTKA NK 72845. Publications Publications describing the design of the best systems will be peer reviewed and published. Details will be available soon. Note: as of April 2010 please consider this information as subject to change. |