Ateam/Projects/Uber-crawl

From MozillaWiki

Jump to: navigation, search

Contents

1 Overview
2 Goals
3 Non-Goals
4 Deadline/Deliverables
5 ATeam
6 Dependencies
7 Major Tasks
8 Notes

Overview

Uber-crawl is a proposal from the JS and Layout teams. It's purpose is:

collect javascript patterns from websites in the wild
collect SVG/CSS/HTML patterns from websites in the wild

This data would be stored into a queryable datastore so that teams could answer the following questions:

Is pattern x used on the web?
How many of the top x sites use pattern y?
How many of the top x sites that also do RTL layout use pattern y?

Goals

Crawl top x of web sites
Store JS/CSS/HTML/SVG from sites
Provide a web tool for querying datastore
Provide backend API access to data

Non-Goals

Write a search engine

Deadline/Deliverables

None yet.

ATeam

We're thinking of reusing a large portion of the bughunter machinery for this, so bc is a good choice.

Dependencies

Machines, lots of machines.

Major Tasks

TBD

Notes

Retrieved from "https://wiki.mozilla.org/index.php?title=Ateam/Projects/Uber-crawl&oldid=318488"