This server hosts large collections of files used and generated by www.altlaw.org, the free legal search engine.
All files on this server are PUBLIC DOMAIN. Likewise, they come with NO WARRANTY with regard to accuracy or completeness.
To repeat, these collections are not complete.
These files were downloaded from U.S. appeals court web sites, using a crawler written by Prof. Paul Ohm at the University of Colorado. They contain approximately 200,000 federal court cases decided between 1996 and 2007.
There is one compressed file for each court. Each case opinion has an accompanying XML file containing metadata such as date and title.
Daily updates to the Ohm1 Corpus, from 2007 to the present. Same format as the base collection.
You can find other case law collections used by AltLaw (as well as mirrors of the files here) at bulk.resource.org.