• Corpus of Founding Era American English (COFEA)

    Texts127.8K
    Words138.2M
    Words + Punctuation159.8M
    The Corpus of Founding Era American English covers the time period starting with the reign of King George III, and ending with the death of George Washington (1760-1799). COFEA contains documents from ordinary people of the day, the Founders, and legal sources, including letters, diaries, newspapers, non-fiction books, fiction, sermons, speeches, debates, legal cases, and other legal materials. The majority of texts have been pulled from the following six sources: the National Archive Founders Online; William S. Hein & Co., HeinOnline; Text Creation Partnership (TCP) Evans Bibliography (University of Michigan); Elliot's Debates; Farrand's Records; and the U.S. Statutes-at-Large from the first five Congresses.
  • BYU-Corpus of Early Modern English (BYU-COEME)

    Texts40.3K
    Words1.1B
    Words + Punctuation1.3B
    The BYU-Corpus of Early Modern English cover texts from 1475 – 1800 that were included in the Evans Bibliography, the Early English Books Online (EEBO), Eighteenth Century Collections Online (ECCO) corrected by the Text Creation Partnership (TCP) Evans Bibliography (University of Michigan).
  • Corpus of Supreme Court Opinions of the United States (COSCO-US)

    Texts61.8K
    Words98.4M
    Words + Punctuation113.9M
    The Corpus of Supreme Court Opinions of the United States includes all opinions in the United States Reports and opinions published by the Supreme Court through the 2017 term.
  • Corpus of State Conventions on the Adoption of the Constitution (COSCAC)

    Texts652
    Words1.5M
    Words + Punctuation1.7M
    The Corpus of State Conventions on the Adoption of the Constitution consists of five volumes of The Debates in the Several State Conventions on the Adoption of the Federal Constitution. According to the library of congress, they “remain the best source for materials about the national government's transitional period between the closing of the Constitutional Convention in September 1787 and the opening of the First Federal Congress in March 1789.”
  • Corpus of the Records of the Constitutional Convention (CORCC)

    Texts847
    Words693.8K
    Words + Punctuation820.6K
    The Corpus of the Records of the Constitutional Convention covers three of the four volumes of The Records of the Federal Convention of 1787. Published in 1911, Farrand's work attempted to represent the documentary records of the Constitutional Convention.
  • Corpus of Early Statutes at Large (CESAL)

    Texts481
    Words445.6K
    Words + Punctuation519.7K
    The Corpus of Early Statutes at Large includes laws passed by the United States Congress in chronological order. The first set published cover the first five Congresses and a small part of the sixth.
  • Corpus of US Caselaw (CUSC)

    Texts8.5K
    Words4.2M
    Words + Punctuation4.8M
    The Caselaw Access Project (“CAP”) expands public access to U.S. law. Its goal is to make all published U.S. court decisions freely available to the public online, in a consistent format, digitized from the collection of the Harvard Law Library.The first installment is state cases from 1760 to 1799.
  • Corpus of the Current US Code (COCUSC)

    Texts58
    Words50.1M
    Words + Punctuation65.2M
    The US Code Corpus contains the entirety of the US Code from the Office of Law Revision Counsel. The corpus was downloaded from https://uscode.house.gov/browse.xhtml in July of 2019 and, therefore, only reflects the version of the US Code that was available on that date. In this corpus, each title is treated as a separate document/text. In addition, the appendices to Titles 5, 11, 18, 28, and 50 are also treated as separate documents/texts. The corpus does not include the Front Matter to the US Code or the Acts Cited by Popular Names portion, but does include section text and notes.
  • Create Custom Corpus