Rare Book Monthly

Articles - August - 2019 Issue

A Deep-dive Database of Local History, Attitudes, and Ideas

Ulster County documents

Ulster County documents

Recently I purchased a small group of mid-Hudson Valley material that I found useful as examples of what would logically be included in a deep-dive experimental database for the New York State counties mid-way between New York City and Albany.  A what?  A deep-dive-database is a full text searchable database, something like what Google does in its Books section.  Whether it is a d3 or a FTSD or something else remains to be seen, but it is the future.

 

Databases of the printed word have generally been confined to brief descriptions and details of books and printed documents.  To see an actual copy, for example if you are using the OCLC, you are provided locations where such copies, physical and electronic, are found.   On RBH we focus on auction records and dealer descriptions to illuminate the emerging understanding of an example’s importance and value.  Such databases are potentially very large as ours is, more than 9 million full text records. 

 

But what is now emerging are full text databases.  That is, they capture the complete contents of a document in word searchable form, not only as a scan but as a word document.  Some efforts currently look for references in text but they have generally been dull instruments, in some cases because the references need to be dug out and in others because they are behind paywalls.    This will change and with this change there will be full text readable versions searchable online – and in many cases, searchable for free.

 

This experimental free database for the mid-Hudson Valley will include the standard reference materials, town and county histories, maps that convey changes, appropriate books by local authors, broadsides, pamphlets and ephemera – all in full searchable text.

 

The search will be different because the most common form posted will be ephemera that will outnumber books and pamphlets somewhere between a thousand and ten thousand to one.

 

Books usually include the title, author, publisher/printer, place and date printed.  When even one of these facts is missing it can complicate searches.  For ephemera you might be lucky to have three of these factors.  The others will require associated factors such as “they are among a group of letters in the same hand”.  Here’s an example.  A collection of letters from A.M. to B. R.  dated by day and month but not by year.  However, one envelope is dated 1863 and the events mentioned suggest the Battle at Chancellorsville.  Can this be figured out?  Probably.  As this example suggests, judgments will be made.

 

Here are some of the fields needed to identify and contextualize such letters.

 

Date or date range stated or implied

Names implied or known

Subject[s] such as events and places

Regimental references and information including cross-references

 

In addition, other fields will sometimes play a part:

 

Watermarks

Context of the document [among a group of similar items or with other related materials]

 

References gleaned from genealogical sites

 

References from online searches on Google and others

 

Altogether it will often, but not always, be possible to contextualize material, thus creating a deeper perspective – a perspective I believe that will change our understanding of the past.

 

Here are some other examples:  Ulster Mine at Ellenville, Ulster County, New York, a series of 5 printed documents, many with illustrations, that relate to this mine from 1852 to 1855 that include:

 

A 16 page report dated July 1st, 1852

 

An abbreviated broadside version dated July 1st, 1852

 

A 12 page report dated December 10th, 1852

 

A broadside, brief financial statement dated 15th December, 1852

 

A 16 page report dated January 3, 1854 titled Official Reports of the Ulster Company for the year 1853

 

This mine was located a short distance from the Delaware & Hudson Canal and was opened in 1852 during a period when Americans were looking everywhere for gold because of the stories emerging about the gold strikes in California.  In Ellenville they found lead while in Kingston some 20 miles away they believed they found gold that, when assayed, turned out to be pyrite or fool’s gold.  Such documents are so much more interesting than a title, date, author and print date.

 

Among the other documents I purchased is a stock receipt for the Hobart Branch Railroad Company signed by Thomas Cornell, who was a man of wealth whose steam boats coursed the Hudson River in the latter half of the 19th century.  He was based in Rondout but his influence reached in every direction.

 

Another is a menu for the Hotel Kaaterskill at Catskill for Thursday August 24, 1899.  Tastes have changed!

 

A small one is an 1857 7.625” x 5” broadside circular calling on teachers in Orange County to participate in a quarterly meeting to be instructed on new teaching approaches.  The teachers were expected to pay their own way but a handwritten note suggests the costs may be shared.

 

These are a few of the many documents that will contribute to an understanding of what life was like and altogether convey the changing assumptions and understanding people generally had.  Life has never been a paved highway and in the mid-Hudson Valley it seems more like a gravel path; every spec of gravel evidence of unique personal history.

 

An intensely focused, full text searchable database will bring these details to light.

 

Images of some of the examples are included with this article. 


Posted On: 2019-08-09 17:16
User Name: certainbooks

Hello Bruce: How would this proposed database differ from the current OCLC search fields, for instance? These search fields allow for choices in access method, accession number, author, author phrase, corporate or conference name, corporate and conference name phrase, personal name, personal name phrase, language type, material type, material type phrase and 18 more choices, per each search line - including a half-dozen under 'subject' alone. The search fields in OCLC offer these options, in three separate possible boxes, multiplying the search-ability by all those permutations. Additionally, there are year date, language and number of libraries searches as separate boxes. Limitation fields below go even further and allow for type of material: books, visual materials, computer files, internet resources, serial publications, sound recordings, archival materials, continually updated resources, articles, musical scores, maps allow for a narrowing of the field of search even further. There are additional limitations for availability possibilities too. Sincerely, George Krzyminski at Certain Books


Posted On: 2019-08-10 18:17
User Name: adminb

The OCLC, which I use but may not fully understand, shows how many copies are held among the more than 30,000 members of OCLC. So, for example I looked up “Art Work of Ulster County” recently and found 5 locations: LOC, NYPL, SUNY New Paltz, UCCC and Penn State. None of these copies are searchable online. Neither did I find it in Google Books.

To see the entire volume all pages including text and images will to be scanned and then converted into one or more word documents that random keywords searches can find. That’s the approach I’ll take to all material uploaded to this database.

In addition to books, all printed forms as well as manuscript material will be included.

This full text will be wide open to Google so that random terms and phrases found in this local database will create matches.

At a guess, and it’s strictly a guess, about 15% of the U. S. population has some connection to the mid-Hudson Valley.


Rare Book Monthly

  • Doyle
    The Collection of Mary Tyler Moore
    June 4, 2025
    DOYLE: Peter Max, Portrait of Mary Tyler Moore (Versions 1,2, 5, 6), 2001. Estimate $10,000-15,000
    DOYLE: The iconic screen-used wall-mounted "M" from The Mary Tyler Moore Show. Estimate $5,000-8,000
    DOYLE: The Mary Tyler Moore Show by Al Hirschfeld. Estimate $4,000-6,000
    Doyle
    The Collection of Mary Tyler Moore
    June 4, 2025
    DOYLE: Annie Leibovitz presents Mary Tyler Moore and Dick Van Dyke for Vanity Fair. Estimate $4,000-6,000
    DOYLE: Al Hirschfeld presents Mary Tyler Moore and Dick Van Dyke in the CBS Wednesday Night Lineup. Estimate $4,000-6,000
    DOYLE: Richard McKenzie, Portrait of Mary Tyler Moore. Estimate $1,000-2,000
    Doyle
    The Collection of Mary Tyler Moore
    June 4, 2025
    DOYLE: Three Original Bill Hargate Costume Designs for The Mary Tyler Moore Hour. Estimate $600-800
    DOYLE: The famous Bonnie and Clyde "Wanted" broadside. Estimate $500-800
    DOYLE: Ticket to the Final Episode of the Mary Tyler Moore Show Estimate $400-600
  • Sotheby's
    Bibliothèque Jacques Dauchez - Autour de Dubuffet
    5-19 June
    Sotheby’s, June 5-19: Bissière, Roger. Cantique à notre frère soleil de saint François. 1954. 1,000 - 1,500 EUR
    Sotheby’s, June 5-19: Céline, Louis-Ferdinand. La vie & l’œuvre de Philippe Ignace Semmelweis. 1924. Rare édition originale, avec envoi. Joint : La Quinine en thérapeutique, 1925. 4,000 - 6,000 EUR
    Sotheby’s, June 5-19: Céline, Louis-Ferdinand. Mort à crédit. 1936. Édition originale. Bel exemplaire sur Hollande. 2,500 - 3,500 EUR
    Sotheby's
    Bibliothèque Jacques Dauchez - Autour de Dubuffet
    5-19 June
    Sotheby’s, June 5-19: Chillida, Eduardo ─ Emil Cioran. Face aux instants. 1985. Un des 100 exemplaires sur Arches. Eau-forte signée. 600 - 800 EUR
    Sotheby’s, June 5-19: Dubuffet, Jean. Ler dla canpane. L’Art Brut, 1948. Édition originale. 3,000 - 5,000 EUR
    Sotheby’s, June 5-19: Dubuffet, Jean. L'Herne Jean Dubuffet. 1973. Un des 100 exemplaires du tirage de luxe avec une sérigraphie originale en couleurs. 1,000 - 1,500 EUR
  • Gros & Delettrez
    Livres & Manuscrits Arméniens
    Jeudi 12 juin 2025
    Paris, Francis
    Gros & Delettrez, June 12: BIBLE, Venise 1733, reliure arménienne
    Gros & Delettrez, June 12: CHARAKNOTS, manuscrit XVIIe-XVIIIe siècle
    Gros & Delettrez, June 12: CHARAKNOTS, manuscrit daté 1606, reliure arménienne
    Gros & Delettrez, June 12: CHARAKNOTS, manuscrit début XVIIIe siècle, reliure arménienne
    Gros & Delettrez, June 12: CHARAKNOTS, Amsterdam 1664
    Gros & Delettrez, June 12: CHARAKNOTS, Amsterdam 1702, reliure arménienne
    Gros & Delettrez, June 12: DICTIONNAIRE arménien, manuscrit XVIIe-XVIIIe siècle.
    Gros & Delettrez, June 12: EVANGILE, manuscrit 1735-1737, reliure arménienne
    Gros & Delettrez, June 12: LIVRE DE PRIERES, Grégoire de Narek, manuscrit
    Gros & Delettrez, June 12: GEOGRAPHIE, Ghoukas INDJIDJIAN, Venise 1802-1806
    Gros & Delettrez, June 12: MANUSCRIT THEOLOGIQUE, XVIe-XVIIe siècle
    Gros & Delettrez, June 12: MASHTOTS, manuscrit XVIIIe-XIXe siècle, reliure arménienne
    Gros & Delettrez, June 12: LETTRE ENCYCLIQUE, manuscrit XIXe siècle
    Gros & Delettrez, June 12: NOUVEAU TESTAMENT, Amsterdam 1668, reliure arménienne

Article Search

Archived Articles