Categories
Blog #4

Blog #4: Markup

We have done different type of reading on the same text and each time, there is more information being extracted. For the markup project, we conducted a very close reading and attempted to identify specific things, such as names, places, dates, events, etc. This project required more collaboration than ones before since we all had to agree on what to tag and what not to. As mentioned by Pierazzo, two readers could have two different interpretations of the same text. To solve this issue, we discussed as a group and formed rules/guidelines using Google Docs on what should and should not be tagged. There were no disputes amongst the group after this. Collaborating on this project was necessary since it helped eliminate the bias of having only one person work on the whole project.

The close reading required for this project allowed me to see more details about Samuel Tippett’s life. An important event in his life is his father’s death when he is young. This helps as readers understand Tippett even more. It also allows us to look at other people who were a part of his life, locations he travelled throughout his life and how he conveyed his emotions. We went more in depth during this project than we did during the initial transcription, as the transcription was just a private edition. This markup publication helps us get closer to the diplomatic edition that was mentioned in the reading, where we try to recreate and digitalize the initial text. It is not yet a perfect diplomatic as it is missing many of the ‘facts’ but it provides much more than the raw texts that comes with just transcribing and we have the option to change the codes of the XML file to make it look similar to the original text.

Going through the text word by word to determine the markup necessary then reading the context of that specific word, phrase or sentence helps us engage with the text. It is not just mindless reading, we actually had to engage with the text to figure out what needed to be tagged and what did not. The most difficult part of this project was deciding what to tag, such as do we mark any of pronouns that refer to God or Jesus as a person name, we decided not to. There were emotions and events that our group deemed too vague to be tagged. After marking up the texts, we compiled it as an HTML file, making it look similar to a webpage.

Categories
Blog #1

Blog Post #1

Because of the new wave of technology, people are able to gather more information and have it become more accessible to a larger audience. The ability to access a plethora of resources on various subjects gives digital archives an advantage over archival documents. It is more time efficient to access documents on an online database than searching through a physical collection of printed documents. Search mode enables people to save time by instantly looking up whatever information rather than manually looking through each print. Another advantage to creating digital versions of documents is the fact that physical documents deteriorate over time due to weathering and physical handling. However, a disadvantage of creating digital artifacts is losing the authenticity of the original works. The physical and written structure of printed documents and the style of handwriting get lost in translation when they are turned into digital artifacts. Due to this, learning about a period’s culture is not possible because of the disappearing evidence of handwriting styles, writing styles, etc.. The way people write shows the cultural expectations for that time period. For example, flowery poetry was popular from the late 1700s to the 1900s in Britain and the United States. Knowing the long, wordy style of “flowery poetry” it indicates what those cultures valued: artistic and emotive expression. Although digital documents can use visualization like word clouds to show the most commonly used words in a document, it does not show the adjectives in between which distinguishes flowery poetry from other styles. In terms of what kind of knowledge digital archives offer, they give people access to a multitude of documents from all time periods which enables them to make connections over long periods of time unlike physical archives which are more tiring to look though.

It is hard to answer the ultimate question of which method is better because each has its own advantages and disadvantages. However, the way people read digital versus print documents is different. So in a sense, digital artifacts do supplant our needs for print because visualization tools like word pattern graphs and text images that use shape, color, texture help the mind process the digital information better. However, reading online documents by browse and search mode encourages readers to skim and search for the information they want instead of reading slowly. Print documents require inspection, sift, and synthesis because they do not include visualization like word clouds. Close reading is a good skill for understanding the little nuances in a text and aids in understanding the information on a deeper level since it focuses on word choice and phrasing. To encourage that in digital texts, it might be effective to not put in so many images that it distracts the reader from reading the text itself. It is easy to mistake those images as conclusions themselves rather than a means to a conclusion.

Comparing the Moravian Lives project with other large DH projects, similarities like categorizing documents. The Moravian website has a map which enables the reader to visualize where the documents originate from. They are also categorized by language, country and archive. Quanitfying Kissinger also uses network analysis to visualize all the information sourced from the documents for comparison. Because large DH porjects have large quanitities of data, they use distant reading to visualize patterns to understand the overarching ideas.