This course offers an introduction to fundamental algorithms and data structures.
The course includes program examples in several programming languages. Understanding them requires knowledge of the respective language.
Introduces practice and theory of literary criticism. Seminar focuses on topics such as the history of critical methods and techniques, and the continuity of certain subjects in literary history. Instruction and practice in oral and written communication. Topic: Theory and Use of Figurative Language. This seminar offers a course of readings in lyric poetry. It aims to enhance the student's capacity to understand the nature of poetic language and the enjoyment of poetic texts by treating poems as messages to be deciphered. The seminar will briefly touch upon the history of theories of figurative language since Aristotle and it will attend to the development of those theories during the last thirty years, noting the manner in which they tended to consider figures of speech distinct from normative or literal expression, and it will devote particular attention to the rise of theories that quarrel with this distinction. The seminar also aims to communicate a rough sense of the history of English-speaking poetry since the early modern period. Some attention will be paid as well to the use of metaphor in science.
Here you can access PostScript/PDF and HTML versions of D.J. Arnold, Lorna Balkan, Siety Meijer, R. Lee Humphreys and Louisa Sadler, Machine Translation: an Introductory Guide, Blackwells-NCC, London, 1994, ISBN: 1855542-17x.
Computation has produced new and exciting ways of studying texts. Many of these methods do not require the use of expensive programs or detailed programming knowledge, but only the know-how to combine freely accessible resources to perform various tasks. This site describes common or interesting sequences of actions, or recipes. They are organized according to the objective of the recipe. Recipes fall into the three major categories of location and identification of ideas, themes or specific terms; analysis of textual devices or themes; or the construction of new entities or corpora. The Methods Commons community benefits from shared experience and learning how others make use of recipes. You can share your experience by adding your own recipes to the collection. More information about recipe and exercise structure and authoring is available on the Recipe Structure page. We also have a Glossary that we hope you will add to.
Your list is now clean enough that you can begin analyzing its contents in meaningful ways. Counting the frequency of specific words in the list can provide illustrative data. Python has an easy way to count frequencies, but it requires the use of a new type of variable: the dictionary. Before you begin working with a dictionary, consider the processes used to calculate frequencies in a list.
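As a rough illustration of the dictionary-based counting this lesson describes, here is a minimal sketch. The function name `count_frequencies` and the sample word list are assumptions made for this example, not taken from the lesson itself.

```python
def count_frequencies(words):
    """Return a dictionary mapping each word to the number of times it occurs."""
    frequencies = {}
    for word in words:
        # get() returns 0 the first time a word is seen, so no key check is needed
        frequencies[word] = frequencies.get(word, 0) + 1
    return frequencies


# Hypothetical sample list, standing in for a cleaned word list
word_list = ["it", "was", "the", "best", "of", "times",
             "it", "was", "the", "worst", "of", "times"]
freqs = count_frequencies(word_list)
```

Each key in `freqs` is a distinct word and each value is its count, so `freqs["it"]` gives the number of times "it" appears in the list.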
This lesson uses Python to create and view an HTML file. If you write programs that output HTML, you can use any browser to look at your results. This is especially convenient if your program is automatically creating hyperlinks or graphic entities like charts and diagrams.
Here you will learn how to create HTML files with Python scripts, and how to use Python to automatically open an HTML file in Firefox.
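One possible way to write an HTML file from a Python script and view it is sketched below. The helper names (`build_page`, `write_page`, `open_in_browser`) and the file name are assumptions for illustration. The standard-library `webbrowser` module opens the system's default browser, which may or may not be Firefox.

```python
import os
import tempfile
import webbrowser


def build_page(title, body):
    """Wrap a body fragment in a minimal HTML document."""
    return ("<html>\n<head><title>{0}</title></head>\n"
            "<body>\n{1}\n</body>\n</html>\n").format(title, body)


def write_page(path, title, body):
    """Write the HTML document to path and return the path."""
    with open(path, "w", encoding="utf-8") as html_file:
        html_file.write(build_page(title, body))
    return path


def open_in_browser(path):
    """Open a local file in the default browser (not necessarily Firefox)."""
    return webbrowser.open("file://" + os.path.abspath(path))


# Hypothetical output location in the system's temporary directory
page_path = write_page(os.path.join(tempfile.gettempdir(), "hello_world.html"),
                       "Hello", "<p>Hello World!</p>")
# open_in_browser(page_path)  # uncomment to view the result
```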
In this two-part lesson, we will build on what you’ve learned about Working with Webpages, learning how to remove the HTML markup from the webpage of Benjamin Bowsey’s 1780 criminal trial transcript. We will achieve this by using a variety of string operators, string methods and close reading skills. We introduce looping and branching so that programs can repeat tasks and test for certain conditions, making it possible to separate the content from the HTML tags. Finally, we convert content from a long string to a list of words that can later be sorted, indexed, and counted.
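The looping and branching idea can be sketched as follows, assuming a simple character-by-character scan that skips everything between `<` and `>`. The function name `strip_tags` and the sample string are made up for this example, and the approach is deliberately naive (it ignores comments, scripts and character entities).

```python
def strip_tags(page_contents):
    """Return the text of page_contents with anything inside <...> removed."""
    inside_tag = False
    text = ""
    for char in page_contents:       # loop: visit every character once
        if char == "<":              # branch: a tag is starting
            inside_tag = True
        elif char == ">":            # branch: a tag has ended
            inside_tag = False
        elif not inside_tag:         # branch: ordinary content, keep it
            text += char
    return text


# Made-up sample markup, not taken from the trial transcript
sample = "<p>Hello <b>world</b> of text</p>"
words = strip_tags(sample).split()   # convert the long string to a list of words
```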
In this lesson, you will learn the Python commands needed to implement the second part of the algorithm begun in From HTML to a List of Words (part 1). The first half of the algorithm gets the content of an HTML page and saves only the content that follows the tags.
In this lesson you will first learn what topic modeling is and why you might want to employ it in your research. You will then learn how to install and work with the MALLET natural language processing toolkit to do so. Using MALLET involves modifying an environment variable (essentially, setting up a shortcut so that your computer always knows where to find the MALLET program) and working with the command line (i.e., typing in commands manually rather than clicking on icons or menus). We will run the topic modeller on some example files and look at the kinds of output that MALLET produces. This will give us a good idea of how it can be used on a corpus of texts to identify topics found in the documents without reading them individually.
As in Output Data as HTML File, this lesson takes the frequency pairs collected in Counting Frequencies and outputs them in HTML. This time the focus is on keywords in context (KWIC), which are built from n-grams of the original document content – in this case a trial transcript from the Old Bailey Online. You can use your program to select a keyword and the computer will output all instances of that keyword, along with the words to the left and right of it, making it easy to see at a glance how the keyword is used.
Once the KWICs have been created, they are then wrapped in HTML and sent to the browser where they can be viewed. This reinforces what was learned in Output Data as HTML File, opting for a slightly different output.
At the end of this lesson, you will be able to extract all possible n-grams from the text. In the next lesson, you will learn how to output all of the n-grams of a given keyword in a document downloaded from the Internet, and display them clearly in your browser window.
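Extracting all n-grams from a list of words might look like the sketch below. The function name `get_ngrams` and the sample sentence are assumptions for illustration. Each n-gram is a slice of n consecutive words.

```python
def get_ngrams(word_list, n):
    """Return every run of n consecutive words in word_list, in order."""
    # The last valid starting index is len(word_list) - n, so the range
    # is empty when the list is shorter than n.
    return [word_list[i:i + n] for i in range(len(word_list) - n + 1)]


# Hypothetical sample text, already split into words
sample_words = "it was the best of times".split()
trigrams = get_ngrams(sample_words, 3)
```

A keyword-in-context display can then be built by keeping only the n-grams whose middle word is the chosen keyword.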
This lesson takes the frequency pairs created in Counting Frequencies and outputs them to an HTML file.
Here you will learn how to output data as an HTML file using Python. You will also learn about string formatting. The final result is an HTML file that shows the keywords found in the original source in order of descending frequency, along with the number of times that each keyword appears.
This lesson builds on Keywords in Context (Using N-grams), where n-grams were extracted from a text. Here, you will learn how to output all of the n-grams of a given keyword in a document downloaded from the Internet, and display them clearly in your browser window.
This lesson shows how to use Python to automatically transliterate a list of words from a language with a non-Latin alphabet into a standardized format using American Standard Code for Information Interchange (ASCII) characters. It builds on readers’ understanding of Python from the lessons “Viewing HTML Files,” “Working with Web Pages,” “From HTML to List of Words (part 1)” and “Intro to Beautiful Soup.” At the end of the lesson, we will use the transliteration dictionary to convert the names from a database of the Russian organization Memorial from Cyrillic into Latin characters. Although the example uses Cyrillic characters, the technique can be reproduced with other alphabets using Unicode.
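A transliteration dictionary of the kind described here could be sketched as follows. The table covers only a handful of Cyrillic letters and is an illustrative assumption, not the lesson's actual table or any particular transliteration standard. The name `transliterate` is likewise hypothetical.

```python
# Partial, illustrative Cyrillic-to-Latin table (lowercase letters only)
CYRILLIC_TO_LATIN = {
    "а": "a", "б": "b", "в": "v", "г": "g", "д": "d", "е": "e",
    "и": "i", "к": "k", "л": "l", "м": "m", "н": "n", "о": "o",
    "р": "r", "с": "s", "т": "t", "ш": "sh",
}


def transliterate(word, table):
    """Replace each character found in table; leave other characters unchanged."""
    result = []
    for char in word:
        latin = table.get(char.lower(), char)
        # Preserve capitalization: "Ш" becomes "Sh", not "sh"
        result.append(latin.capitalize() if char.isupper() else latin)
    return "".join(result)


latin_name = transliterate("Москва", CYRILLIC_TO_LATIN)
```

Characters missing from the table pass through untouched, which makes gaps in the table easy to spot in the output.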
In this exercise we will use advanced find-and-replace capabilities in a word processing application in order to make use of structure in a brief historical document that is essentially a table in the form of prose. Without using a general programming language, we will gain exposure to some aspects of computational thinking, especially pattern matching, that can be immediately helpful to working historians (and others) using word processors, and can form the basis for subsequent learning with more general programming environments.
When you are working with online sources, much of the time you will be using files that have been marked up with HTML (Hyper Text Markup Language). Your browser already knows how to interpret HTML, which is handy for human readers. Most browsers also let you see the HTML source code for any page that you visit. The two images below show a typical web page (from the Old Bailey Online) and the HTML source used to generate that page, which you can see with the Tools -> Web Developer -> Page Source command in Firefox.
No restrictions on your remixing, redistributing, or making derivative works. Give credit to the author, as required.
Your remixing, redistributing, or making derivative works comes with some restrictions, including how it is shared.
Your redistributing comes with some restrictions. Do not remix or make derivative works.
Most restrictive license type. Prohibits most uses, sharing, and any changes.
Copyrighted materials, available under Fair Use and the TEACH Act for US-based educators, or other custom arrangements. Go to the resource provider to see their individual restrictions.