Corpus details

Corpus Hermans

Official name: Corpus Hermans
Language type: written
Corpus type: special purpose
Size: 1,554 clauses; 2,210 lines; 18,572 words
Description: W.F. Hermans' novella "Het Behouden Huis", Amsterdam 1968 (6th edition) [1st edition 1951], analysed and coded to facilitate research on word order in Dutch
Exploration: The corpus consists of plain text files and can be explored with standard text-search and concordance software such as WordSmith and Windows Grep.
Fragmentation: complete text
Example: 5 1 (de grote tak, bijna de hele kruin/10 lag/17 ineens/44 onder de boom/41 zonder dat ik gekraak hoorde/43) +--
Origin: Faculteit der Letteren, Vrije Universiteit. The corpus was compiled by D.M. Bakker and A. Verhagen.
Location: Faculty network, folder G:\LET\Data\Corpora\Nederlands\Hermans
Details: Every element in the text that can be permuted at clause level has been coded. The original goal of the corpus was to serve as input for the automatic generation of all possible permutations in Dutch. Apart from the syntactic coding, each clause is also coded for tense.
The sentences are identified by a page number, followed by a line number (on that page).
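Because the corpus is plain text, a coded line can also be read with a few lines of script. The following is a minimal sketch in Python, assuming that every line follows the layout of the Example above: a page number, a line number, a parenthesised clause whose elements each end in "/" plus a numeric code, and an optional trailing marker such as "+--". The function name parse_line and the field names are hypothetical, the layout is inferred from that single example only, and the meaning of the trailing marker is not described here, so it is kept as an uninterpreted string.

```python
import re

# Assumed layout, inferred from the single Example line of this record:
#   <page> <line> (<element>/<code> <element>/<code> ...) <trailing marker>
LINE_RE = re.compile(r"^(\d+)\s+(\d+)\s+\((.*)\)\s*(\S*)$")

# One coded element: any text, then "/" and a numeric code.
ELEMENT_RE = re.compile(r"\s*(.+?)/(\d+)")


def parse_line(line):
    """Split one coded corpus line into page, line number, elements and trailer.

    Hypothetical helper: it does not interpret the codes or the trailing marker.
    """
    match = LINE_RE.match(line.strip())
    if match is None:
        raise ValueError("line does not follow the assumed layout: %r" % line)
    page, line_no, body, trailer = match.groups()
    elements = [(text.strip(), int(code)) for text, code in ELEMENT_RE.findall(body)]
    return {
        "page": int(page),
        "line": int(line_no),
        "elements": elements,
        "trailer": trailer,
    }


example = ("5 1 (de grote tak, bijna de hele kruin/10 lag/17 ineens/44 "
           "onder de boom/41 zonder dat ik gekraak hoorde/43) +--")

record = parse_line(example)
for text, code in record["elements"]:
    print(code, text)
```

For the example line this yields page 5, line 1 and five coded elements. The numeric codes themselves are explained in the code list referenced under See Also. Lines that deviate from the assumed layout (for instance clauses containing nested parentheses) would need a more careful parser.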
See Also: 
Name: Codes Corpus Hermans
Description: Meaning of the codes that are used in the corpus
