Instructions
Database structure
The database contains three types of entities: Dialects, Idioms, and Sentences. These are organised hierarchically: each idiom in the database belongs to one dialect only (even though similar or even identical idioms may sometimes be found in several dialects), and each sentence in the database is also linked to a single idiom.
The Browse option will first take you to the list of dialects. The entry for each of the dialects gives the principal location where the data was collected, and the list of dictionaries or grammars used as sources, followed by a list of all the idioms belonging to the dialect.
Idiom information
The list of idiom properties includes voice, tense, aspect, and modal information; furthermore, the following structural properties are recorded:
- General structure of the idiom, given in sentence element notation, e.g. V DO, or V DO PP
- The open positions (if any) that the idiom contains: IO, DO, PP or a combination thereof
- Whether the open position needs to be animate or inanimate
- Whether the verb that is part of the idiom is either zijn ‘to be’ or hebben ‘to have’
- The type of the direct object, e.g. definite, indefinite, bare, quantified, etc.
- Whether or not the direct object is modified (usually by an adjective)
- If the idiom contains a possessive structure, what its type is. E.g. possessive pronoun, PP, external possession
- Whether the possession in question is alienable or inalienable
Some idioms are also provided with English glosses and translations (the ones used in questionnaires, see below). The abbreviations employed in glosses are standard and straightforward, except gender morphology, which requires a clarification. In dialects with a three-way gender distinction, the abbreviations are (F)eminine, (M)asculine, and (N)euter, while in dialects with a two-way system, (N)euter stands in opposition to NN for non-neuter, i.e. common gender.
Manipulated idioms
A number of idioms have been used in questionnaires presented to native speakers of the dialects, with the goal of establishing the extent to which idioms are dependent on the functional material, and determining the level of morphosyntactic flexibility they allow (see also the home page of the database). Each of these idioms is linked to several test sentences containing a manipulated form of the idiom. The properties listed for each sentence, in turn, include the test property of the base idiom we are interested in (e.g. deontic modality, or definite determiner), the way the idiom was manipulated in the sentence (e.g. (change to) epistemic modality, or topicalization of the DP), and the judgments speakers provided for the sentence.
The possible judgments values are:
- Yes (“the idiomatic reading is available”)
- No (“the idiomatic reading is not available”)
- Marginal (“the idiomatic reading is marginally available”)
- Base idiom unknown (“I do not know this idiom to start with”)
- Not asked (the test sentence was not presented to this particular speaker)
An aggregate acceptability (note that this always refers to the availability of the idiomatic reading, not the grammaticality of the sentence as such) is calculated for each manipulated sentence from individual judgments in the following way:
- The non-responsive values, i.e. “Not asked” and “Base idiom unknown” are ignored (for the purpose of this calculation only)
- If the judgments contain both “Yes” and “No”, even just one of each, we mark it acceptability varies by speaker
- When speakers judge something marginal, we pay more attention to those who had a definite opinion. In particular:
- if there are more “Yes” than “Marginal” judgments, we mark it Mostly acceptable
- if there are more “No” than “Marginal” judgments, we mark it Mostly not acceptable
- If there are more “Marginal” than “Yes” (respectively “No") judgments, or an equal number of them, mark it Moderately acceptable or Barely acceptable
To summarize:
Mostly acceptable | “Yes” and (optionally) “Marginal” only, majority is “Yes” |
Moderately acceptable | “Yes” and “Marginal” only, no “Yes” majority |
Acceptability varies by speaker | Both “yes” and “no” responses received |
Barely acceptable | “No” and “Marginal” only, no “No” majority |
Mostly not acceptable | “No” and (optionally) “Marginal” only, majority is “No” |
Note that while these judgment aggregates provide a useful summary, all individual judgments by native speakers are still listed for each sentence.
Search options
The Search page allows you to search for idioms, test sentences, or dialects, using a combination of selection criteria. The left three buttons on the search form lead to results of these respective types. The search buttons are shown on both top and bottom to minimize scrolling after selecting your criteria. You can only choose one at a time, but you can click on the appropriate links in a results listing to view related entities.
The remainder of the Search page consists of three sections, with search criteria for dialects, idioms, and sentences. You can combine search criteria from as many sections as you wish, regardless of the result type you choose. For example, you can choose to search for idioms, and specify that the translation of test sentences must contain the word drunk; you will then see a list of idioms linked to such sentences. Note that idioms of three dialects (Brees, Dendermonds, Leeds) have no recorded manipulated sentences. Using any sentence property in a search will therefore always exclude results from those dialects (also when searching for Idioms).
The three sections are collapsed by default.
Click on a header to expand the section and show its contents.
To select multiple elements in a list, hold down the Ctrl
or Cmd
key as you click.
Simple clicking with the mouse | selects a single element. |
Clicking while holding down the Ctrl (or Cmd ) key |
adds this element to the elements already selected. |
Ctrl-clicking on a selected element | de-selects this element. |
Clicking while holding down the Shift key |
selects all elements between the click and the last element clicked. |
If you select multiple values in a criterion, a record matching any one of them will be shown. If you search by multiple search criteria, a record must match all of them to be shown. If you do not select any values in a search criterion, it will not be used in the search.
Simple free-text searches require every search term to be present, as a whole word.
For more complex queries you can use the boolean operators AND
, OR
, and NOT
.
You can enclose "a string in quotes"
to match it exactly, or search for the keyword NULL
, which means the field must be empty.
It is also possible to search on a word-initial string using the *
wildcard, for example: zeg *
gives results for both "zeg" and "zeggen" (the space between the search term and * character is required).
You can click the Reset form button to remove all currently entered criteria and start from scratch. After entering your search criteria, click on one of the entity type buttons to carry out the search.