Creating a corpus is time-consuming, but it may be necessary if the available text corpora contain little or no data useful for a research project.
Important considerations when compiling a corpus:
Representativeness: Ensure your corpus adequately reflects the language variety you are studying.
Balance: Distribute text types and genres proportionally to avoid overrepresentation of certain categories.
Size: Determine the appropriate corpus size depending on your research question and available resources.
Accessibility: Consider making your corpus available to other researchers if appropriate.
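As a rough illustration of checking size and balance, the sketch below counts tokens per genre and reports each genre's share of the total. It assumes the corpus is held as (genre, text) pairs and that splitting on whitespace is an acceptable stand-in for a real tokenizer; the function name genre_shares and the sample texts are hypothetical and not part of any particular corpus tool.

```python
from collections import Counter


def genre_shares(documents):
    """Return the total token count and each genre's share of those tokens."""
    tokens_per_genre = Counter()
    for genre, text in documents:
        # Whitespace splitting is a crude stand-in for a real tokenizer.
        tokens_per_genre[genre] += len(text.split())
    total = sum(tokens_per_genre.values())
    if total == 0:
        return 0, {}
    shares = {genre: count / total for genre, count in tokens_per_genre.items()}
    return total, shares


# Hypothetical toy sample: (genre label, text) pairs.
sample = [
    ("news", "The council approved the budget on Monday"),
    ("fiction", "She opened the door and listened"),
    ("news", "Rain is expected across the region"),
]

total, shares = genre_shares(sample)
for genre, share in sorted(shares.items()):
    print(f"{genre}: {share:.0%} of {total} tokens")
```

A report like this only shows how the texts are distributed. Whether a given split counts as balanced or representative still depends on the research question and the language variety being studied.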
See below: Key steps in compiling a corpus.