R
Patrick Flor
https://web.archive.org/web/20150424032216/https://idrh.ku.edu/colang-workshops
Time: 8:30am – 10:00am
Week 2: R will meet 25 - 28 June 2012
Meeting Location (for both Workshops): 419 Watson Library
This workshop is an introduction to linguistic annotation and analysis with R.
What is R?
R is “a language and environment for statistical computing and graphics.” It encapsulates a number of statistical procedures and visualizations useful to linguists, * so that we can spend less time implementing analysis tools, and more time exploring our data and designing research questions. The workshop is in two parts, each one week long: 1) Introduction to XML. Our goal is to build a basic, functional knowledge of XML and related technologies (XPath, XSLT), in order to understand how we can structure our linguistic data for accessibility, analysis, and preservation. We will practice designing linguistic tagsets, tagging data, and interacting with it through simple queries. Participants are encouraged to bring along their own text data in order to make this exercise more relevant to their linguistic documentation and research projects. 2) XML and R. We will then learn how to design and ask more advanced questions about our data, by writing basic programs in R, a software tool for statistical analysis and visualization. No previous computer programming experience is assumed. Tools we will be using: R – (R is a free download, available for Windows, Mac, and Linux computers.) oXygen – (this is not free software, but a trial license is available—please download the oXygen Editor software and sign up for a trial registration a few days before the workshop begins. Windows, Mac and Linux versions are available.) * No prior coursework in statistics is assumed for this workshop.