Estudios Ingleses. (Plan a extinguir)

Grado y Doble Grado. Curso 2022/2023.

LINGÜÍSTICA COMPUTACIONAL Y DE CORPUS (LENGUA INGLESA) - 802231

Curso Académico 2022-23

Datos Generales

SINOPSIS

COMPETENCIAS

Generales
CG 1- Conocimiento instrumental de la lengua inglesa a un nivel elevado.
CG 2 - Conocimiento de las disciplinas del campo de la lengua inglesa.
CG 3 - Capacidad para utilizar herramientas informáticas y presentar en público productos, ideas e informes, todo ello en lengua inglesa.
CG 4 - Capacidad para manejar, programar y comprender programas informáticos, en especial aquellos relacionados con la gestión de bases de datos textuales, tanto en lengua inglesa como en española.
CG 5 - Capacidad para acometer el desarollo de trabajos relacionados con la gestión de bases de datos textuales, y el tratamiento informatizado de dichos datos, con un alto grado de autonomía.
CG 6 - Desarrollo de una actitud reflexiva y crítica sobre la perspectiva computacional aplicada a la lengua inglesa.
Específicas
CE 21: El alumno será capaz de desenvolverse con cierta soltura en el manejo de herramientas informáticas y de bases de datos textuales.

ACTIVIDADES DOCENTES

Clases teóricas
The course will combine theoretical aspects with hands-on practical activities involving computational and corpus analysis tools.
Clases prácticas
Hands-on practical activities involving computational and corpus analysis tools.
Laboratorios
Hands-on practical sessions in the computer lab

Presenciales

4

No presenciales

2

Semestre

2

Breve descriptor:

This course is designed to increase awareness of computational and corpus tools and their applications for linguistic research. It provides ways to learn about various tools to at least partially automate or accelerate linguistic analysis, providing students with a greater understanding of how and when to use empirical approaches to linguistic analysis. A large emphasis of the course is in how to deal with large amounts of language data and to understand practical issues in dealing with corpora, annotation and multi-lingual data. It covers computer methods for doing linguistics with on-line corpora. Includes extensive introduction (with lab) to different corpus tools, concordance programs, syntactic treebanks, propbanks, and corpora for lexicogrammatical, pragmatic and discourse research.

Requisitos

-Advanced computational skills to undertake the use and programming of computer tools and analyses. Students without adequate computer skills (including some basic programming skills) and in the first courses of the English Studies Degree should NOT register for this course.

Objetivos

 At the end of the course, students will have learnt basic aspects of the computational perspective on language, focusing on the preprocessing, markup and annotation of English computer corpora, as well as on the handling of corpus analysis and query tools for the extraction of linguistic information from machine-readable datasets. They will also become familiar with different computational and linguistic applications emerging from the computer-aided analysis of corpora

 

Contenido

Unit 1: Introductory Issues: Definitions and approaches. What is a corpus. Corpus types. Constructing corpora (week 1-3)


Unit 2: Second generation corpora: British National Corpus; Corpus of Contemporary American English; Corpus analysis tools (weeks 4-6)


Unit 3: Corpus data: access and analysis. Data processing; corpus mark-up and annotation; annotation standards; tokenising and part-of-speech tagging; lemmatising and morphological tagging; quality assurance, reusability and sustainability; syntactic processing, sense and semantic tagging; searching and concordancing.(weeks 7-11)


Unit 4: Corpus exploitation: corpora and lexicogrammatical studies, corpora and word meaning, corpora and discourse analysis, other computational applications. (weeks 12-14)

Evaluación

The final grade will be calculated according to the percentages below.The participation grade includes attending class regularly and participating actively. Other in-class activities (written questions about the readings, reading logs, reports, group work, oral presentations) will also count towards your participation grade. There will be several assignments, a group project and a a final take-home exam. More detail on each of these will be provided throughout the semester.

COMPONENT WEIGHT
Participation and activities 40%
Final exam 60%

Bibliografía

* Steven Bird, Ewan Klein, and Edward Loper. (2009). Natural Language Processing with Python. O’Reilly Press.
* Lavid, J. (2005). Lenguaje y nuevas tecnologías: nuevas perspectivas, métodos y herramientas para el lingüista del siglo XXI. Madrid, Ediciones Cátedra.
* Lüdeling, A., & Kytö, M. (Eds). (2008). Corpus Linguistics. An International Handbook.. Berlin - New York: Mouton de Gruyer.
* MCEnery, T. and A. Hardie (2012). Corpus Linguistics: Method, Theory and Practice. Cambrdige University Press.
* McEnery, McEnery, T., Xiao, R. and Tono, Y. (2006) Corpus based language studies. London: Routledge.
* Martin Wynne (ed.). 2005. Developing Linguistic Corpora: A Guide to Good Practice. Oxbow Books. http://www.ahds.ac.uk/creating/guides/linguistic-corpora/index.htm
Other readings available on-line. Links to those are provided from the course web site in Campus Virtual

Estructura

MódulosMaterias
PROFUNDIZACIÓN EN LENGUA Y LINGÜÍSTICA INGLESALINGÜÍSTICA INGLESA APLICADA

Grupos

Clases teóricas y/o prácticas
GrupoPeriodosHorariosAulaProfesor
Grupo A12/09/2022 - 21/12/2022JUEVES 12:30 - 14:30A-LAB 001MARIA JULIA LAVID LOPEZ
VIERNES 12:30 - 14:30A-LAB 001MARIA JULIA LAVID LOPEZ