-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy path05_Concordancing.Rmd
57 lines (37 loc) · 1.41 KB
/
05_Concordancing.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
---
title: "Concordancing"
author: "| Rodrigo Esteves de Lima-Lopes \n| University of Campinas \n| [email protected]\n"
output:
md_document:
variant: markdown_github
html_document:
toc: yes
pdf_document:
toc: yes
number_sections: yes
---
```{r setup, include=FALSE}
knitr::opts_chunk$set(echo = TRUE)
```
# Introduction
This script was developed for the analysis of Portuguese. I hope it helps colleagues in the LC area and popularize the use of R. It is part of our research project developed with CNPQ. Please drop me a line if you have any doubts or need any help.
# Purpose of this repository
This git brings the scripts for my article:
- Lima-Lopes R.E. (forthcoming). Beyond the Binary: Trans Women's Video Activism on YouTube. Accepted for publication at *Digital Scholarship in the Humanities*.
This script is specifically about **Concordancing**.
# Package
For concordancing we will need only one package
```{r package, eval=FALSE}
library(quanteda)
```
Creating the corpus
```{r corpus, eval=FALSE}
corpus.caption <- corpus(df_topics,
text_field = "caption", metacorpus = NULL,
compress = FALSE)
```
Please, change the variable `pattern` for the word of your interest and `window` for the number of words (left and right)
```{r conc, eval=FALSE}
kwic(corpus.caption, pattern = "amiga",
window =6, valuetype = "fix")
```