Q: For what languages do we want to collect data for? #5

omarsar · 2020-06-28T14:23:41Z

Please include the languages that you think we should collect data for. If you have experience working in a specific language, that will be useful and you can propose collecting emotion-related data in that language.

omarsar · 2020-06-28T14:24:30Z

I have worked with both English and Spanish. I am also looking at my dialect, Creole.

fmplaza · 2020-06-28T21:03:09Z

In my PhD I'm working with both English and Spanish too, but I focus more on Spanish as it is my mother tongue. I have experienced in collecting Twitter messages.

maraimm · 2020-06-29T20:48:47Z

I will contribute for Arabic

Maybe it is good to discuss the data collection in the next meeting. Are we creating new resources or make use of the existing ones?

KhalidAlt · 2020-06-30T18:14:14Z

I would like to contribute in both Arabic and English.

omarsar · 2020-07-01T13:56:25Z

In my PhD I'm working with both English and Spanish too, but I focus more on Spanish as it is my mother tongue. I have experienced in collecting Twitter messages.

@fmplaza do you know of any large-scale dataset for Spanish? I haven't come across any.

omarsar · 2020-07-01T13:59:05Z

I will contribute for Arabic

Maybe it is good to discuss the data collection in the next meeting. Are we creating new resources or make use of the existing ones?

@maraimm we are creating new resources. I will emphasize on the data collection part next meeting. Thanks. Arabic data will be great as well. Have you looked around to see if there any existing datasets for emotion recognition?

Maybe @KhalidAlt feel free to share any information you come across.

Let's have some updates on this for our next meeting.

KwasiArhin · 2020-07-01T20:58:29Z

I will look up to see if there are any datasets with TWI that i can find.. other I can only participate with English haha

fmplaza · 2020-07-02T11:41:09Z

In my PhD I'm working with both English and Spanish too, but I focus more on Spanish as it is my mother tongue. I have experienced in collecting Twitter messages.

@fmplaza do you know of any large-scale dataset for Spanish? I haven't come across any.

@omarsar I know three different emotion datasets for Spanish labeled at tweet level but they don't include a large data set:

EmoEvent: A Multilingual Emotion Corpus based on different Events.
I'm one of the authors of this paper, it has been recently published in the LREC conference.
The Spanish version of EmoEvent dataset contains 8,409 tweets. Labels: anger, fear, sadness, joy, disgust, surprise, other.
Datasets from SemEval-2018 Task 1: Affect in Tweets AIT dataset comprises the datasets used in two subtasks:
1. E-c Multi-Label Classification. The dataset contains 7,094 tweets but it is a Multi-Label Classification Dataset. Labels: anger, anticipation, disgust, fear, joy, love, optimism, pessimism, sadness, surprise, trust, neutral, or no emotion.
2. EI-oc (emotion intensity ordinal classification) and EI-reg (emotion intensity regression) subtasks. The dataset contains 7,953 tweets. Labels: anger, fear, sadness, joy.

maraimm · 2020-07-13T22:16:19Z

For Arabic, there are many efforts and most of them result in small-sized datasets: The following are the datasets I found in the first phase of the search.

AETD dataset - Emotional-Tone - Type: Tweets - Size: 10065 - Lang: Arabic-Egyptian
SemEval-2018 Task 1: Affect in Tweets (AIT-2018)
Lama-dataset - available by contacting the author
Emotion in the Headlines

omarsar · 2020-07-14T14:03:50Z

@maraimm those are great findings. Do you mind giving us a short overview of your findings in the next meeting? It doesn't have to be a long presentation. We would just like an update.

maraimm · 2020-07-15T18:51:40Z

@omarsar Yeah, sure. I am not sure when is the next meeting, date and time?

omarsar · 2020-07-17T11:19:33Z

@maraimm it's scheduled for next Saturday (25 July 2020 - 15:00 CEST). I will send the zoom link in our Slack group.

maraimm · 2020-07-17T13:46:37Z

Thanks, @omarsar. Unfortunately, I am not sure I will be able to join the call on Saturday. In case, I was not able, shall I prepare something today to share it with the team tomorrow? (Summary for example)

Will the session be recorded?

omarsar · 2020-07-17T14:31:58Z

@maraimm the summary would be excellent. If it's a recording even better then I can share it with the group when we meet again. All sessions are being recorded.

maraimm · 2020-07-18T08:34:31Z

Hi @omarsar,

I emailed you a short recording.

Thanks

omarsar · 2020-07-18T13:08:00Z

@maraimm Thank you for the video recording. I have added it to our meeting notes.

cahya-wirawan · 2020-07-25T15:22:39Z

Hi,
Sorry for coming late. I found a paper from late 2018 about emotion classification on indonesian twitter: https://www.researchgate.net/publication/330674171_Emotion_Classification_on_Indonesian_Twitter_Dataset
They collected and annotated 7500 tweets with 5 emotions: love, joy, anger, sadness, and fear.

rfazeli · 2020-09-10T18:54:40Z

I can collect data for Persian

omarsar assigned manisnesan, omarsar, ranarag, RachitBansal, dhruvrnaik, fmplaza, KwasiArhin, KhalidAlt and angysaravia Jun 28, 2020

omarsar added research_emotion_analysis data_collection labels Jun 28, 2020

fmplaza unassigned ranarag Jun 28, 2020

manisnesan removed their assignment Apr 15, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Q: For what languages do we want to collect data for? #5

Q: For what languages do we want to collect data for? #5

omarsar commented Jun 28, 2020

omarsar commented Jun 28, 2020

fmplaza commented Jun 28, 2020

maraimm commented Jun 29, 2020

KhalidAlt commented Jun 30, 2020

omarsar commented Jul 1, 2020

omarsar commented Jul 1, 2020

KwasiArhin commented Jul 1, 2020

fmplaza commented Jul 2, 2020 •

edited

maraimm commented Jul 13, 2020

omarsar commented Jul 14, 2020

maraimm commented Jul 15, 2020

omarsar commented Jul 17, 2020

maraimm commented Jul 17, 2020

omarsar commented Jul 17, 2020

maraimm commented Jul 18, 2020

omarsar commented Jul 18, 2020

cahya-wirawan commented Jul 25, 2020 •

edited

rfazeli commented Sep 10, 2020

Q: For what languages do we want to collect data for? #5

Q: For what languages do we want to collect data for? #5

Comments

omarsar commented Jun 28, 2020

omarsar commented Jun 28, 2020

fmplaza commented Jun 28, 2020

maraimm commented Jun 29, 2020

KhalidAlt commented Jun 30, 2020

omarsar commented Jul 1, 2020

omarsar commented Jul 1, 2020

KwasiArhin commented Jul 1, 2020

fmplaza commented Jul 2, 2020 • edited

maraimm commented Jul 13, 2020

omarsar commented Jul 14, 2020

maraimm commented Jul 15, 2020

omarsar commented Jul 17, 2020

maraimm commented Jul 17, 2020

omarsar commented Jul 17, 2020

maraimm commented Jul 18, 2020

omarsar commented Jul 18, 2020

cahya-wirawan commented Jul 25, 2020 • edited

rfazeli commented Sep 10, 2020

fmplaza commented Jul 2, 2020 •

edited

cahya-wirawan commented Jul 25, 2020 •

edited