The Dataset
Our 300+ participants generated a rich, multi-layered dataset requiring both quantitative and qualitative analysis:
- Quantitative Speech Data: 6,900+ audio recordings across five speaking tasks per participant (L1 baseline, four L2 English tasks including game recommendation, restaurant recommendation, gaming tutorial, genre discussion, and conversational agent dialogues)
- Survey Responses: Gaming preferences and habits, speaking and listening patterns in gaming contexts, online communication styles, self-assessment of gaming fluency, motivation and interaction patterns, demographic and language background
- User Experience Data: Conversational agent feedback, ratings, and qualitative comments about the libre software AI experience
- Community Discourse: Reddit comments and forum discussions revealing attitudes toward gaming and language learning