Victoria Firsanova

Portfolio

Teaching

2023-2024 Python – MA program, Computational Linguistics
Higher School of Economics
2023-2024 Digital Literacy – BA program, Media Communication
Higher School of Economics
2023-2024 Computational Linguistics – BA program, Philology
Higher School of Economics
2023-2024 Projects – "Developing a model for question answering and navigation for Hermitage visitors"; "Developing a chatbot for movie search"
Higher School of Economics

Recent Projects

TranscribePro
A mobile app for supporting inclusive education through Text-to-Speech (TTS) and Speech-to-Text (STT) technologies. Current state: prototyping.
Learn more
Virtual Teaching Assistant
A web app adapting Large Langusge Models (LLMs) for educators. Current state: prototyping.
Learn more
EMPI AI Ecosystem
A set of tools, including mobile apps, web apps, and a smart watch for supporting inclusive education. Current state: prototyping.
Learn more
The Black Box
A popular science podcast about Artificial Intelligence for Russian-speaking audience.
Learn more
Text-to-Image XAI
Experiments on text-to-image AI that combine knowledge from linguistics and computing. A contribution to the black box problem
Learn more

Selected Papers

Towards Building a Mobile App for People on the Spectrum
Victoria Firsanova
Companion Proceedings of the ACM Web Conference 2023, Austin, TX, USA.
The inclusion of autistic people can be augmented by a mobile app that provides information without a human mediator making information perception more liberating for people in the spectrum. This paper is an overview of a doctoral work dedicated to the development of a web-based mobile tool for supporting the inclusion of people on the autism spectrum. The work includes UX/UI research conducted with psychiatry experts, web information retrieval study and neural question-answering research. Currently, the study results comprise several mobile app layouts, a retriever-reader model design and fine-tuned neural network for extractive question-answering. Source code and other resources are available at https://github.com/vifirsanova/empi.
What do text-to-image models know about the languages of the world?
Victoria Firsanova
Data-Centric AI Workshop, 35th Conference on Neural Information Processing Systems (NeurIPS 2021), Sydney, Australia.
Text-to-image models use user-generated prompts to produce images. Such text-to-image models as DALL-E 2, Imagen, Stable Diffusion, and Midjourney can generate photorealistic or similar to human-drawn images. Apart from imitating human art, large text-to-image models have learned to produce combinations of pixels reminiscent of captions in natural languages. For example, a generated image might contain a figure of an animal and a symbol combination reminding us of human-readable words in a natural language describing the biological name of this species. Although the words occasionally appearing on generated images can be human-readable, they are not rooted in natural language vocabularies and make no sense to non-linguists. At the same time, we find that semiotic and linguistic analysis of the so-called hidden vocabulary of text-to-image models will contribute to the field of explainable AI and prompt engineering. We can use the results of this analysis to reduce the risks of applying such models in real life problem solving and to detect deepfakes. The proposed study is one of the first attempts at analyzing text-to-image models from the point of view of semiotics and linguistics. Our approach implies prompt engineering, image generation, and comparative analysis. The source code, generated images, and prompts have been made available at https://github.com/vifirsanova/text-to-image-explainable.
Two Approaches to Building Dialogue Systems for People on the Spectrum
Victoria Firsanova
Data-Centric AI Workshop, 35th Conference on Neural Information Processing Systems (NeurIPS 2021), Sydney, Australia.
The paper presents a study on combining model- and data-centric approaches to building a question answering system for inclusion of people with autism spectrum disorder. The study shows that applying sequentially model- and data-centric approaches might allow achieving higher metric scores on closed-domain lowresourced datasets.
Transformer Models for Question Answering on Autism Spectrum Disorder QA Dataset
Victoria Firsanova
Springer International Publishings, Digital Transformation and Global Society: 6th International Conference, St. Petersburg, Russia.
Question answering (QA) Transformer-based models might become efficient in inclusive education. For example, one can test and tune such models with small closed-domain datasets before the implementation of a new system in an inclusive organization. However, studies in the sociomedical domain show that such models can be unpredictable. They can mislead a user or evoke aversive emotional states. The paper addresses the problem of investigating safety-first QA models that would generate user-friendly outputs. The study aims to analyze the performance of SOTA Transformer-based QA models on a custom dataset collected by the author of the paper. The dataset contains 1 134 question-answer pairs about autism spectrum disorders (ASD) in Russian. The study presents the validation and evaluation of extractive and generative QA models. The author used transfer learning techniques to investigate domain-specific QA properties and suggest solutions that might provide higher QA efficiency in the inclusion. The study shows that although generative QA models can misrepresent facts and generate false tokens, they might bring diversity in the system outputs and make the automated QA more user-friendly for younger people. Although extractive QA is more reliable, according to the metric scores presented in this study, such models might be less efficient than generative ones. The principal conclusion of the study is that a combination of generative and extractive approaches might lead to higher efficiency in building QA systems for inclusion. However, the performance of such combined systems in the inclusion is yet to be investigated.

About

This is how DALL·E 2 sees me!

Hello there, I am Victoria, and I'm self-taught AI researcher. At school, I was passionate about two things: computers and languages. I knew that computational linguistics is the perfect field for me, but I failed the exams to enter the desired faculty and started to study philology at university. Something was wrong. I was interested in linguistics but I felt nauseous about the other humanities. At one summer school, a guy advised me to walk through Stanford's natural language processing course. I spent nights learning the algorithms and mathematics behind language models and machine learning. I was extremely insecure about my skills until my paper was accepted to a NeurIPS workshop.

Victoria Firsanova

BLOG

How to Train a Convolutional Neural Network for Sentiment Analysis

Portfolio

Teaching

Recent Projects

Selected Papers

CV

About

Recent Talks | YouTube