Applied Scientist at Amazon. PhD from the University of Edinburgh.
I am interested in building computational agents that process natural language, and react in a manner that brings value to our lives.
Towards this vision, I am considering problems that arise in generative artificial intelligence, in particular as supported by large language models (LLMs), including knowledge grounding and in-context learning. I have also worked on problems in Computational Social Science, Figurative Language Comprehension, Machine Translation, and Computer Vision.
silviu dot vlad dot oprea at gmail dot com
News
- 16 April 2024: Excited to announce that I'll be joining Samsung Research UK as a Principal Machine Learning Research Engineer. Research agenda coming soon ⏳🧐🤖🚀🙏🏻! Thanks to everyone at Amazon for the last two years.
- 3 March 2023: I passed my PhD viva with no reviewable corrections 🥳! Thanks be to God 🙏🏻; to my supervisors Walid Magdy, Bonnie Webber, and Maria Wolters; and to my examiners Alexandra Birch-Mayne and Rada Mihalcea. Check out my thesis, Computational Sarcasm Detection and Understanding in Online Communication.
- 4 April 2022: Our patent, Processing communications in a computing arrangement for semantic understanding and interpretation of code-switching, by Sourav Dutta, Silviu Vlad Oprea, Salama Hitham, and Hu Peng, was published.
- 30 June 2021: The flood segmentation model that we built at Frontier Development Lab has now been deployed by SpaceX on an actual satellite 🛰. Along the way, we collaborated with the European Space Agency and UNICEF. Our work was covered by this post from the University of Oxford; and by several media outlets: 1, 2, 3, 4, 5. Check out our Nature (Scientific Reports) paper and the video of the rocket launch 🚀
See more news here.
Work
2024 - present: Principal Machine Learning Research Engineer at Samsung Research
Details coming soon, once research agenda is shaped (updated 16 April 2024)
2022 - 2024: Applied Scientist at Amazon Alexa AI
I am working on improving the ability of large language models (LLMs) to generate more intuitive, personalised, and context-sensitive responses, and to interact with external systems, with the purpose of providing Alexa customers with a more delightful experience.
2021: Applied Scientist (Intern) at Amazon Alexa AI
At Amazon, I worked with Elisabeth Kwan, Molly Xia, Christos Christodoulopoulos, Dave Palfrey, and Stephen Teskey on language generation using language models.
2020: Research Scientist (Intern) at Huawei
At Huawei, I worked with Haytham Assem and Sourav Dutta on learning transformations between monolingual word embedding spaces, to enable unsupervised translation and transfer learning to low-resource languages. Check out our COLING 2022 paper based on this work.
2019: Researcher at Frontier Development Lab
At Frontier Developemnt Lab, we built a flood segmentation model. In the process, we collaborated with the European Space Agency and UNICEF. The model has now been deployed by SpaceX on an actual satellite 🛰.
Our work was covered by this post from the University of Oxford; and by several media outlets: 1, 2, 3, 4, 5. Check out our Nature (Scientific Reports) paper and the video of the rocket launch 🚀
2014 - 2017: Engineer at VisualDNA and TheySay
During this time, I was an engineer at two tech startups. First, a software engineer at VisualDNA, a data science and management platform, where I worked on data aggregation and reporting using Scala and the Scalding interface to Hadoop. After VisualDNA, I spent some time as a contractor. Next, I was an artificial intelligence engineer at TheySay, a startup providing text analytics services, where I used technologies such as Scala and MongoDB.
Both startups were acquired, see this article about VisualDNA, and this one about TheySay.
2012: Guest Researcher at the National Institute for Standards and Technology
I worked with Bruce Miller on extending LaTeXML, a TeX parser that he wrote in Perl. The goal of my extenssion was to convert TikZ graphics to SVG. See this paper that mentions my work.
Education
2018 - 2023: PhD in Data Science at the University of Edinburgh
Check out my thesis, Computational Sarcasm Detection and Understanding in Online Communication.
In summary, I used computational methods to detect and understand the phenomenon of sarcasm, as it is manifested in online textual communication, together with my supervisors, Walid Magdy, Bonnie Webber, and Maria Wolters.
More specifically, I built a dataset of texts annotated for sarcasm (ACL 2020 paper), introduced sarcasm detection models (ACL 2019 paper), and also organised a competition encouraging the community to build such models (SemEval 2022 paper). I showed that people of similar socio-demographic backgrounds understand each other's sarcasm more often than people of dissimilar backgrounds (CSCW 2022 paper). Finally, I built a sarcastic chatbot (EMNLP 2021 demo), and investigated when it is appropriate for chatbots to be sarcastic, and how they should formulate their utterances (ACL 2022 paper).
Along the way, I had fun as an intern at Frontier Development Lab in 2019 (20201 Nature (Scientific Reports) paper), at Huawei in 2020 (COLING 2022 paper), and at Amazon Alexa AI in 2021 (paper in the baking 👨🏻🍳). See below, in the Work section.
2017 - 2018: MRes in Data Science at the University of Edinburgh
I used computational methods to detect the presence of sarcasm in tweets, together with my supervisor, Walid Magdy.
2012 - 2013: MSc in Computer Science at the University of Oxford
I worked with Phil Blunsom on building character-level language models for the Romanian language using recurrent neural networks.
2009 - 2012: BSc in Computer Science at Jacobs University Bremen
This is where my interest in natural language processing was triggered, working with Michael Kohlhase.
Teaching
- 2021: Lab demonstrator for Text Technologies in Data Science at the University of Edinburgh.
- 2010 and 2011: Teaching assistant for Programming in C/C++ at Jacobs University Bremen.
Media coverage
Our paper, Towards global flood mapping onboard low cost satellites with machine learning, published in Nature (Scientific Reports) in 2021, was covered by the following articles:
- University of Oxford: Artificial Intelligence pioneered at Oxford to detect floods launches into space
- The Watchers: WorldFloods – AI pioneered at Oxford for global flood mapping launches into space
- Innovation News Network: A look at historic breakthroughs in flood mapping from space
- Homeland Security News Wire: Detecting Floods from Space Using Artificial Intelligence
- Chinese Academy of Sciences: AI加持遥感技术能否为防汛"备料"
- China Science Communication: AI加持,遥瞰洪涛
Patents
European patent office
- Processing communications in a computing arrangement for semantic understanding and interpretation of code-switchingSourav Dutta, Silviu Vlad Oprea, Haytham Assem, and Hu PengPatent WO2022069030A1 issued from application PCT/EP2020/077336. 2022.html
My Google Patents page is here.
Publications
Figurative language comprehension
- Sarcasm Detection is Way Too Easy! An Empirical Comparison of Human and Machine Sarcasm DetectionIbrahim Abu Farha, Steven Wilson, Silviu Vlad Oprea, and Walid MagdyFindings of the Association for Computational Linguistics. 2022.pdf
Computational social science
Controllable text generation
Machine translation
- Multi-Stage Framework with Refinement Based Point Set Registration for Unsupervised Bi-Lingual Word AlignmentSilviu Vlad Oprea, Sourav Dutta, and Haytham AssemProceedings of the 29th International Conference on Computational Linguistics. 2022.pdf
Computer vision
- Towards global flood mapping onboard low cost satellites with machine learningGonzalo Mateo-Garcia*, Joshua Veitch-Michaelis*, Lewis Smith*, Silviu Vlad Oprea, Guy Schumann, Yarin Gal, Atılım Güneş Baydin, and Dietmar BackesNature (Scientific Reports). 2021.html
* indicates equal contribution. Check the full list of publications on my Google Scholar profile.