In this grant-funded role, you will wear many hats - exploratory data scientist, text analysis expert, data pipeline engineer, research collaborator, product manager, and more. You will work closely with the principal investigators and a team of media researchers to research, prototype, and develop data analysis workflows that can scale from initial prototypes to corpora of millions of documents. Some of this will rely on skills you already have, but you will have to do significant work learning news skills and exploring cutting-edge supporting technologies and algorithms. This position provides an opportunity for someone to work on leading tools that support critical research into how social mobilization interacts with media and to help make Media Cloud more useful for researchers and non-profits trying to understand the role of media for democratic processes. We expect scholarly and popular press publications to come out of this research.
Given the conditions created by the ongoing pandemic, this position is open to part-time remote status. However, it does require being on site at Northeastern at regular intervals.
Primary Duties and Responsibilities
Keep up to date on research in data analysis architectures, text classification, hate speech detection, social media platform policies, machine learning, etc. to inform new functionalities in the tooling and research output.
Work with other team members to establish a technical vision for the project.
Contribute to research papers with planning, writing, and data needs.
Maintain, upgrade, and build new data pipelines with data from existing corpora, APIs, and other sources.
Write code that can scale systems to handle ever-expanding data requirements.
Engage in active collaboration and coordination with the cross-institution research team.
Contribute to related project data needs as needed.
Provide budget, logistical, and HR inputs to support grant management.
Contribute to a healthy remote workplace and cultures.