RESEARCH

My current research focuses on hypergraph-based learning models for datasets with many modalities (e.g., text, image, and video), with applications in online news reporting and social media analysis. By extracting multimodal features using natural language processing (NLP) and computer vision (CV) techniques, I analyze the media’s responsibility for biased representations of social minorities, focusing on gender, race, migrants, and refugees. I have also worked on developing Bayesian deep learning for time series forecasting and on implementing convolutional neural networks (CNNs) to study the neural mechanisms of political ideology using an fMRI brain dataset.

If you are interested in my research, please feel free to contact me (se.yang@northeastern.edu).

Advisee: Zhen Gao, PhD candidate in Network Science, Northeastern University

Research Assistants: Danielle Shariff, PhD student in Political Science, Northeastern University

Fan Zichen, Master's student in Applied Quantitative Methods & Social Analysis, Northeastern University

Keval Amol Dave, Master's student in Media Innovation and Data Communication, Northeastern University

Uzair Farid, Master's student in Applied Quantitative Methods & Social Analysis, Northeastern University

Sara Fuernkranz, Master's student in Applied Quantitative Methods & Social Analysis, Northeastern University

HunJun Shin, Master's student in Computer Science, Northeastern University

Pratham Sachinbhai Shah, Master's student in Computer Science, Northeastern University

Ella Bramwell, Undergraduate in English and Journalism, Northeastern University

Bayesian Latent Class Profile Analysis for Sequential Political Development


Seo Eun Yang and Sara Fuernkranz

Political life unfolds in sequences. Citizens do not adopt extreme partisan identities overnight; interstate disputes do not erupt into war without warning; sanctions campaigns do not resolve instantaneously. Across these domains, the pattern of states actors occupy across multiple time points, their developmental trajectory, carries information that no single cross-sectional observation can reveal. Yet quantitative political science lacks a unified framework for modeling this stage-sequential logic. Cross-sectional latent class analysis recovers political typologies but discards developmental history; latent transition analysis incorporates longitudinal dynamics but under the restrictive first-order Markov assumption that prior trajectory is irrelevant once the most recent state is known. We introduce Bayesian Latent Class Profile Analysis (LCPA) to political science as a method that resolves this tension, simultaneously recovering the latent classes actors occupy at each occasion and the latent profiles, common sequences of class memberships, that characterize distinct developmental trajectories across the full observation window. We extend the original framework in three directions: Bayesian MCMC estimation that yields full posterior distributions and remains reliable when age-stratified or episode-restricted subsamples strain EM convergence; chain-averaged individual-level posteriors that propagate parameter uncertainty through to classification probabilities; and a Bayesian p-value procedure for covariate inference that avoids unreliable Hessian-based approximations near parameter boundaries. Two applications demonstrate the framework: political polarization using ANES (2016–2020–2024) and SHP (2017–2020–2023) panel data, and militarized dispute escalation and sanctions campaigns in international relations.
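
The idea of chain-averaged individual-level posteriors can be illustrated with a deliberately simplified sketch. It assumes a single-occasion latent class model with binary indicators, not the full LCPA, and the two "MCMC draws" below are invented numbers rather than output from the authors' sampler. The function names `class_posterior` and `chain_averaged_posterior` are hypothetical.

```python
def class_posterior(response, weights, item_probs):
    """Posterior class membership for one respondent under ONE parameter draw.

    response   : list of 0/1 indicators
    weights    : class weights, weights[k] = P(class k)
    item_probs : item_probs[k][j] = P(item j = 1 | class k)
    """
    joint = []
    for w, probs in zip(weights, item_probs):
        likelihood = w
        for r, p in zip(response, probs):
            likelihood *= p if r == 1 else (1 - p)
        joint.append(likelihood)
    total = sum(joint)
    return [j / total for j in joint]


def chain_averaged_posterior(response, draws):
    """Average the per-draw posteriors so parameter uncertainty carries through."""
    per_draw = [class_posterior(response, d["weights"], d["item_probs"])
                for d in draws]
    return [sum(column) / len(per_draw) for column in zip(*per_draw)]


# Two invented posterior draws for a two-class, three-item model.
draws = [
    {"weights": [0.5, 0.5], "item_probs": [[0.9, 0.8, 0.2], [0.2, 0.3, 0.7]]},
    {"weights": [0.6, 0.4], "item_probs": [[0.8, 0.7, 0.3], [0.3, 0.2, 0.6]]},
]
membership = chain_averaged_posterior([1, 1, 0], draws)
```

Averaging over draws, rather than plugging in a single point estimate, is what lets the classification probabilities reflect uncertainty in the class weights and item probabilities.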

Beyond Pixels and Prose:
Evaluating Multimodal LLM Performance in Capturing Political Stance in Turkish Influencer Content

Seo Eun Yang, Fan Zichen and Nora Suren

Political activists in authoritarian contexts like Turkey have developed sophisticated multimodal strategies for evading censorship on social media — layering political signals across background imagery, typography, coded slang, and visual metaphors. As AI-based content moderation is increasingly deployed on global platforms, a critical question emerges: can state-of-the-art multimodal large language models (MLLMs) actually "read" these politically encoded signals? This study presents the first systematic social audit of MLLM performance on a novel expert-labeled "in-the-wild" corpus of 20 Turkish political influencers, stratified across three political risk tiers (low, medium, high). Evaluating eight leading models across cross-lingual prompting strategies, we find that AI systems consistently fail on the dimension that matters most for platform governance: political risk classification. Multiple leading models perform near chance level on minority risk categories, corresponding to the highest-risk activist content. No universal prompting strategy exists: optimal performance is model-specific and culturally contingent, with no consistent advantage of English over Turkish prompting. We argue that these failures reflect a structural "visibility gap" — AI systems render politically marginal, censorship-evading voices invisible to the very infrastructure increasingly tasked with governing global digital discourse.

Integrating Human Gaze Patterns and Machine Learning for Enhanced Detection of Visual Bias in Online Political Advertising

Lead PI: Seo Eun Yang

Co-PI: Yakov Bart

As visual messages increasingly dominate online political advertising, detecting visual bias has become crucial for protecting democratic processes from disinformation. This project addresses the challenge of identifying visual bias by combining human semantic knowledge and gaze behavior with existing computer vision frameworks. Using a subset of political ads from the Facebook Ad Library and eye-tracking experiments, the project examines how people perceive visual bias in advertisements by analyzing gaze patterns, such as fixation duration and transitions between image segments. These gaze features are then integrated with visual features using machine learning classifiers to detect bias. By combining human perception data with machine learning, the project enhances automated bias detection, offering insights into visual persuasion techniques and improving media literacy to help the public critically evaluate political content.
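
As a rough illustration of the gaze features mentioned above, the following sketch computes total fixation duration per image segment and counts transitions between segments. The segment labels, durations, and the `gaze_features` helper are all hypothetical; the project's actual eye-tracking pipeline is not described here.

```python
from collections import Counter

# Hypothetical scanpath for one viewer and one ad:
# (image segment label, fixation duration in milliseconds).
fixations = [
    ("face", 220), ("text", 180), ("face", 300),
    ("logo", 120), ("text", 260), ("text", 140),
]


def gaze_features(fixations):
    """Return (total duration per segment, transition counts between segments)."""
    total = Counter()
    transitions = Counter()
    for segment, duration in fixations:
        total[segment] += duration
    # A transition is a move from one segment to a different one;
    # consecutive fixations on the same segment are not counted.
    for (prev, _), (curr, _) in zip(fixations, fixations[1:]):
        if prev != curr:
            transitions[(prev, curr)] += 1
    return dict(total), dict(transitions)


durations, transitions = gaze_features(fixations)
```

Features of this kind could then be concatenated with visual features before being passed to a classifier.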


This project is supported by IDI Seedling Grant 2024-2025

VR’s Impact on Political Attitudes toward Solitary Confinement

Co-PIs: Seo Eun Yang, Martha Johnson

As VR becomes more accessible, its potential as an 'empathy machine' for promoting pro-social attitudes is of growing interest. Scholars debate whether virtual reality experiences in fact generate empathy or shift attitudes and behavior. A subset of these scholars has further sought to establish whether VR's impact is larger than what can be achieved with traditional (and cheaper, more readily accessible) interventions, like reading about others’ experiences or watching two-dimensional videos.

Our study investigates whether VR increases empathy and shapes individuals' attitudes and behaviors regarding marginalized groups and related political debates. Specifically, we focus on the issue of solitary confinement, and we integrate physiological data to deepen our understanding of emotional arousal as a potential mechanism for VR’s impact. Remarkably few works on VR and societal attitudes use physiological measures, yet skin conductivity and heart rate are well-established measures of emotional arousal. We contend they may provide more objective measures of emotional engagement and empathy than standard questionnaires.

This project is supported by NULab Seedling Grant 2024-2025

Analyzing Social Media Images used in Political Communication

Lead PI: Seo Eun Yang

Co-PI: Yakov Bart

https://cssh.northeastern.edu/nulab/social-media-images-in-politics/

This project focuses on Instagram-based visual communication styles as a distinct form of political persuasion and image-making in the US context. While Twitter and Facebook have been the two main platforms for research on online political engagement, there is scarce research on Instagram and its political marketing. Instagram is an image-based social media platform where political players increasingly use still and moving images for attention-grabbing purposes, self-expression, political messaging, mobilization, and engagement. In today’s visually oriented media environment, heads of government and lawmakers are constantly communicating with visual messages designed to influence public opinion on a wide range of topics. This is because visuals have the capacity to present concrete political ideas and political personas fluently. However, little is known about the role and impact of highly personalized forms of visual political communication on Instagram as an effective e-communication tool for public image and reputation management. For example, we have limited knowledge of distinct visual and verbal patterns in the Instagram posts of Republicans and Democrats, and of whether using a certain visual strategy may provide an edge in competitive election campaigns.

This project is supported by NULab Seedling Grant 2023-2024

The Dynamic Relationships
between Media Narratives, Foreign Policy, and Public Opinion


Lead PI: Seo Eun Yang

Co-PIs: Xuechen Chen, Myojung Chung

Scholars in International Relations (IR) and Political Communication have established that media narratives of global events have a far-reaching impact on public opinion and foreign policy behaviors. However, there is limited empirical research that unpacks the complex relationship between media discourse, foreign policy, and the public's responses to global crises. Moreover, existing research on this topic tends to rely on small-scale media datasets and labor-intensive case studies, which hinders comparative analysis of large-scale multilingual media narratives across different cultural and political contexts. To fill these research gaps, this project adopts cutting-edge artificial intelligence techniques to uncover the relationship between media narratives, foreign policy, and public opinion concerning contemporary warfare and humanitarian crises. By bringing together an interdisciplinary team with its roots in Computational Social Science, IR, and Media Studies, this project also seeks greater societal impact by responding to the pressing need for effective models and tools that policy-makers and business sectors can use to make scientific predictions regarding international actors’ political communication, diplomacy, and the public’s responses to global crises.

This project is supported by FY23 Transforming Interdisciplinary Experiential Research (TIER) 1

Functional Connectivity Signatures of Political Ideology

Seo Eun Yang, James Wilson, Zhong-Lin Lu, Skyler Cranmer

Paper link: https://doi.org/10.1093/pnasnexus/pgac066

News Article: https://news.osu.edu/brain-scans-remarkably-good-at-predicting-political-ideology/

Emerging research has begun investigating the neural underpinnings of the biological and psychological differences that drive political ideology, attitudes, and actions. Here we explore the neurological roots of politics through conducting a large sample, whole-brain analysis of functional connectivity (FC) across common fMRI tasks. Using convolutional neural networks, we develop predictive models of ideology using FC from fMRI scans for nine standard task-based settings in a novel cohort of healthy adults (n = 174, age range: 18-40, mean = 21.43) from the Ohio State University Wellbeing Project. Our analyses suggest that liberals and conservatives have noticeable and discriminative differences in functional connectivity that can be identified with high accuracy using contemporary artificial intelligence methods and that such analyses complement contemporary models relying on socio-economic and survey-based responses. Functional connectivity signatures from retrieval, empathy, and monetary reward tasks are identified as important and powerful predictors of conservatism, and activations of the amygdala, inferior frontal gyrus, and hippocampus are most strongly associated with political affiliation. Although the direction of causality is unclear, this study suggests that the biological and neurological roots of political behavior run much deeper than previously thought.
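
For readers unfamiliar with functional connectivity, the sketch below shows one common way to compute it: the Pearson correlation between the time series of each pair of brain regions. This is an assumption for illustration, not a description of the paper's preprocessing, and the region signals are invented toy values. A matrix of this kind is the sort of input a convolutional neural network could be trained on.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length signals."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def functional_connectivity(roi_series):
    """Map every ordered pair of regions to the correlation of their signals."""
    names = list(roi_series)
    return {(a, b): pearson(roi_series[a], roi_series[b])
            for a in names for b in names}


# Invented toy signals for three regions of interest (five time points each).
roi_series = {
    "amygdala": [0.1, 0.4, 0.3, 0.8, 0.6],
    "hippocampus": [0.2, 0.5, 0.4, 0.9, 0.7],
    "inferior_frontal_gyrus": [0.9, 0.2, 0.6, 0.1, 0.5],
}
fc = functional_connectivity(roi_series)
```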

Beyond Pairwise Relationship: Hypergraph as a new Graph-based Paradigm in Political Networks

Seo Eun Yang

Multimodal datasets contain a huge amount of information from diverse modalities, such as text, audio, and video. The big data revolution and artificial intelligence have created unprecedented research opportunities for political scientists to delve deeply into huge collections of multimedia objects. A growing number of scholars recognize the need for a data-driven method that combines diverse unstructured data such as text, audio, and video in a unified model and encodes the complex relationships between them. To address these challenges, this project introduces the hypergraph as a new graph-based paradigm to handle multimodal datasets and model rich patterns of complex relationships among multimedia objects. Specifically, I introduce three hypergraph-based learning models and various applications ranging from political communication to legislative politics.
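
To make the contrast with pairwise graphs concrete, here is a minimal sketch of a hypergraph stored as an incidence matrix, where a single hyperedge can join any number of items. The articles and hyperedge names are invented for illustration and do not come from the project's data.

```python
# Hypothetical hyperedges: each one groups ALL items sharing a property,
# so an edge may connect three or more nodes at once.
hyperedges = {
    "shares_word:border": ["article_1", "article_2", "article_3"],
    "shares_visual:crowd": ["article_2", "article_3", "article_4"],
    "same_outlet": ["article_1", "article_4"],
}

nodes = sorted({n for members in hyperedges.values() for n in members})
edges = list(hyperedges)

# Incidence matrix H: H[i][j] = 1 if node i belongs to hyperedge j, else 0.
H = [[1 if node in hyperedges[edge] else 0 for edge in edges]
     for node in nodes]

# Node degree: number of hyperedges a node belongs to (row sums of H).
node_degree = {node: sum(row) for node, row in zip(nodes, H)}

# Edge degree: number of nodes inside a hyperedge (column sums of H).
edge_degree = {edge: sum(H[i][j] for i in range(len(nodes)))
               for j, edge in enumerate(edges)}
```

Because textual, visual, and metadata relations all become columns of the same matrix, different modalities can be represented in one structure.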

A Picture is worth a Thousand Words.

Machine-Learning Visual Framing Analysis

Seo Eun Yang

This paper presents an automated machine learning method to jointly explore word phrases and visual features of photographs in an unsupervised manner to measure media bias in contemporary media sources. I develop a scalable hypergraph regularized tensor decomposition that maps multimedia items stored in a third-order tensor into a low-dimensional semantic space to uncover hidden topic structures in media coverage. Analyzing 173,204 articles with news photographs from 145 online newspapers for political bias in news reporting about abortion and immigration, my method examines the patterns of news reporting on the visual and verbal level and identifies politically charged phrases and visual characteristics.

Image with Text:

Multimodal Framing Analysis of Online News Coverage on the European Refugee Crisis

Seo Eun Yang

In news reporting about conflict and crisis, photographs convey stories that generate emotions of all kinds that words cannot always deliver. While the rapid growth of online news photographs has created unprecedented research opportunities, quantitative approaches that deal with the volume, variety, and complexity of both images and texts have lagged behind in social science. To address these challenges, this paper introduces a new method for quantitative framing research that examines the patterns of news reporting on the visual and verbal level and explores image-text relations in news stories. Specifically, I introduce the hypergraph as a new graph-based method to integrate the various types of data and model their complex relationships in a network. Using the hypergraph, I develop a hypergraph regularized topic model that fuses the visual, textual, and other multimedia features simultaneously to find the latent topic representation in media coverage during the European refugee crisis in 2015.

Automatically Finding Agenda Dimensions
from UNGA texts

Seo Eun Yang, Jared F. Edgerton

Much of the rapid growth of information originates in unstructured data in the wild, such as text, rather than in structured information stored in databases. Every year, the UN generates thousands of publications and documents such as draft resolutions, annual reports, meeting records, agendas, vote records, and lists of participants. Currently, the UN offers one million digital links that contain bibliographic metadata records and text-heavy data. While the explosion of UN documents has created unprecedented research opportunities for IR scholars, quantitative approaches that deal with the volume, variety, and complexity of such data have not been sufficiently introduced. A core research challenge is how to turn such massive unstructured data into structured knowledge and integrate structured and unstructured data in a unified model. Ignoring the vast body of UNGA texts and records forfeits valuable insights into states’ policy preferences, political proximity, and the political implications of change in the UN Security Council. To address this, this project develops a machine learning model for a total of 427,253 draft resolutions for 193 countries collected over 70 years. We will first discover the underlying agenda dimensions from large-scale draft resolutions. Then, we will evaluate the model by predicting which countries dominate specific agendas by mapping each country to each agenda dimension.

Bayesian Deep Learning for Identifying Granger Causal Graphs and Forecasting Political Dynamics

Seo Eun Yang, Skyler Cranmer, Caleb Pomeroy

In political science, time series modeling and conflict forecasting have traditionally relied on regression models of various types with parametric assumptions. Such pre-specified regression models still face limitations in many empirical applications. First, classical Bayesian time series models do not scale. With the advent of Big Data, we now have many alternative ways to forecast conflicts by extracting insights from massive high-quality data. Second, identifying a suitable forecasting model for a particular time series beforehand is often not possible because our domain knowledge is lacking or incomplete. To address such challenges, we propose Bayesian scalable causal graph learning (BSCGL). BSCGL models learn the form of the mapping function between input and output directly from data and capture the nonlinearities that traditional linear/nonlinear statistical models cannot fully capture. Thus, more complex relationships between time series can be discovered without relying on domain knowledge or any distributional assumption, resulting in better in-sample and out-of-sample prediction performance. Our proposed model also discovers nonlinearities in the underlying Granger causal mechanisms in time series.
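
For context, the classical linear notion of Granger causality that BSCGL generalizes can be sketched as follows: x is said to Granger-cause y if adding lagged x to an autoregression of y reduces the prediction error. The code below is this textbook linear comparison with one lag and no intercept, not the BSCGL model, and the series are toy data constructed so that y depends only on the previous value of x.

```python
def rss_restricted(y):
    """Residual sum of squares for y_t = a * y_{t-1}, fitted by least squares."""
    target, lag_y = y[1:], y[:-1]
    a = sum(t * l for t, l in zip(target, lag_y)) / sum(l * l for l in lag_y)
    return sum((t - a * l) ** 2 for t, l in zip(target, lag_y))


def rss_unrestricted(y, x):
    """Residual sum of squares for y_t = a * y_{t-1} + b * x_{t-1}.

    The two coefficients are obtained by solving the 2x2 normal equations
    with Cramer's rule.
    """
    target, lag_y, lag_x = y[1:], y[:-1], x[:-1]
    syy = sum(v * v for v in lag_y)
    sxx = sum(v * v for v in lag_x)
    syx = sum(p * q for p, q in zip(lag_y, lag_x))
    sy_t = sum(p * t for p, t in zip(lag_y, target))
    sx_t = sum(q * t for q, t in zip(lag_x, target))
    det = syy * sxx - syx * syx
    a = (sy_t * sxx - syx * sx_t) / det
    b = (syy * sx_t - syx * sy_t) / det
    return sum((t - a * p - b * q) ** 2
               for t, p, q in zip(target, lag_y, lag_x))


# Toy data: y at time t is exactly 0.8 times x at time t - 1.
x = [1, -1, 2, 0, -2, 1, 3, -1, 0, 2]
y = [0.0] + [0.8 * v for v in x[:-1]]

restricted = rss_restricted(y)
unrestricted = rss_unrestricted(y, x)
```

A large drop from the restricted to the unrestricted error is the evidence a linear Granger test formalizes with an F statistic. A nonlinear approach replaces the two linear regressions with flexible learned functions.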

Seo Eun "Sunny" Yang
