Hi, my name is

Daniel Saeedi

I am PhD Candidate & Research Assistant at Georgia Tech

I'm a PhD candidate in Electrical and Computer Engineering at Georgia Tech, specializing in AI/ML with focus on agentic AI, generative models, NLP, AI safety, and diffusion models. My research spans CryoEM imaging optimization and NASA astrobiology data analysis.

<b>Zoé</b> Miller
AI Researcher

Resume

my Story
Education
Georgia Institute of Technology
Ph.D. in Electrical and Computer Engineering
Sep 2023 - Present

Advisor: Prof. Amirali Aghazadeh. Collaborative Specialization: Machine Learning and Digital Signal Processing. Relevant Coursework: Generative and Geometric Deep Learning (A), Statistical Machine Learning (A).

University of Tehran
B.S. in Computer Engineering
Sep 2019 - Jun 2023

GPA: A - 17.98/20. Relevant Coursework: Neural Networks and Deep Learning, Artificial Intelligence, Algorithm Design, Data Structures and Algorithms, Advanced Programming, Differential Equations.

Experience
AI Safety Research Fellow
Sep 2025 - Present

Research fellowship focused on AI safety and alignment, working on critical challenges in ensuring AI systems remain beneficial and controllable as they become more capable.

Graduate Research Assistant
Georgia Institute of Technology
Sep 2023 - Present

Developing deep learning-based methods to reduce the cost of CryoEM imaging, improving accessibility and efficiency in structural biology research. Leveraging AI agents to accelerate scientific discovery and automate complex research workflows. Analyzing astrobiological data from NASA using AI to advance research on the origin of life.

Research Fellow
Fatima Fellowship
May 2022 - Dec 2022

Detected issues of SOTA debiasing techniques such as SentDebias and INLP in masked language modeling. Developed a debiasing inference technique to mitigate biases in transformer models. Built PyDebiaser, an open-source library including seven debiasing methods in Transformers.

Founder
Jooyan
2014 - 2016

Founded a startup to help companies in hiring skilled programmers through programming challenges. Developed an online course forum (in Farsi) offering a range of learning opportunities with hands-on projects.

History

Press Coverage

Media Spotlight
Nature news highlight
AstroAgents coverage in Nature News

Our AI Scientists for hypothesis generation for Origins of Life is covered in a story published today in Nature.

Read Article
GeorgiaTech
AstroAgents coverahe in Georgia Tech News

As strange as it sounds, the key to understanding life’s origins might lie in artificial intelligence. At least, according to a new approached being pursued by researchers at Georgia Tech.

Read Article
Press Coverage 3
Astrobiology.com Coverage

Read Article
Press Coverage 3
Twitter Coverage

Read Article

AstroAgents

A Multi-Agent AI for Hypothesis Generation from Mass Spectrometry Data
Year: 2025
Keywords: LangChain, Agentic AI, AI for science, Hypotheses generation

Description

With upcoming sample return missions across the solar system and the increasing availability of mass spectrometry data, there is an urgent need for methods that analyze such data within the context of existing astrobiology literature and generate plausible hypotheses regarding the emergence of life on Earth. Hypothesis generation from mass spectrometry data is challenging due to factors such as environmental contaminants, the complexity of spectral peaks, and difficulties in cross-matching these peaks with prior studies. To address these challenges, we introduce AstroAgents, a large language model-based, multi-agent AI system for hypothesis generation from mass spectrometry data. AstroAgents is structured around eight collaborative agents: a data analyst, a planner, three domain scientists, an accumulator, a literature reviewer, and a critic. The system processes mass spectrometry data alongside user-provided research papers.

The data analyst interprets the data, and the planner delegates specific segments to the scientist agents for in-depth exploration. The accumulator then collects and deduplicates the generated hypotheses, and the literature reviewer identifies relevant literature using Semantic Scholar. The critic evaluates the hypotheses, offering rigorous suggestions for improvement. To assess AstroAgents, an astrobiology expert evaluated the novelty and plausibility of more than a hundred hypotheses generated from data obtained from eight meteorites and ten soil samples. Of these hypotheses, 36% were identified as plausible, and among those, 66% were novel.

Description

AstroAgents introduces a novel paradigm that leverages the capabilities of large language models (LLMs) to analyze mass spectrometry data for origin-of-life research. Although this paper primarily focuses on a gas chromatography dataset, our methodology is versatile and can be applied to a wide range of datasets. The comparative performance of Claude 3.5 Sonnet and Gemini 2.0 Flash reveals important insights about the trade-offs between contextual capacity and collaborative ability in multi-agent systems.

Claude 3.5 Sonnet’s superior performance in consistency and clarity suggests that stronger agent collaboration capabilities may be more valuable than expanded context windows for generating reliable scientific hypotheses. However, Gemini 2.0 Flash’s higher novelty scores indicate that larger context windows might facilitate more creative connections across broader knowledge bases.

Selected High-Scoring Hypotheses

AstroAgents generated hypotheses that received high ratings from astrobiology experts.

1
Gemini 2.0: The presence of 1H-Phenalen-1-one or 9H-Fluoren-9-one (ID 44) exclusively in Orgueil and LEW 85311, and Biphenyl (ID 43) in the same meteorites, suggests a unique chemical environment shared by these samples, potentially indicating a similar formation region within the early solar system.
ID 44: Orgueil, LEW 85311
ID 43: Orgueil, LEW 85311
Novelty: 7/10
Literature: 9/10
Clarity: 9/10
Support: 9/10
2
Gemini 2.0: The co-occurrence of multiple unknown compounds in Iceland Soil, Atacama, and GSFC soil suggests that these soils share similar depositional environments and/or source material based on IDs 4, 5, and 10.
Unknown compound m/z 154.0 in Green River Shale and Lignite Soil
Novelty: 7/10
Literature: 10/10
Clarity: 10/10
Support: 8/10
3
Gemini 2.0: The detection of toluene, methylnaphthalenes, acenaphthene, dibenzothiophene, and trimethylnaphthalene in Orgueil and LEW 85311 suggests common formation pathways and stability of these PAHs under different environmental conditions.
Toluene, Methylnaphthalenes, Acenaphthene, Dibenzothiophene in Orgueil, LEW 85311
Novelty: 7/10
Literature: 10/10
Clarity: 10/10
Support: 8/10
4
Claude 3.5: The exclusive detection of 1,2,3,4-tetrahydro phenanthrene (ID 36) in Orgueil and Jbilet Winselwan suggests a specific hydrogenation pathway in certain meteorite parent bodies, indicating distinct redox conditions during organic synthesis.
ID 36: Orgueil, Jbilet Winselwan
ID 42: Orgueil, LEW 85311
Novelty: 4/10
Literature: 8/10
Clarity: 8/10
Support: 7/10
5
Claude 3.5: The detection of possible terpenes exclusively in soil samples indicates that complex branched isoprenoid structures require enzymatic biosynthesis and are not readily formed through abiotic processes in space, making them reliable biomarkers.
IDs 4, 17, 18 (terpenes) found only in soil samples (Iceland, Atacama, Utah, GSFC)
Novelty: 3/10
Literature: 10/10
Clarity: 10/10
Support: 10/10


LifeTracer

Discriminating Abiotic and Biotic Organics in Meteorite and Terrestrial Samples Using Machine Learning on Mass Spectrometry Data
Year: 2025
Keywords: Astrobiology, Mass spectrometry, Machine learning, Meteorites
Publication: Under Review

Description

This is one of the coolest projects I’ve done so far. In this project, I collaborated with two brilliant NASA scientists, Denise Buckner and José Carlos Aponte, to analyze meteorites and soil samples collected from extreme weather conditions—such as Antarctica—and uncover potential signatures of life.

The project involves pre-processing 750 GB of 2D gas chromatography data to distinguish meteorites from soil samples based on their molecular signatures. I developed a Python package called LifeTracer to support future analyses of 2D gas chromatography data. In addition, I created a website to help readers explore the data from the papers more interactively.

Cryo-EM and Diffusion Models

Guided Diffusion for CryoEM imaging purposes
Year: 2025
Keywords: Diffusion, Inverse-problems, CryoEM imaging
Publication: On going research

Description

In this project, I am working at the intersection of diffusion models and CryoEM imaging.

PyDebiaser

A debiasing library for Transformers models
Year: 2022
Keywords: AI Safety, LLMs
Publication: Github

Description

PyDebiaser was one of my undergraduate projects and a part of the Fatima Fellowship, supervised by Dr. Abubakar Abid. PyDebiaser is a Python library designed to reduce or remove bias in Transformer-based language models. It implements seven debiasing techniques, including SentenceDebias (projection-based debiasing of sentence embeddings), INLP (iterative nullspace projection to remove protected attributes), Self-Debias (discouraging biased text generation from within the model), and simpler prompt-based methods such as Bias-Swapping, Character-Neutralization, Prepend-Adjective, and Top-k selection of less toxic outputs. It supports models from Hugging Face (e.g., GPT-2, BERT) and is released under the MIT license.

Jooyan

Jooyan was an online platform for coding practice, skill assessments, and technical interviews.
Year: 2014-2016
Keywords: startup, competitive programming, skill assessments.

Description

As a teenager, I co-founded my first startup with Ali Panahi. Through this project I learned to build a website end-to-end—covering both front-end and back-end using Laravel (famous PHP framwork back then I guess) and I developed skills in pitching ideas and attracting investors.

Our platform aimed to inspire young people in Iran to learn programming and develop the skills needed to succeed in the tech industry. Similar to LeetCode, it offered algorithmic coding challenges and the ability to run code on the server with real-time feedback. This experience sharpened my technical foundations and sparked a lasting passion for creating tools that help others learn and grow.