Posts by Collection

course_projects

Distributed Movie Recommendation Pipelines with Apache Spark

This project builds a full-scale movie recommendation system using Apache Spark, incorporating data analytics, keyword-based filtering. Implemented on the MovieLens dataset, the system supports efficient data preprocessing, incremental rating updates, and personalized movie recommendations through LSH and predictive models.

NameCoin on Peerster: A Blockchain-Based Decentralized DNS Implementation

This project explores the design and implementation of a decentralized DNS system using blockchain and a gossip-based peer protocol. The system supports secure domain registration, updates, transfers, and resolution with robust anti-entropy synchronization and Proof-of-Work consensus. The project evaluates network resilience, consensus reliability, and mining efficiency.

Download Report

Multimodal Modeling of Entrepreneurial Teams: Predicting Opportunity Generation from Audio and Text

This project combines personality and emotion detection with machine learning to predict the number of ideas generated by entrepreneurial teams. We process multimodal data (transcripts and audio) to extract MBTI traits, emotional profiles, and speaker features, and use these to model team-level idea generation. This work advances behavioral modeling in early-stage startup evaluation.

Download Report

journal

publications

Generative Approaches to Kinetic Parameter Inference in Metabolic Networks via Latent Space Exploration

Published in bioRxiv, 2025

We introduce a generative framework for constructing large-scale kinetic metabolic models through latent space exploration. By repurposing pretrained neural network generators across different physiological contexts, our method enables efficient and interpretable inference of kinetic parameters, facilitating targeted model design for diverse metabolic behaviors.

Download Paper

research_projects

Kinetic Parameter Inference in Metabolic Networks via Latent Space Exploration

1 minute read

Published:

We present a novel framework to interpret and control the latent spaces of generative neural network models for kinetic metabolic modeling. By perturbing structured latent spaces learned via REKINDLE or RENAISSANCE, our method generates new dynamic models with targeted properties such as specific response times, regulatory bottlenecks, or alternative physiologies, unlocking deeper insight and reusability across metabolic contexts.

Download Paper

GemmaEdu: Enhancing Scientific Learning via Fine-Tuned Language Models and RAG

1 minute read

We developed an educational chatbot built on the quantized Gemma 2 7B model, optimized with Direct Preference Optimization (DPO) and enhanced with Retrieval-Augmented Generation (RAG). By leveraging fine-tuning on student-generated preference data and incorporating relevant external documents, our system significantly improves accuracy in answering STEM multiple-choice questions, outperforming baseline models like Mistral and Llama2.

Download Report

From Novice to Expert: Dimensionality Reduction and Policy Distillation in Reinforcement Learning for Motor Control

2 minute read

This project investigates how to accelerate motor skill acquisition in reinforcement learning using curriculum-based learning, dimensionality reduction, and policy distillation. Using the Myosuite Baoding balls task, we explore how expert policies can be transferred to novice agents via PCA-reduced feature and action spaces, offering an efficient alternative to prolonged training times.

Download Report

Learning-Based Multi-Robot Lane Navigation: Scalable Trajectory Prediction using Neural Networks

1 minute read

This project was conducted at DISAL, EPFL. We explore trajectory generation for multi-robot navigation using neural networks. We propose a scalable alternative to Webots simulation by training models using graph neural network, reinforcement and imitation learning. The final approach produces accurate trajectories in a lane-based environment, balancing precision and efficiency in robotic control.

Download Report

work

AI Research Intern — AXA Group Operations

I led applied research and prototyping efforts in multimodal AI, focusing on cross-modal representation learning, graph-based embeddings, and neural search systems. I developed scalable pipelines to generate scene graphs from satellite imagery and knowledge graphs from textual data, and to align their graph embeddings in a shared representation space.

Machine Learning Intern - Pixalione

Developed a machine learning pipeline to forecast daily ad spend on Google Ads based on client-specific campaign data. Deployed a web backend for dynamic budget strategy adjustment, automated alerts, and integration with Azure Cloud infrastructure.

Student Assistant — EPFL

During my studies, I served as a teaching assistant for multiple courses, assisting in lectures, labs, and tutorials