Project Alexandria, a DOE and NNSA initiative, centralizes data management for scientific experiments, addressing the costly and redundant efforts of individual project-level solutions. By providing a federated data repository powered by Databricks and generative AI, Alexandria frees researchers to focus on science, automating curation and computation. LakeFS acts like a version control system for data within Project Alexandria, ensuring that only approved, high-quality data is used for research and AI training.