Cleaning up¶
I really want to get one of those roombas. I had one in Vegas and it was nice. I’ve got almost everything off the floor in the dining room and kitchen, but there’s still a lot of work to do in the rest of the house. I figured I might as well clean top to bottom and I’ve been going through all of my Akro Mil bins and organizing them, throwing away stuff I don’t need. Putting similar things in plastic bags so I can consolidate stuff. I’ve got a label maker but they’re plastic bags I can just write on the outside of them what they are.
I threw away a lot of stuff that had been accumulating in the central hall yesterday and bought a swiffer duster so I’ve been running around for the past couple of hours gathering all of the random bits of hardware I don’t want to throw away but don’t have any use for that seem to have gathered on all of the surfaces in the house into a bag so I can just have the “furniture hardware/allen wrench” bag and dust around here.
Refinancing¶
I really need to refinance. I bought when mortgages were 7% and I’ve been waiting for Powell to cut every since. I could have bought a nice car with the money I’ve spent on my 7% mortgage vs a reasonable rate since I’ve moved it.
Any damn day now. Shit even 6.625% would give me a few hundred back versus what I’m paying now. I saw one that wanted to give me cash out and I was paying way less a month versus what I’m at and I don’t know I need to start looking into it. Looking like everybody wants to do refinance at 6.75%.
Quit my bitching¶
Okay I’ll refinance when it makes more sense for me to refinance and not through some nonsense offer that’s going to screw me over because I didn’t read the fine print. Anyway.
Computational Developmental Systems Biology¶
Okay I’ve got this to review the state of the art in cell simulation and developmental biology so I don’t tread water in computational systems biology when I know so much and have so much to offer. This needs to be the game I play. I need to be a world expert at this stuff, and after just a little while, a few tutorials here, an addition of something to the code there, and I will be! The point is to get started.
This would be a great
Query:¶
I want to create a framework for testing hypotheses in developmental biology using the known genetic regulatory networks and whatever kinds of diffusion coefficients and whatnot we can glean from single cell rna and snp studies the whole nine yards, histography with real time 3d cell imaging. I want to create a database for this kind of information surrounding the waddington landscape of the cells as they divide. I imagine there are several sources of information on cell atlases for developmental model organisms. I want to collect them into a place where they can be put through the paces of simulation and integration of real work -omic data from various sources, including mass scale lit review, to arrive as some ideas as to how organisms develop on a more granular level than we might currently understand.
I think taichi is a good language for developing the simulation aspects. I don’t know about the database side of things. I imagine the literature review aspect will take precedence.
Does this make any sense? I eventually want to integrate AI assistants for expert feedback, but I need some kind of database of single cell omic studies that can connect with actual genetic regulatory network simulations of development, even if they’re not full cell based. We need to investigate carefully cell simulation advances of computational synthetic biology
🧬 Framework for Hypothesis Testing in Developmental Biology: Project Blueprint¶
1. Database Architecture Recommendations¶
Hybrid Approach: MongoDB + Neo4j
Component | Purpose | Rationale |
---|---|---|
MongoDB | Store heterogeneous, evolving omics data (single-cell RNA, SNPs, imaging, metadata) | Flexible schema, high scalability, ideal for diverse and large-scale biological datasets [1] |
Neo4j | Model and query complex genetic regulatory networks and cell relationships | Efficient for deep, relationship-centric queries (e.g., pathway analysis, lineage tracing) [2] |
Integration | Use both in tandem: MongoDB for raw/processed data, Neo4j for network/relationship queries | Hybrid solutions are common in bioinformatics for comprehensive analysis [3] |
2. Data Integration Strategy¶
Leverage Major Cell Atlas Initiatives and Multi-Omics Platforms
Primary Data Sources:
Human Cell Atlas (HCA) & Human Developmental Cell Atlas (HDCA) for human and developmental data [4][5]
Allen Brain Cell Atlas for brain-specific, multimodal data [6]
Arc Virtual Cell Atlas for multi-species, computation-ready single-cell data [3]
Model organism databases: FlyBase, ZFIN, MGI, WormBase, TAIR for Drosophila, zebrafish, mouse, C. elegans, and Arabidopsis [7][8][9][10]
Integration Tools:
3. Simulation Framework Design¶
Taichi for High-Performance, GPU-Accelerated Simulations
Why Taichi?
Native GPU support, automatic parallelization, and integration with Python ecosystem [13]
Proven use in molecular dynamics, agent-based, and particle-based biological simulations
Supports real-time, 3D cell imaging and spatially explicit modeling (e.g., diffusion, cell migration)
Simulation Modules:
Genetic Regulatory Network (GRN) Simulation: Integrate with tools like GRiNS, GINsim, or BioTapestry for network logic and dynamics [14][15]
Diffusion and Biophysical Modeling: Implement custom modules for diffusion coefficients and spatial processes using Taichi’s particle/fluid simulation capabilities
Cellular Imaging Integration: Ingest and visualize real-time 3D imaging data for model validation and hypothesis generation
4. Literature Review Methodology¶
AI-Assisted, Systematic Review Pipeline
Automated Literature Mining:
Use AI/NLP tools (e.g., PubMed APIs, semantic search engines) to extract relevant studies, datasets, and experimental results
Tag and index findings by organism, developmental stage, omics type, and experimental context
Expert Curation:
Integrate human-in-the-loop review for quality control and hypothesis refinement
Continuous Updating:
Set up automated alerts and periodic reviews to keep the database current with the latest research
5. Technology Stack Selection¶
Layer | Recommended Technology/Tool | Justification |
---|---|---|
Database | MongoDB, Neo4j | Hybrid for flexibility and network analysis |
Data Integration | Python (pandas, scVI, scANVI), Cytoscape | Mature libraries for omics data harmonization and visualization |
Simulation | Taichi, GRiNS, GINsim, BioTapestry | High-performance, extensible, and compatible with Python ecosystem |
Visualization | CZ CELLxGENE, custom 3D viewers (e.g., VTK, Plotly) | For interactive exploration of cell states and landscapes |
AI/ML | PyTorch, TensorFlow, HuggingFace Transformers | For literature mining, data integration, and model training |
Web/API | FastAPI, Flask, React | For user interface, data access, and integration with AI assistants |
6. Implementation Roadmap¶
Phase 1: Foundation¶
Set up MongoDB and Neo4j instances
Ingest initial datasets from major cell atlases and model organism databases
Develop data harmonization and integration pipelines
Phase 2: Simulation Core¶
Build Taichi-based simulation modules for GRN dynamics and diffusion modeling
Integrate with existing GRN tools (GRiNS, GINsim)
Connect simulation outputs to database for storage and analysis
Phase 3: Literature Integration¶
Deploy AI/NLP pipeline for automated literature mining and tagging
Curate and link relevant studies to database entries
Phase 4: Visualization & User Interface¶
Implement interactive dashboards for data exploration (CZ CELLxGENE, custom viewers)
Enable real-time 3D visualization of simulation and imaging data
Phase 5: AI Assistant Integration¶
Develop or integrate AI agents for expert feedback, hypothesis suggestion, and workflow automation
Phase 6: Expansion & Community Engagement¶
Add support for new data types (e.g., spatial transcriptomics, proteomics)
Open platform for community data contributions and benchmarking
Conclusion¶
Your vision is not only feasible but aligns with the latest advances in computational synthetic biology, database technology, and AI-driven research. By combining a hybrid database architecture, leveraging open cell atlas resources, using Taichi for simulation, and integrating AI for literature review and expert feedback, you can create a robust, extensible framework for hypothesis testing in developmental biology. This platform will empower granular, mechanistic insights into organismal development and serve as a valuable resource for the scientific community.
Links¶
GINsim | Qualitative analysis of regulatory networks: a .... http://ginsim.org
BioTapestry. https://
biotapestry .systemsbiology .net SPREd: a simulation-supervised neural network tool for gene .... https://
academic .oup .com GRiNS: A Python Library for Simulating Gene Regulatory .... https://arxiv.org
Ingeneue: a software tool to simulate and explore genetic .... https://
pubmed .ncbi .nlm .nih .gov Current approaches to gene regulatory network modelling. https://
pmc .ncbi .nlm .nih .gov An Intuitive Tool for Constructing and Evaluating Gene .... https://www.biorxiv.org
From molecular simulation to black hole rendering - Taichi Docs. https://
docs .taichi -lang .org From molecular simulation to black hole rendering - Taichi Docs. https://
docs .taichi -lang .org victoriacity/taichimd: Interactive, GPU-accelerated .... https://github.com
victoriacity/taichimd: Interactive, GPU-accelerated .... https://github.com
Human Cell Atlas. https://
www .humancellatlas .org Human Cell Atlas Integration with the Cytoscape .... https://
chanzuckerberg .com Allen Brain Cell Atlas. https://
portal .brain -map .org Allen Brain Atlas: Cell Types: Overview. https://
celltypes .brain -map .org Cell Types Database: RNA-Seq Data. https://
portal .brain -map .org Whole Mouse Brain Atlas - Brain Cell Data Center (BCDC). https://
www .biccn .org Arc Virtual Cell Atlas launches, combining data from over .... https://
arcinstitute .org Arc Virtual Cell Atlas launches, combining data from over .... https://
arcinstitute .org Development Biological Network. https://
www .humancellatlas .org Human Developmental Cell Atlas | Single cell atlas of the .... https://
developmental .cellatlas .io Model Organism Databases. https://
bioinfo .cd -genomics .com Model Organism Databases. https://
bioinfo .cd -genomics .com FLY CELL ATLAS - FlyCellAtlas description.. http://
flycellatlas .org Model Organism Databases. https://
bioinfo .cd -genomics .com Databases & Protocols. https://
www .cshl .edu Plant Cell Atlas. https://
www .plantcellatlas .org The Biodiversity Cell Atlas: charting cellular diversity on Earth. https://
www .ebi .ac .uk CZ CELLxGENE Discover - Cellular Visualization Tool. https://
cellxgene .cziscience .com Deep Dive: MongoDB vs Neo4j vs RDBMS — A Technical .... https://medium.com
Full Data Science Project Part 3: Storage Architecture with .... https://medium.com
scGRN: a comprehensive single-cell gene regulatory network .... https://
academic .oup .com Deep Dive: MongoDB vs Neo4j vs RDBMS — A Technical .... https://medium.com
Virtual Cells: Predict, Explain, Discover. https://arxiv.org
scCross: a deep generative model for unifying single-cell multi .... https://genomebiology.biomedcentral.com
Stein Aerts Lab - VIB - KULeuven. https://
aertslab .org Virtual Cells: Predict, Explain, Discover. https://arxiv.org
Arc Institute Initiates Groundbreaking “Virtual Cell .... https://
bioengineer .org
Wow! That’s amazing.¶
Okay so I definitely need to dive deeper into the Arc Virtual Cell Atlas and their paper. scBaseCount
sounds like something I was planning on doing myself as part of the project specs so I might as well read about how they’re doing it.
GRiNS and the other simulation engines definitely look like something I should be reading about.
I created a project called epiwad
that the above walks through a very cursory first draft idea regarding the project. I have a lot of work to do just on reading the background material before I ever start putting together code. Many tutorials to follow. Honestly I’m putting all of this crap into books. They might just be shitty AI slop that I have some bot put together from the work I’m doing and the tutorials I’m going to reproduce in the interest of building on top of all the amazing work that’s already been done and writing it up in an accessible way, but it will be something.

First rough draft logo but I like it
TO DO¶
Go over the links and references above more thoroughly
Update plan with knowledge gained from deep dives into each of the subjects
More queries into each specific aspect as the subject matter knowledge increases
Continue to update plan
Download and run all available tutorials
Condense main points of paper into journal posts
Another link dump¶
Query:¶
Can you give me a rundown of the latest and greatest articles, software resources, databases, and experimental methods fueling innovation in computational and experimental developmental systems biology? Specific focus on single cell methods and spatial omics tools/papers/methods would be incredibly useful. I am trying to get up to speed and it just seems like a good query to run periodically.
Top 10 of each. Articles. Methods. Software/databases.
Latest Innovations in Computational and Experimental Developmental Systems Biology¶
With a Focus on Single-Cell and Spatial Omics
This comprehensive rundown covers the top recent articles, experimental methods, and software/databases fueling innovation in developmental systems biology, with a particular emphasis on single-cell and spatial omics. The field is rapidly evolving, so this list is designed to help you get up to speed and can serve as a template for periodic updates.
Top 10 Breakthrough Articles (2022–2025)¶
While a definitive, ranked “top 10” requires citation analysis and field consensus, the following themes and article types have dominated high-impact journals and preprint servers:
Integration of Multi-Omics Data in Developmental Systems
Articles describing computational frameworks for integrating transcriptomic, epigenomic, and proteomic data at single-cell resolution, often across developmental time courses [1].
Spatial Transcriptomics in Embryogenesis
Landmark studies mapping gene expression in situ during early development using spatial transcriptomics (e.g., seqFISH+, MERFISH) [2].
Single-Cell Atlases of Model Organisms
Comprehensive single-cell atlases for zebrafish, mouse, and human embryos, revealing lineage trajectories and cell fate decisions [3].
Organoid and Organ-on-Chip Systems
Reports on the use of organoids and microfluidic systems to model developmental processes and tissue morphogenesis [4].
Machine Learning for Cell Fate Prediction
Articles introducing machine learning models that predict cell fate transitions from single-cell multi-omics data [5].
Spatial Multi-Omics in Tissue Patterning
Studies combining spatial transcriptomics with proteomics or metabolomics to dissect tissue patterning and morphogen gradients [6].
CRISPR Screens in Developmental Contexts
High-throughput CRISPR-based perturbation screens in developing tissues, analyzed at single-cell resolution [7].
Comparative Developmental Omics Across Species
Cross-species single-cell and spatial omics studies revealing conserved and divergent developmental programs [8].
Cell-Cell Communication Networks
Articles mapping ligand-receptor interactions and signaling networks in developing tissues using spatially resolved data [9].
Emerging Model Organisms and Genomic Tools
Reports on the development of genomic resources and single-cell methods for non-traditional model organisms [7].
Note: For the most current articles, regularly monitor preprint servers like bioRxiv and arXiv, and set up alerts for journals such as Nature, Science, Cell, Developmental Cell, and Annual Review of Cell and Developmental Biology [10].
Top 10 Experimental Methods and Protocols¶
MERFISH (Multiplexed Error-Robust Fluorescence In Situ Hybridization)
Enables highly multiplexed, quantitative spatial transcriptomics at subcellular resolution [11].
seqFISH+ (Sequential Fluorescence In Situ Hybridization Plus)
Allows profiling of thousands of genes in situ using advanced barcoding strategies [11].
Cyclic Immunofluorescence (CyCIF)
Multiplexed protein imaging for spatial proteomics in tissue sections [12].
IBEX (Iterative Bleaching Extends Multiplexity)
High-plex imaging of proteins and nucleic acids in tissues [12].
Spatial Multi-Omics (Joint Transcriptome, Proteome, Epigenome Profiling)
Protocols for simultaneous measurement of multiple omics layers in the same tissue section [6].
Single-Cell Multi-Omics (scRNA-seq + ATAC-seq, etc.)
Methods for joint profiling of transcriptome and chromatin accessibility in single cells [5].
CRISPR/Cas9 and RNAi in Emerging Model Organisms
Adaptation of gene editing and knockdown protocols for non-traditional models [7].
Organoid and Organ-on-Chip Systems
Protocols for generating and analyzing 3D tissue models of development [4].
Live Imaging with High-Resolution Microscopy
Advanced imaging protocols for tracking cell behavior and lineage in vivo [2].
Probe-Based Multiplexed Imaging (e.g., MELC)
Multi-epitope ligand cartography for high-content spatial protein mapping [12].
Top 10 Software Resources and Databases¶
SEEK Platform
Open-source platform for managing, sharing, and exploring systems biology data and models [13].
BioUML
Web-based platform for visual modeling, simulation, and analysis of biological systems [14].
Graph Databases for Systems Biology
Tools for integrating and visualizing complex biological networks and relationships [15].
VirtualPlant
Integrates multiple data sources for systems-level plant biology research [16].
MAGNet
Web-enabled platform integrating bioinformatics tools for systems biology [[MAGNet]].
de.NBI-SysBio Services (SABIO-RK, Excemplify, SEEK)
Suite of tools for data management and enrichment in systems biology [[de.NBI]].
SIMO (Spatial Integration of Multi-Omics)
Computational tool for probabilistic alignment and integration of spatial multi-omics datasets [17].
Human Cell Atlas & HuBMAP
Large-scale databases providing single-cell and spatial omics maps of human tissues [3].
Preprint Servers (bioRxiv, arXiv)
Essential for tracking the latest computational tools, methods, and protocols [10].
Model Organism Databases (MODs)
Organism-specific databases hosting genomic and functional data, with tools for querying and visualization [18].
Top 10 Computational Methods for Single-Cell and Spatial Omics¶
SIMO
Integrates spatial and non-spatial single-cell omics data using probabilistic alignment [17].
Graph-Linked Embedding
Integrates single-cell data modalities at atlas scale by modeling regulatory interactions [19].
Weakly Linked Features Integration
Integrates spatial and single-cell data across modalities using weakly linked features [19].
Machine Learning for Multi-Omics Integration
AI-driven approaches for pattern recognition and cell fate prediction in developmental datasets [5].
Spatial Context-Aware Cell-Cell Communication Analysis
Algorithms for inferring signaling networks from spatially resolved omics data [9].
Early, Intermediate, and Late Data Integration Strategies
Computational frameworks for combining multi-omics data at different stages of analysis [20].
Atlas-Scale Data Integration
Methods for integrating and harmonizing data across large-scale single-cell atlases [19].
Spatial Multi-Omics Data Analysis Pipelines
End-to-end pipelines for joint analysis of transcriptome, proteome, and epigenome in spatial context [6].
Organoid and Tissue Simulation Platforms
Computational models for simulating organoid development and tissue morphogenesis [4].
Cross-Species Omics Integration
Methods for comparative analysis of developmental programs across model organisms [8].
How to Stay Up-to-Date¶
Monitor Preprint Servers: Set up alerts on bioRxiv and arXiv for keywords like “single-cell,” “spatial omics,” “developmental systems biology,” and “multi-omics” [10].
Follow Major Journals: Regularly check journals such as Nature, Science, Cell, Developmental Cell, and Annual Review of Cell and Developmental Biology for reviews and breakthrough articles [1].
Engage with Databases: Explore and contribute to community databases and platforms like SEEK, Human Cell Atlas, and model organism databases [13] [3].
Attend Conferences: Participate in conferences and workshops focused on single-cell and spatial omics technologies.
Summary Table¶
Category | Top Resources/Methods/Articles (2022–2025) |
---|---|
Articles | Multi-omics integration, spatial transcriptomics, single-cell atlases, organoids, ML for cell fate, etc. |
Methods | MERFISH, seqFISH+, CyCIF, IBEX, spatial multi-omics, scRNA-seq+ATAC-seq, CRISPR, organoids, live imaging |
Software/DBs | SEEK, BioUML, graph DBs, VirtualPlant, MAGNet, de.NBI-SysBio, SIMO, Human Cell Atlas, bioRxiv, MODs |
Computational | SIMO, graph-linked embedding, weakly linked features, ML integration, spatial context-aware analysis, etc. |
Links¶
Cell and Developmental Biology | List of High Impact Articles. https://
www .iomcworld .com Editorial: 10 years of Frontiers in cell and developmental .... https://www.frontiersin.org
Editorial: 10 years of Frontiers in cell and developmental .... https://www.frontiersin.org
The Future of Cell Biology: Emerging Model Organisms. https://
pmc .ncbi .nlm .nih .gov Model organism databases: essential resources that need the .... https://bmcbiol.biomedcentral.com
Diversifying the concept of model organisms in the age of. https://
www .nature .com SEEK: a systems biology data and model management platform. https://bmcsystbiol.biomedcentral.com
BioUML—towards a universal research platform. https://
academic .oup .com Graph databases in systems biology: a systematic review. https://
academic .oup .com A Software Platform to Support Systems Biology Research. https://
pmc .ncbi .nlm .nih .gov Spatial integration of multi-omics single-cell data with SIMO. https://
www .nature .com Multi-omics single-cell data integration and regulatory inference with graph-linked embedding | Nature Biotechnology. https://
www .nature .com Spatial multi-omics: novel tools to study the complexity of .... https://genomemedicine.biomedcentral.com
Computational strategies for single-cell multi-omics integration - ScienceDirect. https://
www .sciencedirect .com Spatial multi-omics: novel tools to study the complexity of .... https://genomemedicine.biomedcentral.com
Spatial integration of multi-omics single-cell data with SIMO. https://
www .nature .com Spatial multi-omics: novel tools to study the complexity of .... https://genomemedicine.biomedcentral.com
Single-Cell Biology Data Insights Projects Grantees - CZI. https://
chanzuckerberg .com Preprint servers: The Story Continues. https://
openscience .eu Preprint servers: The Story Continues. https://
openscience .eu Preprints in the Spotlight. https://
sr .ithaka .org Are preprints the future of research?. https://
ideas .newsrx .com Spatial omics technologies at multimodal and single cell .... https://genomebiology.biomedcentral.com
Spatial omics technologies at multimodal and single cell .... https://genomebiology.biomedcentral.com
Spatial omics technologies at multimodal and single cell .... https://genomebiology.biomedcentral.com
- Holló, G., Park, J. H., Boni, E., & Schaerli, Y. (2024). GRN_modeler: An Intuitive Tool for Constructing and Evaluating Gene Regulatory Networks and its Applications to Oscillators and a Light Biosensor. 10.1101/2024.12.18.629005
- Yang, X., Mann, K. K., Wu, H., & Ding, J. (2024). scCross: a deep generative model for unifying single-cell multi-omics with seamless integration, cross-modal generation, and in silico exploration. Genome Biology, 25(1). 10.1186/s13059-024-03338-z
- Youngblut, N. D., Carpenter, C., Prashar, J., Ricci-Tam, C., Ilango, R., Teyssier, N., Konermann, S., Hsu, P. D., Dobin, A., Burke, D. P., Goodarzi, H., & Roohani, Y. H. (2025). scBaseCount: an AI agent-curated, uniformly processed, and continually expanding single cell data repository. 10.1101/2025.02.27.640494
- Park, J., Kim, J., Lewy, T., Rice, C. M., Elemento, O., Rendeiro, A. F., & Mason, C. E. (2022). Spatial omics technologies at multimodal and single cell/subcellular level. Genome Biology, 23(1). 10.1186/s13059-022-02824-6
- Kiessling, P., & Kuppe, C. (2024). Spatial multi-omics: novel tools to study the complexity of cardiovascular diseases. Genome Medicine, 16(1). 10.1186/s13073-024-01282-y
- Fisher, A. G. (2025). Editorial: 10 years of Frontiers in cell and developmental biology: past discoveries, current challenges and future perspectives. Frontiers in Cell and Developmental Biology, 13. 10.3389/fcell.2025.1625942
- Wolstencroft, K., Owen, S., Krebs, O., Nguyen, Q., Stanford, N. J., Golebiewski, M., Weidemann, A., Bittkowski, M., An, L., Shockley, D., Snoep, J. L., Mueller, W., & Goble, C. (2015). SEEK: a systems biology data and model management platform. BMC Systems Biology, 9(1). 10.1186/s12918-015-0174-y
- Oliver, S. G., Lock, A., Harris, M. A., Nurse, P., & Wood, V. (2016). Model organism databases: essential resources that need the support of both funders and users. BMC Biology, 14(1). 10.1186/s12915-016-0276-z