Cleaning up¶
I really want to get one of those Roombas. I had one in Vegas and it was nice. I’ve got almost everything off the floor in the dining room and kitchen, but there’s still a lot of work to do in the rest of the house. I figured I might as well clean top to bottom, so I’ve been going through all of my Akro-Mils bins, organizing them and throwing away stuff I don’t need. I’m putting similar things in plastic bags so I can consolidate stuff. I’ve got a label maker, but they’re plastic bags, so I can just write what they are on the outside.
I threw away a lot of stuff that had been accumulating in the central hall yesterday and bought a Swiffer duster. For the past couple of hours I’ve been running around gathering all of the random bits of hardware that seem to have collected on every surface in the house, the stuff I don’t want to throw away but don’t have any use for, into a bag, so I can just have the “furniture hardware/allen wrench” bag and dust around here.
Refinancing¶
I really need to refinance. I bought when mortgages were 7% and I’ve been waiting for Powell to cut ever since. I could have bought a nice car with the money I’ve spent on my 7% mortgage vs a reasonable rate since I moved in.
Any damn day now. Shit, even 6.625% would give me a few hundred back versus what I’m paying now. I saw one offer that wanted to give me cash out and had me paying way less a month than what I’m at, and I don’t know, I need to start looking into it. Looks like everybody wants to do refinances at 6.75%.
Quit my bitching¶
Okay I’ll refinance when it makes more sense for me to refinance and not through some nonsense offer that’s going to screw me over because I didn’t read the fine print. Anyway.
Computational Developmental Systems Biology¶
Okay I’ve got this to review the state of the art in cell simulation and developmental biology so I don’t tread water in computational systems biology when I know so much and have so much to offer. This needs to be the game I play. I need to be a world expert at this stuff, and after just a little while, a few tutorials here, an addition of something to the code there, and I will be! The point is to get started.
This would be a great
Query:¶
I want to create a framework for testing hypotheses in developmental biology using the known genetic regulatory networks and whatever diffusion coefficients and whatnot we can glean from single-cell RNA and SNP studies, the whole nine yards, including histology with real-time 3D cell imaging. I want to create a database for this kind of information surrounding the Waddington landscape of the cells as they divide. I imagine there are several sources of information on cell atlases for developmental model organisms. I want to collect them into a place where they can be put through the paces of simulation and integration of real-world -omic data from various sources, including mass-scale lit review, to arrive at some ideas as to how organisms develop on a more granular level than we might currently understand.
I think Taichi is a good language for developing the simulation aspects. I don’t know about the database side of things. I imagine the literature review aspect will take precedence.
Does this make any sense? I eventually want to integrate AI assistants for expert feedback, but I need some kind of database of single-cell omic studies that can connect with actual genetic regulatory network simulations of development, even if they’re not full cell based. We need to carefully investigate cell simulation advances in computational synthetic biology.
🧬 Framework for Hypothesis Testing in Developmental Biology: Project Blueprint¶
1. Database Architecture Recommendations¶
Hybrid Approach: MongoDB + Neo4j
Component | Purpose | Rationale |
---|---|---|
MongoDB | Store heterogeneous, evolving omics data (single-cell RNA, SNPs, imaging, metadata) | Flexible schema, high scalability, ideal for diverse and large-scale biological datasets [1] |
Neo4j | Model and query complex genetic regulatory networks and cell relationships | Efficient for deep, relationship-centric queries (e.g., pathway analysis, lineage tracing) [2] |
Integration | Use both in tandem: MongoDB for raw/processed data, Neo4j for network/relationship queries | Hybrid solutions are common in bioinformatics for comprehensive analysis [3] |
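To make the split in the table concrete, here is a minimal sketch of the two roles in plain Python. It does not use the MongoDB or Neo4j drivers: the `documents` dict stands in for a document collection, the `edges` dict stands in for graph relationships, and the gene names and dataset ID are hypothetical placeholders, not real data.

```python
from collections import defaultdict

# Document side: free-form records keyed by ID (stand-in for a MongoDB collection).
documents = {}

# Graph side: directed, typed edges between genes (stand-in for Neo4j relationships).
edges = defaultdict(list)


def add_dataset(doc_id, record):
    """Store a heterogeneous omics record; no fixed schema is enforced."""
    documents[doc_id] = record


def add_regulation(source, target, effect, evidence_doc):
    """Record 'source regulates target' and point back to the supporting document."""
    edges[source].append({"target": target, "effect": effect, "evidence": evidence_doc})


def downstream(gene, seen=None):
    """Return every gene reachable from `gene` by following regulatory edges."""
    seen = set() if seen is None else seen
    for edge in edges.get(gene, []):
        if edge["target"] not in seen:
            seen.add(edge["target"])
            downstream(edge["target"], seen)
    return seen


# Hypothetical example data, for illustration only.
add_dataset("scrna_001", {"organism": "zebrafish", "assay": "scRNA-seq", "n_cells": 1200})
add_regulation("geneA", "geneB", "activates", "scrna_001")
add_regulation("geneB", "geneC", "represses", "scrna_001")
print(sorted(downstream("geneA")))
```

The point of the design is that each edge keeps an `evidence` pointer into the document side, so a network query can always be traced back to the dataset that supports it.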
2. Data Integration Strategy¶
Leverage Major Cell Atlas Initiatives and Multi-Omics Platforms
- Primary Data Sources:
- Human Cell Atlas (HCA) & Human Developmental Cell Atlas (HDCA) for human and developmental data [4][5]
- Allen Brain Cell Atlas for brain-specific, multimodal data [6]
- Arc Virtual Cell Atlas for multi-species, computation-ready single-cell data [3]
- Model organism databases: FlyBase, ZFIN, MGI, WormBase, TAIR for Drosophila, zebrafish, mouse, C. elegans, and Arabidopsis [7][8][9][10]
- Integration Tools:
3. Simulation Framework Design¶
Taichi for High-Performance, GPU-Accelerated Simulations
- Why Taichi?
- Native GPU support, automatic parallelization, and integration with Python ecosystem [13]
- Proven use in molecular dynamics, agent-based, and particle-based biological simulations
- Supports real-time 3D visualization and spatially explicit modeling (e.g., diffusion, cell migration)
- Simulation Modules:
- Genetic Regulatory Network (GRN) Simulation: Integrate with tools like GRiNS, GINsim, or BioTapestry for network logic and dynamics [14][15]
- Diffusion and Biophysical Modeling: Implement custom modules for diffusion coefficients and spatial processes using Taichi’s particle/fluid simulation capabilities
- Cellular Imaging Integration: Ingest and visualize real-time 3D imaging data for model validation and hypothesis generation
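The diffusion module mentioned above can be sketched as a one-dimensional morphogen gradient with a fixed source, diffusion, and first-order decay. This is a rough illustration in plain Python rather than a Taichi kernel, and the parameter values (`D`, `decay`, grid size) are arbitrary assumptions, not measured coefficients.

```python
def diffuse_step(conc, D, decay, dx, dt):
    """One explicit Euler step of dc/dt = D * d2c/dx2 - decay * c on a 1D grid."""
    new = conc[:]
    for i in range(1, len(conc) - 1):
        laplacian = (conc[i - 1] - 2.0 * conc[i] + conc[i + 1]) / (dx * dx)
        new[i] = conc[i] + dt * (D * laplacian - decay * conc[i])
    new[0] = 1.0        # fixed-concentration source at the left edge
    new[-1] = new[-2]   # zero-flux boundary at the right edge
    return new


def simulate_gradient(n_cells=50, D=1.0, decay=0.1, dx=1.0, dt=0.2, steps=2000):
    """Run the explicit scheme long enough to approach the steady-state gradient."""
    if D * dt / (dx * dx) > 0.5:
        raise ValueError("explicit scheme is unstable when D*dt/dx^2 > 0.5")
    conc = [0.0] * n_cells
    conc[0] = 1.0
    for _ in range(steps):
        conc = diffuse_step(conc, D, decay, dx, dt)
    return conc


gradient = simulate_gradient()
print([round(c, 3) for c in gradient[:8]])
```

The inner loop over grid points is the part a GPU kernel would parallelize; the stability check matters because the explicit scheme blows up if the time step is too large for the chosen diffusion coefficient.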
4. Literature Review Methodology¶
AI-Assisted, Systematic Review Pipeline
- Automated Literature Mining:
- Use AI/NLP tools (e.g., PubMed APIs, semantic search engines) to extract relevant studies, datasets, and experimental results
- Tag and index findings by organism, developmental stage, omics type, and experimental context
- Expert Curation:
- Integrate human-in-the-loop review for quality control and hypothesis refinement
- Continuous Updating:
- Set up automated alerts and periodic reviews to keep the database current with the latest research
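The "tag and index" step above could start as simple keyword matching before any heavier NLP is involved. The sketch below assumes a tiny hand-written vocabulary (`VOCAB`) and a made-up abstract; a real pipeline would presumably use curated ontologies and actual records pulled from a literature API.

```python
import re

# Hypothetical controlled vocabulary: field -> label -> synonyms to look for.
VOCAB = {
    "organism": {
        "zebrafish": ["zebrafish", "danio rerio"],
        "mouse": ["mouse", "mus musculus"],
        "drosophila": ["drosophila", "fruit fly"],
    },
    "omics_type": {
        "scRNA-seq": ["scrna-seq", "single-cell rna"],
        "spatial": ["merfish", "seqfish", "spatial transcriptomics"],
    },
    "stage": {
        "gastrulation": ["gastrulation", "gastrula"],
        "embryo": ["embryo", "embryonic"],
    },
}


def tag_abstract(text):
    """Return the vocabulary labels whose synonyms appear in the text, per field."""
    lowered = text.lower()
    tags = {}
    for field, labels in VOCAB.items():
        hits = [
            label
            for label, synonyms in labels.items()
            if any(re.search(r"\b" + re.escape(s), lowered) for s in synonyms)
        ]
        tags[field] = sorted(hits)
    return tags


def build_index(records):
    """Map each (field, label) pair to the IDs of the records tagged with it."""
    index = {}
    for rec_id, text in records.items():
        for field, labels in tag_abstract(text).items():
            for label in labels:
                index.setdefault((field, label), []).append(rec_id)
    return index


# Made-up abstract, for illustration only.
records = {"rec1": "Single-cell RNA sequencing of zebrafish embryos during gastrulation."}
print(tag_abstract(records["rec1"]))
print(build_index(records))
```

Keeping the index keyed by `(field, label)` means the human-in-the-loop curation step can review one tag at a time and correct the vocabulary without touching the records.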
5. Technology Stack Selection¶
Layer | Recommended Technology/Tool | Justification |
---|---|---|
Database | MongoDB, Neo4j | Hybrid for flexibility and network analysis |
Data Integration | Python (pandas, scVI, scANVI), Cytoscape | Mature libraries for omics data harmonization and visualization |
Simulation | Taichi, GRiNS, GINsim, BioTapestry | High-performance, extensible, and compatible with Python ecosystem |
Visualization | CZ CELLxGENE, custom 3D viewers (e.g., VTK, Plotly) | For interactive exploration of cell states and landscapes |
AI/ML | PyTorch, TensorFlow, HuggingFace Transformers | For literature mining, data integration, and model training |
Web/API | FastAPI, Flask, React | For user interface, data access, and integration with AI assistants |
6. Implementation Roadmap¶
Phase 1: Foundation¶
- Set up MongoDB and Neo4j instances
- Ingest initial datasets from major cell atlases and model organism databases
- Develop data harmonization and integration pipelines
Phase 2: Simulation Core¶
- Build Taichi-based simulation modules for GRN dynamics and diffusion modeling
- Integrate with existing GRN tools (GRiNS, GINsim)
- Connect simulation outputs to database for storage and analysis
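As a first target for the GRN dynamics module, a textbook two-gene toggle switch (two genes repressing each other with Hill kinetics) is about as small as it gets. The sketch below uses plain Python and forward Euler integration; the parameter values (`alpha`, `n`, step size) are illustrative assumptions, and none of this reflects the API of GRiNS or GINsim.

```python
def toggle_switch(x0, y0, alpha=4.0, n=2, dt=0.01, steps=5000):
    """Integrate dx/dt = alpha/(1 + y^n) - x and dy/dt = alpha/(1 + x^n) - y."""
    x, y = x0, y0
    for _ in range(steps):
        dx = alpha / (1.0 + y ** n) - x   # gene X is repressed by Y and decays
        dy = alpha / (1.0 + x ** n) - y   # gene Y is repressed by X and decays
        x, y = x + dt * dx, y + dt * dy
    return x, y


# Two different starting states end up in two different stable states.
print(toggle_switch(2.0, 0.1))
print(toggle_switch(0.1, 2.0))
```

The two stable end states are the simplest version of valleys in a Waddington landscape: which one a cell falls into depends only on which side of the diagonal it starts from.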
Phase 3: Literature Integration¶
- Deploy AI/NLP pipeline for automated literature mining and tagging
- Curate and link relevant studies to database entries
Phase 4: Visualization & User Interface¶
- Implement interactive dashboards for data exploration (CZ CELLxGENE, custom viewers)
- Enable real-time 3D visualization of simulation and imaging data
Phase 5: AI Assistant Integration¶
- Develop or integrate AI agents for expert feedback, hypothesis suggestion, and workflow automation
Phase 6: Expansion & Community Engagement¶
- Add support for new data types (e.g., spatial transcriptomics, proteomics)
- Open platform for community data contributions and benchmarking
Conclusion¶
Your vision is not only feasible but aligns with the latest advances in computational synthetic biology, database technology, and AI-driven research. By combining a hybrid database architecture, leveraging open cell atlas resources, using Taichi for simulation, and integrating AI for literature review and expert feedback, you can create a robust, extensible framework for hypothesis testing in developmental biology. This platform will empower granular, mechanistic insights into organismal development and serve as a valuable resource for the scientific community.
Links¶
- GINsim | Qualitative analysis of regulatory networks: a .... http://ginsim.org
- BioTapestry. https://biotapestry.systemsbiology.net
- SPREd: a simulation-supervised neural network tool for gene .... https://academic.oup.com
- GRiNS: A Python Library for Simulating Gene Regulatory .... https://arxiv.org
- Ingeneue: a software tool to simulate and explore genetic .... https://pubmed.ncbi.nlm.nih.gov
- Current approaches to gene regulatory network modelling. https://pmc.ncbi.nlm.nih.gov
- An Intuitive Tool for Constructing and Evaluating Gene .... https://www.biorxiv.org
- From molecular simulation to black hole rendering - Taichi Docs. https://docs.taichi-lang.org
- From molecular simulation to black hole rendering - Taichi Docs. https://docs.taichi-lang.org
- victoriacity/taichimd: Interactive, GPU-accelerated .... https://github.com
- victoriacity/taichimd: Interactive, GPU-accelerated .... https://github.com
- Human Cell Atlas. https://www.humancellatlas.org
- Human Cell Atlas Integration with the Cytoscape .... https://chanzuckerberg.com
- Allen Brain Cell Atlas. https://portal.brain-map.org
- Allen Brain Atlas: Cell Types: Overview. https://celltypes.brain-map.org
- Cell Types Database: RNA-Seq Data. https://portal.brain-map.org
- Whole Mouse Brain Atlas - Brain Cell Data Center (BCDC). https://www.biccn.org
- Arc Virtual Cell Atlas launches, combining data from over .... https://arcinstitute.org
- Arc Virtual Cell Atlas launches, combining data from over .... https://arcinstitute.org
- Development Biological Network. https://www.humancellatlas.org
- Human Developmental Cell Atlas | Single cell atlas of the .... https://developmental.cellatlas.io
- Model Organism Databases. https://bioinfo.cd-genomics.com
- Model Organism Databases. https://bioinfo.cd-genomics.com
- FLY CELL ATLAS - FlyCellAtlas description. http://flycellatlas.org
- Model Organism Databases. https://bioinfo.cd-genomics.com
- Databases & Protocols. https://www.cshl.edu
- Plant Cell Atlas. https://www.plantcellatlas.org
- The Biodiversity Cell Atlas: charting cellular diversity on Earth. https://www.ebi.ac.uk
- CZ CELLxGENE Discover - Cellular Visualization Tool. https://cellxgene.cziscience.com
- Deep Dive: MongoDB vs Neo4j vs RDBMS — A Technical .... https://medium.com
- Full Data Science Project Part 3: Storage Architecture with .... https://medium.com
- scGRN: a comprehensive single-cell gene regulatory network .... https://academic.oup.com
- Deep Dive: MongoDB vs Neo4j vs RDBMS — A Technical .... https://medium.com
- Virtual Cells: Predict, Explain, Discover. https://arxiv.org
- scCross: a deep generative model for unifying single-cell multi .... https://genomebiology.biomedcentral.com
- Stein Aerts Lab - VIB - KULeuven. https://aertslab.org
- Virtual Cells: Predict, Explain, Discover. https://arxiv.org
- Arc Institute Initiates Groundbreaking “Virtual Cell .... https://bioengineer.org
Wow! That’s amazing.¶
Okay so I definitely need to dive deeper into the Arc Virtual Cell Atlas and their paper. scBaseCount sounds like something I was planning on doing myself as part of the project specs, so I might as well read about how they’re doing it.
GRiNS and the other simulation engines definitely look like something I should be reading about.
I created a project called epiwad, and the above walks through a very cursory first-draft idea for it. I have a lot of work to do just on reading the background material before I ever start putting together code. Many tutorials to follow. Honestly I’m putting all of this crap into books. They might just be shitty AI slop that I have some bot put together from the work I’m doing and the tutorials I’m going to reproduce, in the interest of building on top of all the amazing work that’s already been done and writing it up in an accessible way, but it will be something.

First rough draft logo but I like it
TO DO¶
- Go over the links and references above more thoroughly
- Update plan with knowledge gained from deep dives into each of the subjects
- More queries into each specific aspect as the subject matter knowledge increases
- Continue to update plan
- Download and run all available tutorials
- Condense main points of paper into journal posts
Another link dump¶
Query:¶
Can you give me a rundown of the latest and greatest articles, software resources, databases, and experimental methods fueling innovation in computational and experimental developmental systems biology? Specific focus on single cell methods and spatial omics tools/papers/methods would be incredibly useful. I am trying to get up to speed and it just seems like a good query to run periodically.
Top 10 of each. Articles. Methods. Software/databases.
Latest Innovations in Computational and Experimental Developmental Systems Biology¶
With a Focus on Single-Cell and Spatial Omics
This comprehensive rundown covers the top recent articles, experimental methods, and software/databases fueling innovation in developmental systems biology, with a particular emphasis on single-cell and spatial omics. The field is rapidly evolving, so this list is designed to help you get up to speed and can serve as a template for periodic updates.
Top 10 Breakthrough Articles (2022–2025)¶
While a definitive, ranked “top 10” requires citation analysis and field consensus, the following themes and article types have dominated high-impact journals and preprint servers:
Integration of Multi-Omics Data in Developmental Systems
- Articles describing computational frameworks for integrating transcriptomic, epigenomic, and proteomic data at single-cell resolution, often across developmental time courses [1].
Spatial Transcriptomics in Embryogenesis
- Landmark studies mapping gene expression in situ during early development using spatial transcriptomics (e.g., seqFISH+, MERFISH) [2].
Single-Cell Atlases of Model Organisms
- Comprehensive single-cell atlases for zebrafish, mouse, and human embryos, revealing lineage trajectories and cell fate decisions [3].
Organoid and Organ-on-Chip Systems
- Reports on the use of organoids and microfluidic systems to model developmental processes and tissue morphogenesis [4].
Machine Learning for Cell Fate Prediction
- Articles introducing machine learning models that predict cell fate transitions from single-cell multi-omics data [5].
Spatial Multi-Omics in Tissue Patterning
- Studies combining spatial transcriptomics with proteomics or metabolomics to dissect tissue patterning and morphogen gradients [6].
CRISPR Screens in Developmental Contexts
- High-throughput CRISPR-based perturbation screens in developing tissues, analyzed at single-cell resolution [7].
Comparative Developmental Omics Across Species
- Cross-species single-cell and spatial omics studies revealing conserved and divergent developmental programs [8].
Cell-Cell Communication Networks
- Articles mapping ligand-receptor interactions and signaling networks in developing tissues using spatially resolved data [9].
Emerging Model Organisms and Genomic Tools
- Reports on the development of genomic resources and single-cell methods for non-traditional model organisms [7].
Note: For the most current articles, regularly monitor preprint servers like bioRxiv and arXiv, and set up alerts for journals such as Nature, Science, Cell, Developmental Cell, and Annual Review of Cell and Developmental Biology [10].
Top 10 Experimental Methods and Protocols¶
MERFISH (Multiplexed Error-Robust Fluorescence In Situ Hybridization)
- Enables highly multiplexed, quantitative spatial transcriptomics at subcellular resolution [11].
seqFISH+ (Sequential Fluorescence In Situ Hybridization Plus)
- Allows profiling of thousands of genes in situ using advanced barcoding strategies [11].
Cyclic Immunofluorescence (CyCIF)
- Multiplexed protein imaging for spatial proteomics in tissue sections [12].
IBEX (Iterative Bleaching Extends Multiplexity)
- High-plex imaging of proteins and nucleic acids in tissues [12].
Spatial Multi-Omics (Joint Transcriptome, Proteome, Epigenome Profiling)
- Protocols for simultaneous measurement of multiple omics layers in the same tissue section [6].
Single-Cell Multi-Omics (scRNA-seq + ATAC-seq, etc.)
- Methods for joint profiling of transcriptome and chromatin accessibility in single cells [5].
CRISPR/Cas9 and RNAi in Emerging Model Organisms
- Adaptation of gene editing and knockdown protocols for non-traditional models [7].
Organoid and Organ-on-Chip Systems
- Protocols for generating and analyzing 3D tissue models of development [4].
Live Imaging with High-Resolution Microscopy
- Advanced imaging protocols for tracking cell behavior and lineage in vivo [2].
Probe-Based Multiplexed Imaging (e.g., MELC)
- Multi-epitope ligand cartography for high-content spatial protein mapping [12].
Top 10 Software Resources and Databases¶
SEEK Platform
- Open-source platform for managing, sharing, and exploring systems biology data and models [13].
BioUML
- Web-based platform for visual modeling, simulation, and analysis of biological systems [14].
Graph Databases for Systems Biology
- Tools for integrating and visualizing complex biological networks and relationships [15].
VirtualPlant
- Integrates multiple data sources for systems-level plant biology research [16].
MAGNet
- Web-enabled platform integrating bioinformatics tools for systems biology [[MAGNet]].
de.NBI-SysBio Services (SABIO-RK, Excemplify, SEEK)
- Suite of tools for data management and enrichment in systems biology [[de.NBI]].
SIMO (Spatial Integration of Multi-Omics)
- Computational tool for probabilistic alignment and integration of spatial multi-omics datasets [17].
Human Cell Atlas & HuBMAP
- Large-scale databases providing single-cell and spatial omics maps of human tissues [3].
Preprint Servers (bioRxiv, arXiv)
- Essential for tracking the latest computational tools, methods, and protocols [10].
Model Organism Databases (MODs)
- Organism-specific databases hosting genomic and functional data, with tools for querying and visualization [18].
Top 10 Computational Methods for Single-Cell and Spatial Omics¶
SIMO
- Integrates spatial and non-spatial single-cell omics data using probabilistic alignment [17].
Graph-Linked Embedding
- Integrates single-cell data modalities at atlas scale by modeling regulatory interactions [19].
Weakly Linked Features Integration
- Integrates spatial and single-cell data across modalities using weakly linked features [19].
Machine Learning for Multi-Omics Integration
- AI-driven approaches for pattern recognition and cell fate prediction in developmental datasets [5].
Spatial Context-Aware Cell-Cell Communication Analysis
- Algorithms for inferring signaling networks from spatially resolved omics data [9].
Early, Intermediate, and Late Data Integration Strategies
- Computational frameworks for combining multi-omics data at different stages of analysis [20].
Atlas-Scale Data Integration
- Methods for integrating and harmonizing data across large-scale single-cell atlases [19].
Spatial Multi-Omics Data Analysis Pipelines
- End-to-end pipelines for joint analysis of transcriptome, proteome, and epigenome in spatial context [6].
Organoid and Tissue Simulation Platforms
- Computational models for simulating organoid development and tissue morphogenesis [4].
Cross-Species Omics Integration
- Methods for comparative analysis of developmental programs across model organisms [8].
How to Stay Up-to-Date¶
- Monitor Preprint Servers: Set up alerts on bioRxiv and arXiv for keywords like “single-cell,” “spatial omics,” “developmental systems biology,” and “multi-omics” [10].
- Follow Major Journals: Regularly check journals such as Nature, Science, Cell, Developmental Cell, and Annual Review of Cell and Developmental Biology for reviews and breakthrough articles [1].
- Engage with Databases: Explore and contribute to community databases and platforms like SEEK, Human Cell Atlas, and model organism databases [13] [3].
- Attend Conferences: Participate in conferences and workshops focused on single-cell and spatial omics technologies.
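Since the plan is to rerun this kind of query periodically, the keyword alerts above can be partly scripted. The sketch below only builds a query URL for the public arXiv API (`search_query`, `start`, `max_results`, `sortBy`, `sortOrder` are its documented parameters) and does not fetch or parse anything; the helper name `arxiv_query_url` is made up for illustration, and bioRxiv is not covered here.

```python
from urllib.parse import urlencode

ARXIV_API = "http://export.arxiv.org/api/query"


def arxiv_query_url(keywords, max_results=10):
    """Build an arXiv API URL that ORs together quoted keyword phrases, newest first."""
    terms = " OR ".join('all:"{}"'.format(k) for k in keywords)
    params = {
        "search_query": terms,
        "start": 0,
        "max_results": max_results,
        "sortBy": "submittedDate",
        "sortOrder": "descending",
    }
    return ARXIV_API + "?" + urlencode(params)


print(arxiv_query_url(["single-cell", "spatial omics", "multi-omics"]))
```

The response is an Atom feed, so a scheduled job could fetch the URL, parse the entries, and push anything new into the literature database.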
Summary Table¶
Category | Top Resources/Methods/Articles (2022–2025) |
---|---|
Articles | Multi-omics integration, spatial transcriptomics, single-cell atlases, organoids, ML for cell fate, etc. |
Methods | MERFISH, seqFISH+, CyCIF, IBEX, spatial multi-omics, scRNA-seq+ATAC-seq, CRISPR, organoids, live imaging |
Software/DBs | SEEK, BioUML, graph DBs, VirtualPlant, MAGNet, de.NBI-SysBio, SIMO, Human Cell Atlas, bioRxiv, MODs |
Computational | SIMO, graph-linked embedding, weakly linked features, ML integration, spatial context-aware analysis, etc. |
Links¶
- Cell and Developmental Biology | List of High Impact Articles. https://www.iomcworld.com
- Editorial: 10 years of Frontiers in cell and developmental .... https://www.frontiersin.org
- Editorial: 10 years of Frontiers in cell and developmental .... https://www.frontiersin.org
- The Future of Cell Biology: Emerging Model Organisms. https://pmc.ncbi.nlm.nih.gov
- Model organism databases: essential resources that need the .... https://bmcbiol.biomedcentral.com
- Diversifying the concept of model organisms in the age of. https://www.nature.com
- SEEK: a systems biology data and model management platform. https://bmcsystbiol.biomedcentral.com
- BioUML—towards a universal research platform. https://academic.oup.com
- Graph databases in systems biology: a systematic review. https://academic.oup.com
- A Software Platform to Support Systems Biology Research. https://pmc.ncbi.nlm.nih.gov
- Spatial integration of multi-omics single-cell data with SIMO. https://www.nature.com
- Multi-omics single-cell data integration and regulatory inference with graph-linked embedding | Nature Biotechnology. https://www.nature.com
- Spatial multi-omics: novel tools to study the complexity of .... https://genomemedicine.biomedcentral.com
- Computational strategies for single-cell multi-omics integration - ScienceDirect. https://www.sciencedirect.com
- Spatial multi-omics: novel tools to study the complexity of .... https://genomemedicine.biomedcentral.com
- Spatial integration of multi-omics single-cell data with SIMO. https://www.nature.com
- Spatial multi-omics: novel tools to study the complexity of .... https://genomemedicine.biomedcentral.com
- Single-Cell Biology Data Insights Projects Grantees - CZI. https://chanzuckerberg.com
- Preprint servers: The Story Continues. https://openscience.eu
- Preprint servers: The Story Continues. https://openscience.eu
- Preprints in the Spotlight. https://sr.ithaka.org
- Are preprints the future of research?. https://ideas.newsrx.com
- Spatial omics technologies at multimodal and single cell .... https://genomebiology.biomedcentral.com
- Spatial omics technologies at multimodal and single cell .... https://genomebiology.biomedcentral.com
- Spatial omics technologies at multimodal and single cell .... https://genomebiology.biomedcentral.com
- Holló, G., Park, J. H., Boni, E., & Schaerli, Y. (2024). GRN_modeler: An Intuitive Tool for Constructing and Evaluating Gene Regulatory Networks and its Applications to Oscillators and a Light Biosensor. 10.1101/2024.12.18.629005
- Yang, X., Mann, K. K., Wu, H., & Ding, J. (2024). scCross: a deep generative model for unifying single-cell multi-omics with seamless integration, cross-modal generation, and in silico exploration. Genome Biology, 25(1). 10.1186/s13059-024-03338-z
- Youngblut, N. D., Carpenter, C., Prashar, J., Ricci-Tam, C., Ilango, R., Teyssier, N., Konermann, S., Hsu, P. D., Dobin, A., Burke, D. P., Goodarzi, H., & Roohani, Y. H. (2025). scBaseCount: an AI agent-curated, uniformly processed, and continually expanding single cell data repository. 10.1101/2025.02.27.640494
- Park, J., Kim, J., Lewy, T., Rice, C. M., Elemento, O., Rendeiro, A. F., & Mason, C. E. (2022). Spatial omics technologies at multimodal and single cell/subcellular level. Genome Biology, 23(1). 10.1186/s13059-022-02824-6
- Kiessling, P., & Kuppe, C. (2024). Spatial multi-omics: novel tools to study the complexity of cardiovascular diseases. Genome Medicine, 16(1). 10.1186/s13073-024-01282-y
- Fisher, A. G. (2025). Editorial: 10 years of Frontiers in cell and developmental biology: past discoveries, current challenges and future perspectives. Frontiers in Cell and Developmental Biology, 13. 10.3389/fcell.2025.1625942
- Wolstencroft, K., Owen, S., Krebs, O., Nguyen, Q., Stanford, N. J., Golebiewski, M., Weidemann, A., Bittkowski, M., An, L., Shockley, D., Snoep, J. L., Mueller, W., & Goble, C. (2015). SEEK: a systems biology data and model management platform. BMC Systems Biology, 9(1). 10.1186/s12918-015-0174-y
- Oliver, S. G., Lock, A., Harris, M. A., Nurse, P., & Wood, V. (2016). Model organism databases: essential resources that need the support of both funders and users. BMC Biology, 14(1). 10.1186/s12915-016-0276-z