π¦ Semantic Similarity Index Measure how isolated or connected disease research is across the global biomedical literature, using PubMedBERT embeddings on PubMed abstracts spanning 175 GBD diseases. What it does 1. Takes a disease list (GBD taxonomy) as input 2. Retrieves PubMed abstracts (2000-2025) for each disease with quality filtering 3. Generates 768-dimensional PubMedBERT embeddings for every abstract 4. Computes four semantic equity metrics per disease: - Semantic Isolation Index (SII) : average cosine distance to k-nearest disease neighbours; higher = more isolated, less connected reβ¦