<?xml version="1.0" encoding="UTF-8"?><rss version="2.0" xmlns:content="http://purl.org/rss/1.0/modules/content/"><channel><title>Boston Women in Bioinformatics&apos;s Blog</title><description>The online presence of Women in Bioinformatics in the Boston Area</description><link>https://boston-wib.org</link><item><title>A Coffee with CompBio: Behind the &quot;We&apos;ll Be in Touch&quot; Emails</title><link>https://boston-wib.org/blog/coffeewithcompbio/s2-e4</link><guid isPermaLink="true">https://boston-wib.org/blog/coffeewithcompbio/s2-e4</guid><pubDate>Wed, 29 Apr 2026 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/coffeeWithCompBio-logo2.png&quot; alt=&quot;Coffee with CompBio Podcast Logo: Four painted women under that podcast title&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;ArkeaBio&apos;s Lauren Fitch and Kaye Guerard-Hatziioannou pull back the curtain on how bioinformatics roles actually get filled.&lt;/em&gt;&lt;/p&gt;

&lt;iframe
  data-testid=&quot;embed-iframe&quot;
  style=&quot;border-radius:12px&quot;
  src=&quot;https://open.spotify.com/embed/episode/5W9nBMlCwnKThMycvRinpr?utm_source=generator&quot;
  width=&quot;100%&quot;
  height=&quot;352&quot;
  frameBorder=&quot;0&quot;
  allowfullscreen=&quot;&quot;
  allow=&quot;autoplay; clipboard-write; encrypted-media; fullscreen; picture-in-picture&quot;
  loading=&quot;lazy&quot;
&gt;&lt;/iframe&gt;

&lt;p&gt;Ever wonder what actually happens after you submit a job application? In this episode of A Coffee with CompBio, Dina and Sharvari sit down with Lauren Fitch (Head of Computational Biology at ArkeaBio) and Kaye Guerard-Hatziioannou (HR Business Partner at ArkeaBio) to walk through the full hiring process for a Bioinformatics Software Engineer role — from resume screening to interview rounds. Lauren and Kaye share the things hiring managers wish candidates knew, offering practical advice for both early-career and mid-career professionals navigating the bioinformatics job market.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Listen On&lt;/strong&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaApple size={32} className=&quot;text-black dark:text-white&quot; /&gt;{&amp;#39; &amp;#39;}&lt;/p&gt;
  &lt;a href=&quot;https://podcasts.apple.com/us/podcast/behind-the-well-be-in-touch-emails-arkeabios-lauren/id1817024741?i=1000764603126&quot;&gt;
    Apple
  &lt;/a&gt;
&lt;/span&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaSpotify size={32} color=&quot;#1DB954&quot; /&gt;{&amp;#39; &amp;#39;}
  &lt;a href=&quot;https://open.spotify.com/episode/5W9nBMlCwnKThMycvRinpr&quot;&gt;Spotify&lt;/a&gt;&lt;/p&gt;
&lt;/span&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaBell size={32} className=&quot;text-yellow-600&quot; /&gt; &lt;a href=&quot;https://podcast.boston-wib.org/feed.xml&quot;&gt;RSS Feed&lt;/a&gt;&lt;/p&gt;
&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;More on this Episode&lt;/strong&gt;:&lt;/p&gt;
&lt;p&gt;Send us your comments, questions, and suggestions using &lt;a href=&quot;https://forms.gle/ncwo6HZeN4uA9gPg7&quot;&gt;this form&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Thanks &lt;a href=&quot;https://www.linkedin.com/in/amulya-shastry/&quot;&gt;Amulya Shastry&lt;/a&gt; for editing and management support, &lt;a href=&quot;https://www.linkedin.com/in/dinaissakova/&quot;&gt;Dina Issakova&lt;/a&gt; for the cover art, and &lt;a href=&quot;https://www.linkedin.com/in/valisha/&quot;&gt;Valisha Shah&lt;/a&gt; for social media support.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Support the Podcast&lt;/strong&gt;:&lt;/p&gt;
&lt;p&gt;We are looking for sponsors! Please get in touch if you or your business would like to help support this podcast.&lt;/p&gt;
&lt;p&gt;Follow &lt;a href=&quot;https://www.linkedin.com/in/sharvarinarendra/&quot;&gt;Sharvari&lt;/a&gt; and &lt;a href=&quot;https://www.linkedin.com/in/saba-nafees/&quot;&gt;Saba&lt;/a&gt; on LinkedIn!&lt;/p&gt;
&lt;p&gt;If you enjoyed the episode, &lt;a href=&quot;https://podcast.ausha.co/a-coffee-with-compbio&quot;&gt;please subscribe and leave a review!&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Hosted by Ausha. See &lt;a href=&quot;https://ausha.co/privacy-policy&quot;&gt;ausha.co/privacy-policy&lt;/a&gt; for more information&lt;/p&gt;
</content:encoded><category>science-communication</category><category>computational-biology</category><category>professional-development</category><category>bioinformatics</category><category>networking</category><category>job-search</category><category>career</category></item><item><title>Biobank Intro Series: You Have What You Need</title><link>https://boston-wib.org/blog/biobank-intro-series/09-wrapping-up</link><guid isPermaLink="true">https://boston-wib.org/blog/biobank-intro-series/09-wrapping-up</guid><pubDate>Tue, 21 Apr 2026 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/biobank1/full_blogseries_graphic.png&quot; alt=&quot;Subway-map style illustration of the Biobank Intro Series showing the journey through UK Biobank and All of Us topics.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;A wrap-up of the series and a checklist before you start your analysis&lt;/em&gt;&lt;/p&gt;

&lt;figure class=&quot;my-8 mx-auto&quot;&gt;
&lt;img src=&quot;https://boston-wib.org//blog_images/biobank1/square_blogseries_graphic.png&quot; alt=&quot;Subway-map style illustration of the Biobank Intro Series showing the journey through UK Biobank and All of Us topics.&quot; class=&quot;mx-auto w-full&quot; &gt;
&lt;figcaption class=&quot;text-center text-sm opacity-80 mt-2&quot;&gt;
   &lt;em&gt;The Biobank Intro Series, visualized as a subway map: each stop builds the foundation you need before diving into biobank analysis.&lt;/em&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;If you&amp;#39;ve made it here, you&amp;#39;ve covered a lot of ground in UK Biobank and All of Us:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;The research environments&lt;/li&gt;
&lt;li&gt;The observational data&lt;/li&gt;
&lt;li&gt;The genetic data&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;This series was never meant to be exhaustive, but I hope it compressed your learning curve.&lt;/p&gt;
&lt;h2&gt;Before You Start Analysis&lt;/h2&gt;
&lt;p&gt;Use this as a gut check. If you can answer yes to everything here, you&amp;#39;re ready.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Platform&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;input disabled=&quot;&quot; type=&quot;checkbox&quot;&gt; Identified the compute model you&amp;#39;re working in (DNAnexus vs. Google Cloud)&lt;/li&gt;
&lt;li&gt;&lt;input disabled=&quot;&quot; type=&quot;checkbox&quot;&gt; A plan is made for saving outputs that won&amp;#39;t disappear when your session ends&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Phenotypes and Covariates&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;input disabled=&quot;&quot; type=&quot;checkbox&quot;&gt; Identified the right fields or concepts for your phenotypes of interest and covariates&lt;/li&gt;
&lt;li&gt;&lt;input disabled=&quot;&quot; type=&quot;checkbox&quot;&gt; Checked for data quality warnings and understand what they mean for your analysis&lt;/li&gt;
&lt;li&gt;&lt;input disabled=&quot;&quot; type=&quot;checkbox&quot;&gt; Know which timepoints or visits have data and which one you&amp;#39;re using (instances on UKB, enrollment/EHR timing on AoU)&lt;/li&gt;
&lt;li&gt;&lt;input disabled=&quot;&quot; type=&quot;checkbox&quot;&gt; Confirmed the sample size is sufficient for your question&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Genetic Data&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;input disabled=&quot;&quot; type=&quot;checkbox&quot;&gt; Located the genotype data format you need (unphased WGS, phased WGS, or exome sequencing)&lt;/li&gt;
&lt;li&gt;&lt;input disabled=&quot;&quot; type=&quot;checkbox&quot;&gt; Confirmed files are indexed (.tbi or .csi) before running any region queries&lt;/li&gt;
&lt;li&gt;&lt;input disabled=&quot;&quot; type=&quot;checkbox&quot;&gt; Subset to your region or variant list&lt;/li&gt;
&lt;li&gt;&lt;input disabled=&quot;&quot; type=&quot;checkbox&quot;&gt; Confirmed sample overlap between your phenotype and genotype data&lt;/li&gt;
&lt;/ul&gt;
&lt;h2&gt;What Comes Next&lt;/h2&gt;
&lt;p&gt;These posts will update if I find mistakes. If something looks off, feel free to reach out to me.&lt;/p&gt;
&lt;p&gt;This series ends here, but, as the cliché goes, the journey is only beginning. Depending on your question, you might be headed toward GWAS, PheWAS, rare variant burden testing, or cross-biobank comparisons. Topics that may appear in a future series.&lt;/p&gt;
&lt;p&gt;For now: you have the data, you know where it lives, and you know enough to not be too surprised by what you find. That&amp;#39;s further than a lot of people get before they start. Good luck!&lt;/p&gt;
&lt;h2&gt;Reflections&lt;/h2&gt;
&lt;p&gt;None of these posts would have happened without a few people.&lt;/p&gt;
&lt;p&gt;Thank you to Sneha Grandhi who helped me recognize my strengths in this field. The biobank work I did at Genscience with her is what this series is built on.&lt;/p&gt;
&lt;p&gt;Almost a year ago, &lt;a href=&quot;https://www.linkedin.com/in/linafaller/&quot;&gt;Lina Faller&amp;#39;s&lt;/a&gt; dedication to sharing her data management work on the Boston Women in Bioinformatics website was the push I needed to finally write something of my own.&lt;/p&gt;
&lt;p&gt;Thank you to Mitibketa Ilboudo and &lt;a href=&quot;https://www.linkedin.com/in/sharvarinarendra/&quot;&gt;Sharvari Narendra&lt;/a&gt; for editing. Taking time to give feedback on a stranger&amp;#39;s blog posts is not a small thing, and it&amp;#39;s the reason I could post these with confidence.&lt;/p&gt;
&lt;p&gt;Thank you to Lance and my family for the blind support. They may never understand what it is that I do, but I still love them.&lt;/p&gt;
</content:encoded><category>biobank</category><category>research-strategy</category><category>ukb</category><category>all-of-us</category></item><item><title>Member Spotlight: Sharvari Narendra</title><link>https://boston-wib.org/blog/interview/sharvari-narendra</link><guid isPermaLink="true">https://boston-wib.org/blog/interview/sharvari-narendra</guid><pubDate>Thu, 16 Apr 2026 14:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//team/Sharvari_Narendra.jpg&quot; alt=&quot;Sharvari Narendra&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Finding Her Voice, One Pipeline at a Time&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;From the outside, Sharvari Narendra seems like someone who has it all figured out. As a host of &lt;em&gt;A Coffee with Comp Bio Season 2 Podcast&lt;/em&gt;, she speaks with confidence and ease, clearly grounded in her work in bioinformatics. It’s the kind of presence that makes you assume the path behind her mastery was just as steady. So when I sat down to interview her about her career journey, I expected a straightforward story. I was wrong, and I left the conversation genuinely inspired.&lt;/p&gt;
&lt;p&gt;When Sharvari completed her Master’s in Microbiology in India, she was already on a seemingly strong, stable career path. However, stepping into an entry job, doing work that made sense on paper, she very quickly realized something important. It wasn’t the work she wanted to be doing.&lt;/p&gt;
&lt;p&gt;She had pivoted to what she calls a “dry lab”, but not in the way we think of it now. It wasn’t biological data analysis. It was simply sitting behind a computer, doing work that didn’t feel connected to anything she cared about. She knew it wasn’t her calling, but like many people in that position, she didn’t have a clear reason to leave yet. That push came from somewhere unexpected.&lt;/p&gt;
&lt;p&gt;In 2017, Sharvari was selected to attend the Harvard College Project for Asian and International Relations (&lt;a href=&quot;https://www.hpair.org/&quot;&gt;HPAIR&lt;/a&gt;), an experience that would quietly change the course of her life. This global conference brings together students and young professionals to discuss major economic, political, and social issues across the Asia-Pacific region. It’s a space for future leaders, people thinking broadly about the world and their place in it. For Sharvari, it was also her first time traveling to the United States, and her first time traveling alone. She arrived in snowy Boston in the middle of winter, stepping into the cold, the unknown, and a completely new world all at once. And something shifted. It wasn’t just the setting. It was the people. The conversations. The energy in the room. As she put it, “It wasn’t the HPAIR conference per se that captured my interest, it was the attendees. They were from so many different walks of life, but all of them had one thing in common. They were all passionate about something in their lives. I wanted that.”&lt;/p&gt;
&lt;p&gt;That moment had nothing to do with bioinformatics. It had everything to do with perspective. For the first time, she wasn’t just thinking about finding a stable job. She was thinking about what it would mean to care deeply about the work she was doing. That realization gave Sharvari the courage to make a difficult decision. When she returned to India, she left her job. Not because she had a perfect plan waiting, but because she was finally ready to search for something that felt right. What followed wasn’t a clean transition. It was a period of exploration. She tried workshops in science communication, pursued language certifications, and even considered going back into academia for a PhD in Molecular Biology. Each step didn’t necessarily bring her closer to a clear answer, but it helped her understand what didn’t fit, which turned out to be just as important.&lt;/p&gt;
&lt;p&gt;Then she came across a 10-day workshop in bioinformatics and metagenomics. “That one stuck with me,” she reflected. This time, the interest felt different. It wasn’t entirely new, she had been introduced to bioinformatics during her microbiology training, but now she was seeing its potential up close. The idea that you could use a computer to understand complex biological systems, like the gut microbiome, opened up a completely new way of thinking. That curiosity turned into a decision. She applied to Northeastern University’s Master’s program in Bioinformatics and returned to Boston, this time with purpose. Same city, but a completely different chapter.&lt;/p&gt;
&lt;p&gt;For Sharvari, choosing the “right” path didn’t suddenly make things easy. She explained, “I found out how much I did not know. Many times I wondered if I had done the right thing coming all the way from India.” Coding wasn’t intuitive at first. The coursework was demanding. The job search felt unpredictable. And layered on top of it all was the weight of being far from home, navigating a system that didn’t always feel built for you. It’s the kind of stretch where confidence comes and goes, sometimes hourly.&lt;/p&gt;
&lt;p&gt;She had been looking for a sign. Fortunately, it showed up as an opportunity.&lt;/p&gt;
&lt;p&gt;During her internship at McLean Hospital, Sharvari worked with RNA-seq data, studying how alcohol affects gene expression in the brain. For the first time, the abstract became tangible. This wasn’t just theory anymore. It was an application. “This internship reminded me why I had chosen bioinformatics. The passion to solve biological problems using a computer.” And just like that, things didn’t feel perfect, but they felt right enough to keep going.&lt;/p&gt;
&lt;p&gt;Today, Sharvari works as a Bioinformatics Analyst at the University of Virginia, focusing on microbial genomics. Which, if you take out the technical language, means she spends her time using data to understand how microbes behave, spread, and evolve. From bacterial genomes to wastewater surveillance, her work sits at the intersection of computation and real-world impact. When she talks about it, Sharvari feels less uncertainty as she perseveres in the field, and more curiosity, “the revolution that next generation sequencing has brought to microbiology cannot be understated.” Sharvari shares as she lights up talking about how we can now track infectious diseases, identify resistance genes, and even trace the origins of microbes across time. It’s complex work, but her excitement makes it feel accessible.&lt;/p&gt;
&lt;p&gt;Her technical experience is impressive, but what stood out most in our conversation wasn’t just the science. It was how she talked about her journey of growth. When I asked about challenges, she didn’t mention coding or technical obstacles. She said one word: agency, “I will give my opinion and if someone disagrees, I don’t really push for my opinion.” It’s the kind of thing many people experience, especially early in their careers, but rarely say out loud. Learning how to take up space, how to stand behind your ideas, how to engage instead of stepping back. These aren’t things you pick up from a textbook. They come with time, and practice, and a willingness to be uncomfortable. It’s something she’s actively working on, by stepping into leadership roles and taking ownership of her work, even when it would be easier not to.&lt;/p&gt;
&lt;p&gt;When the conversation shifted to AI, her answer felt refreshingly balanced, “It’s so scary and so incredible all at once” Sharvari admitted. She doesn’t see it as something that replaces bioinformaticians, but something that raises the bar. Less time spent on repetitive coding, more time spent actually thinking about the biology behind the data. In a way, it’s pushing the field closer to what it was always meant to be.&lt;/p&gt;
&lt;p&gt;Outside of her technical work, Sharvari also serves as co-chair of the Boston Women in Bioinformatics podcast. And here, her perspective becomes even clearer, “What makes a scientific conversation truly impactful is if it can reach the layman in the way it was intended.” Science isn’t meant to feel exclusive. It’s meant to be understood. And she’s intentional about creating conversations that feel accessible, not intimidating.&lt;/p&gt;
&lt;p&gt;Toward the end of our conversation, I asked her what she would say to someone who feels curious about bioinformatics but also hesitant to step into it. Her answer had nothing to do with technical skills, “the intimidation comes from the fear of failing at something new.” And then she reframed it saying, “who is going to care if I fail? Nobody who truly cares about you is going to judge you.” It’s a perspective that feels obvious once you hear it, and surprisingly hard to internalize before you take it into action for yourself.&lt;/p&gt;
&lt;p&gt;Last month Sharvari moderated a fantastic webinar on &lt;em&gt;The Perfect Predator&lt;/em&gt;, a story rooted in infectious disease and real-world impact. It fits naturally into the work she’s already doing, connecting science, storytelling, and community. Check it out at this link: &lt;a href=&quot;https://www.youtube.com/watch?v=-xvSOig0VMg&amp;t=2901s&quot;&gt;https://www.youtube.com/watch?v=-xvSOig0VMg&amp;amp;t=2901s&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Looking back, her journey reads like a series of decisions made with just enough clarity to take the next step. Leaving a job. Trying something new. Moving across the world. Staying when it got hard.&lt;/p&gt;
&lt;p&gt;And somewhere along the way, without forcing it, she built something meaningful. Not all at once. Not perfectly.&lt;/p&gt;
&lt;p&gt;Just one step, and one pipeline, at a time.&lt;/p&gt;
</content:encoded><category>member-spotlight</category><category>interview</category><category>career-development</category><category>academia-to-industry</category><category>bioinformatics</category><category>computational-biology</category><category>microbial-genomics</category><category>metagenomics</category><category>rna-seq</category><category>women-in-science</category><category>career-advice</category><category>professional-development</category></item><item><title>Member Spotlight: Aysheh Alrfooh</title><link>https://boston-wib.org/blog/interview/aysheh-alrfooh</link><guid isPermaLink="true">https://boston-wib.org/blog/interview/aysheh-alrfooh</guid><pubDate>Tue, 14 Apr 2026 14:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/20260414-Aysheh-Interview-02.png&quot; alt=&quot;Aysheh Alrfooh sits on a mountain rock in her hiking outfit, sunlight glowing behind her like a spotlight.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;An interview on methylation, mentorship, and movies&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;When I first decided to interview &lt;a href=&quot;https://www.linkedin.com/in/aysheh-alrfooh/&quot;&gt;Aysheh Alrfooh&lt;/a&gt; for the BWiB Member Spotlight, I did what most of us do - I opened her LinkedIn, clicked through her webpage, and tried to piece together a sense of who she was. What I had to go on was this: she co-chairs BWiB&amp;#39;s Career-Sponsorship Committee, and as someone who was selected as a mentor in BWiB&amp;#39;s inaugural mentorship program this year, that alone was enough to pique my curiosity. But by the end of our conversation, I realized that her role at BWiB is just one thread in a much richer story.&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Aysheh holds a PhD from the University of Iowa, where her research in epigenetics centered on identifying methylation profiles in individuals with mental health disorders - work that sits at the intersection of computation, biology, and a deeply human question: how do we better understand what&amp;#39;s happening inside the minds of people who are suffering? After her PhD, she completed a Computational Biologist co-op at Biogen before joining Diagonal Therapeutics in Watertown, Massachusetts, where she now works as a Scientist. Diagonal is developing a new class of clustering antibody medicines aimed at addressing the root causes of severe genetic diseases.&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;There&amp;#39;s a quiet stereotype that follows scientists around, especially the ones who are serious about their craft - that they are one-dimensional, that conversations with them will inevitably loop back to pipelines and p-values. I&amp;#39;ll admit, I walked into this interview half-expecting exactly that, given the breadth of Aysheh&amp;#39;s expertise. But what I&amp;#39;ve learned over the years is that the most accomplished people are usually the most layered. They carry within them whole worlds of curiosity, humor, conviction, and warmth that you don&amp;#39;t see on a CV. Aysheh is one of those people. By the end of our conversation, I wasn&amp;#39;t just impressed by her credentials - I was genuinely charmed by who she is.&lt;/em&gt;&lt;/p&gt;
&lt;figure style=&quot;max-width: 400px; margin: 0 auto;&quot;&gt;
&lt;img src=&quot;https://boston-wib.org//blog_images/20260414-Aysheh-Interview-01.jpg&quot; alt=&quot;Aysheh Alrfooh smiling at her desk&quot; style=&quot;width: 100%; height: auto;&quot; /&gt;
&lt;/figure&gt;

&lt;p&gt;&lt;strong&gt;Sharvari:&lt;/strong&gt; First of all, you have such a beautiful name! What does it mean, and how do you pronounce it?&lt;br&gt;&lt;strong&gt;Aysheh:&lt;/strong&gt; My name comes from an Arabic root (asha) which means &amp;#39;to live,&amp;#39; so my name means &amp;#39;alive&amp;#39; or &amp;#39;living,&amp;#39; which totally describes me as a person, as I am very passionate and I want to live life to the fullest. My name is pronounced in standard Arabic as &amp;#39;AA-ee-sha,&amp;#39; and in a simpler, less formal Arabic it is &amp;#39;AH-isha&amp;#39;.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sharvari:&lt;/strong&gt; Honestly, that makes the name even more beautiful - there&amp;#39;s something poetic about a name that captures a person so precisely. Thank you for sharing that. Speaking of how you show up in the world - you co-chair the Career-Sponsorship Committee, and BWIB just launched its inaugural mentorship program this year. What drew you to that particular work within the organization?&lt;br&gt;&lt;strong&gt;Aysheh:&lt;/strong&gt; I think it chose me, not the other way around, haha! I always believe that things find me before I find them, but that doesn&amp;#39;t mean I am depriving myself of free will. I do set intentions for causes I want to pursue, such as women&amp;#39;s empowerment, which has always been on my mind. Just as science empowered me, I want to empower women, and the organization was the perfect fit for that. And of course, when the opportunity appeared, I applied to be co-chair, went through the interview, and got the role. I have been with the organization for over a year now and am entering my second year with them. So grateful for this opportunity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sharvari:&lt;/strong&gt; And speaking as someone on the receiving end of that work - it really is making a difference. The mentorship program has been one of the most valuable learning experiences I&amp;#39;ve had this year, so thank you. On that note, here&amp;#39;s something I personally wrestle with, and I think you&amp;#39;d be the perfect person to ask: if you were mentoring someone brand new to bioinformatics, what&amp;#39;s the very first thing you&amp;#39;d teach them?&lt;br&gt;&lt;strong&gt;Aysheh:&lt;/strong&gt; Do not be afraid to spend time writing a script from scratch and watching days go by just through learning. The learning phase is supposed to be slow, difficult, and sometimes frustrating, but that is a sign that you are truly growing. Do not be discouraged by a lack of output or low productivity in the early stages of learning. Embrace the process and be patient with yourself.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sharvari:&lt;/strong&gt; That&amp;#39;s such grounding advice - and honestly, applicable far beyond bioinformatics. There&amp;#39;s a thread of empathy that runs through everything you say, and I noticed it especially in your PhD work at the University of Iowa. Your research connects molecular biology to mental health - studying DNA methylation in bipolar disorder, schizophrenia, and suicidal behavior. How does the human side of that work shape the way you approach it?&lt;br&gt;&lt;strong&gt;Aysheh:&lt;/strong&gt; It guided the project to pursue the potential of DNA methylation as a biomarker, in addition to its role as a factor contributing to the risk of suicide. The project targeted DNA methylation sequencing around SNPs that showed an association with suicide attempts in bipolar disorder, and I studied them in blood samples to determine whether we could detect a strong signal that could later be translated into a biomarker.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sharvari:&lt;/strong&gt; That&amp;#39;s really powerful work. From there, you moved from epigenetics research in academia at the University of Iowa to your current role as a scientist at Diagonal Therapeutics. Looking back across that arc, what has been your most enriching professional experience so far?&lt;br&gt;&lt;strong&gt;Aysheh:&lt;/strong&gt; I cannot decide between two different experiences, where one prepared me for the other, and the current one is preparing me for future opportunities. A PhD is a very individualistic experience whose main focus is learning how to pursue a research question and answer it scientifically, while also developing analytical thinking. Your first job after a PhD is where you apply what you learned, adjusting it to suit the fast pace of a corporate environment. My PhD prepared me for my current job, and my current job is preparing me for future opportunities.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sharvari:&lt;/strong&gt; I love that framing - every chapter doing the quiet work of setting up the next one. Sticking with that transition for a moment: you made the leap from an academic PhD lab to industry. What surprised you most about that shift?&lt;br&gt;&lt;strong&gt;Aysheh:&lt;/strong&gt; Finding something interesting is not enough on its own - the real question is whether it has a practical application. That said, I have a deep love for science, and I sometimes appreciate it from a more artistic perspective, finding beauty in a discovery simply because it is fascinating. But in industry, curiosity alone is not enough; there has to be a clear path to real-world impact.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sharvari:&lt;/strong&gt; That tension between science as art and science as application is such a real one. Thank you for naming it so honestly. On the topic of paths and turning points - is there a moment in your career where you thought, &amp;quot;If I had chosen this other path, things could have been different&amp;quot;? How has that shaped where you are now?&lt;br&gt;&lt;strong&gt;Aysheh:&lt;/strong&gt; Life has given me so many interesting and challenging options. One moment that stands out, among many, is when I got accepted into dental school to pursue a dentistry career but chose to study pharmacy instead. After graduation, I worked in a community pharmacy for one month, and on my second day I handed in my notice. And absolutely no regret!!&lt;br&gt;I believe we—most of the time—have a choice, but I realized that what matters more than the choice itself is how you deal with the unexpected. You can think about your choices a lot, study them carefully, and do your research, but if you are not ready for the unexpected by having resilience and flexibility, you will not be ready for life. Choice is a fun game, but it is not enough on its own.&lt;br&gt;That said, I recognize this is a very privileged perspective. Not everyone has the luxury of choice, and in many parts of the world people are simply fighting to survive.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sharvari:&lt;/strong&gt; &amp;quot;If you are not ready for the unexpected by having resilience and flexibility, you will not be ready for life&amp;quot; - I might be quoting that one back to myself for a long time. Speaking of being ready for the unexpected, AI is reshaping bioinformatics rapidly. How is it changing your day-to-day work, and what should newcomers to the field understand about that shift?&lt;br&gt;&lt;strong&gt;Aysheh:&lt;/strong&gt; It is changing the field a lot, just as it is changing the whole world, and there is a lot of fear as well as excitement as the AI industry is currently navigating the &amp;#39;Peak of Inflated Expectations&amp;#39; and beginning its descent into the &amp;#39;Trough of Disillusionment&amp;#39; of the hype cycle. In my day to day, it is increasing my productivity by giving me faster access to answers, such as fixing a bug in my script, so the expectations are now higher for delivering more in a shorter time.&lt;br&gt;For newcomers, I would say: keep yourself updated with all the new AI tools and their latest versions that best suit and serve your role. Always double check the scripts written with AI and the outcomes generated by AI, and make sure you truly understand them. It is also worth learning about Agentic AI. Finally, work on your communication skills, because the shift in bioinformatics will place us at the interface between AI, research, and biology groups.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sharvari:&lt;/strong&gt; A measured and very practical take - I appreciate that you neither hype it nor dismiss it. Now, if AI is buying us back some time, here&amp;#39;s a fun question: if you had all the time and resources in the world, what bioinformatics project would you work on?&lt;br&gt;&lt;strong&gt;Aysheh:&lt;/strong&gt; It would be a project in aging research, and I will be returning to DNA methylation, but this time with a more integrated approach. DNA methylation is my first love in science, and as they say, it is very difficult to forget your first love!!&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sharvari:&lt;/strong&gt; I&amp;#39;ll definitely be keeping an eye out for your work - &lt;em&gt;“DNA meth-aysheh-lation”&lt;/em&gt;, if you will. Here&amp;#39;s one I&amp;#39;ve been excited to ask: if you ran into the 10-year-younger and the 10-year-older version of yourself, and you could only say one thing to each of them - what would it be?&lt;br&gt;&lt;strong&gt;Aysheh:&lt;/strong&gt; The advice I would give my younger self is: trust the process and just keep going. I always used to put in more effort than necessary, worrying about what might go wrong and fearing I would regret not trying hard enough, which kept me in a cycle of unnecessary stress. Stress is inevitable in adult life, but it is not something young people should have to carry. I wish I had enjoyed my school years more. As for my older self, I cannot wait to meet you, and things will be great and even more exciting. It is only upward from here in every aspect, because we are ready.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sharvari:&lt;/strong&gt; That message to your older self genuinely gave me goosebumps. I have a feeling both versions of you would be deeply proud of who you are right now - not just for what you&amp;#39;ve achieved, but for how you&amp;#39;ve grown into yourself along the way. As someone juggling a lot of responsibilities, what&amp;#39;s the best way for you to unwind after a tough day?&lt;br&gt;&lt;strong&gt;Aysheh:&lt;/strong&gt; On a warm sunny day, I would love to go for a walk and watch the sunset. On a rainy or cold day, I would prefer staying in and watching TV. I actually have a spreadsheet of all the movies I have watched along with my ratings, if you are interested!&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sharvari:&lt;/strong&gt; Oh, consider me very interested - I&amp;#39;ll be following up on that. If someone could also generate a spreadsheet of all the bioinformatics tools along with their ratings, I would also be interested in that, haha..! Before we wrap, one last thing - let&amp;#39;s play a game. Tell me two truths and a lie about yourself, and we&amp;#39;ll leave our readers to guess which is which.&lt;br&gt;&lt;strong&gt;Aysheh:&lt;/strong&gt; I completed the Everest Base Camp trek, I have 6.2 million views on my Google Maps photos, and I have 10 siblings.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sharvari:&lt;/strong&gt; I have my guess. Readers, I’ll let you form yours. Aysheh, thank you so much for this conversation, for the openness you brought to every question, and for the wisdom tucked into all of your answers. I hope I get the chance to talk to your 10 year older self in the future for another interview!&lt;/p&gt;
</content:encoded><category>member-spotlight</category><category>interview</category><category>career-development</category><category>epigenetics</category><category>computational-biology</category><category>bioinformatics</category><category>mentorship</category><category>women-in-science</category><category>career-advice</category><category>sponsorship</category></item><item><title>Biobank Intro Series: All of Us Genetic Data</title><link>https://boston-wib.org/blog/biobank-intro-series/08-aou-genetic-data</link><guid isPermaLink="true">https://boston-wib.org/blog/biobank-intro-series/08-aou-genetic-data</guid><pubDate>Tue, 14 Apr 2026 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/biobank1/bucketofblobs.png&quot; alt=&quot;A tree ring visualization where each ring is a relevant UKB release&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Navigating All of Us genotype data for variant extraction and analysis&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;If UK Biobank RAP gives you a filing cabinet with labeled drawers, All of Us gives you a bucket of blobs. Technically everything you need is in there. Somewhere. In blob form. The official Google Cloud Storage documentation uses the word &amp;quot;blob&amp;quot; without apparent embarrassment. I have thoughts about this. They are not printable.&lt;/p&gt;
&lt;p&gt;Yes, &amp;quot;blobs&amp;quot; is the technical term for flat storage objects, organized by path conventions rather than a true directory hierarchy. There are no folders, just prefixes that cosplay as folders. Until you internalize this, you may spend a lot of time fishing in the bucket.&lt;/p&gt;
&lt;h2&gt;Quick Overview: What&amp;#39;s Available&lt;/h2&gt;
&lt;figure class=&quot;my-8 mx-auto !max-w-lg&quot;&gt;
&lt;img src=&quot;https://boston-wib.org//blog_images/biobank1/aou_genetic_data.png&quot; alt=&quot;Woman with a flashlight searching through a dark basement packed with disorganized filing cabinets, papers, and boxes, representing the scattered data organization within All of Us.&quot; class=&quot;!max-w-none mx-auto w-full&quot; &gt;
&lt;figcaption class=&quot;text-center text-sm opacity-80 mt-2&quot;&gt;
   &lt;em&gt;Working with All of Us data can feel like searching a dim warehouse. Valuable datasets are everywhere, but finding the right files requires some exploration.&lt;/em&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;The All of Us documentation assures us that the data exists. Somewhere. In some format. Accessible via... &lt;a href=&quot;https://support.researchallofus.org/hc/en-us/articles/29475233432212-Controlled-CDR-Directory&quot;&gt;environment variables&lt;/a&gt;? Googling variations of &amp;quot;all of us wgs sequencing data&amp;quot; will lead you to get increasingly creative with &lt;code&gt;gsutil ls&lt;/code&gt; commands. Let me save you some time. Here&amp;#39;s where the genotype data actually lives.&lt;/p&gt;
&lt;p&gt;Similar to the &lt;a href=&quot;../07-genotypeUKB&quot;&gt;recent WGS releases in UK Biobank&lt;/a&gt;, All of Us provides whole genome sequencing data in phased and unphased VCF formats. The concepts covered there, what “phased” means and when you’d choose one format over the other, carry over directly so I won’t re-explain them here. What’s different is where the files live and how you get data out of them.&lt;/p&gt;
&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Format&lt;/th&gt;
&lt;th&gt;Notes&lt;/th&gt;
&lt;th&gt;Path&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;&lt;tr&gt;
&lt;td&gt;Phased VCFs&lt;/td&gt;
&lt;td&gt;By chromosome, easiest to access&lt;/td&gt;
&lt;td&gt;&lt;code&gt;gs://fc-aou-datasets-controlled/v8/wgs/short_read/snpindel/aux/phasing/chr${CHROM}_AOU_v8.2_allsamples_phased.vcf.gz&lt;/code&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Unphased VCFs&lt;/td&gt;
&lt;td&gt;Sharded into 20,016 files&lt;/td&gt;
&lt;td&gt;&lt;code&gt;gs://fc-aou-datasets-controlled/v8/wgs/short_read/snpindel/exome/vcf/${BATCH}.vcf.bgz&lt;/code&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Hail MatrixTable&lt;/td&gt;
&lt;td&gt;For distributed computing&lt;/td&gt;
&lt;td&gt;&lt;code&gt;$WGS_EXOME_MULTI_HAIL_PATH&lt;/code&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Variant Annotation Table (VAT)&lt;/td&gt;
&lt;td&gt;Gene-level annotations&lt;/td&gt;
&lt;td&gt;&lt;code&gt;gs://fc-aou-datasets-controlled/v8/wgs/short_read/snpindel/aux/vat/vat_complete.bgz.tsv.gz&lt;/code&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;&lt;/table&gt;
&lt;h2&gt;Streaming VCFs from GCS&lt;/h2&gt;
&lt;p&gt;On UKB RAP, we worked around &lt;code&gt;dx cat&lt;/code&gt; buffering problems by generating a temporary HTTPS URL with &lt;code&gt;dx make_download_url&lt;/code&gt; and pointing bcftools at that directly (see &lt;a href=&quot;../02-hardwareOnUKBandAoU&quot;&gt;post 02&lt;/a&gt;). On AoU, &lt;code&gt;gsutil cat&lt;/code&gt; actually streams without buffering, so you can pipe it straight into bcftools. No URL workaround needed. If your environment is authenticated via &lt;code&gt;GOOGLE_APPLICATION_CREDENTIALS&lt;/code&gt; or &lt;code&gt;gcloud auth&lt;/code&gt;, bcftools may be able to read gs:// paths directly, though I haven&amp;#39;t tested this.&lt;/p&gt;
&lt;details open&gt;
&lt;summary&gt;Code&lt;/summary&gt;

&lt;pre&gt;&lt;code class=&quot;language-bash&quot;&gt;# Phased VCF
PHASED_VCF=&amp;quot;gs://fc-aou-datasets-controlled/v8/wgs/short_read/snpindel/aux/phasing/chr${CHROM}_AOU_v8.2_allsamples_phased.vcf.gz&amp;quot;
gsutil -u $GOOGLE_PROJECT cat ${PHASED_VCF} \
  | bcftools view -t ${CHROM}:${REGION_START}-${REGION_END} -O z -o phased_region.vcf.gz

# Unphased VCF shard
UNPHASED_VCF=&amp;quot;gs://fc-aou-datasets-controlled/v8/wgs/short_read/snpindel/exome/vcf/${BATCH}.vcf.bgz&amp;quot;
gsutil -u $GOOGLE_PROJECT cat ${UNPHASED_VCF} \
  | bcftools view -t ${CHROM}:${REGION_START}-${REGION_END} -O z -o unphased_region.vcf.gz
&lt;/code&gt;&lt;/pre&gt;
&lt;/details&gt;

&lt;p&gt;The phased files do come with &lt;code&gt;.tbi&lt;/code&gt; indexes, but since piped input isn&amp;#39;t seekable, bcftools can&amp;#39;t use them. The &lt;code&gt;-t&lt;/code&gt; flag streams and filters for both file types. The index is there if you ever download a file locally and query it repeatedly, but for the streaming workflow it doesn&amp;#39;t factor in.&lt;/p&gt;
&lt;h2&gt;Finding Your Unphased Shard&lt;/h2&gt;
&lt;p&gt;The unphased VCF files live at &lt;code&gt;gs://fc-aou-datasets-controlled/v8/wgs/short_read/snpindel/exome/vcf&lt;/code&gt;, sharded by genomic position into files numbered &lt;code&gt;0000000000&lt;/code&gt; to &lt;code&gt;0000020016&lt;/code&gt;. There is no chromosome in the filename and no documented estimate of how many base pairs each shard covers. Each shard has a companion &lt;code&gt;.interval_list&lt;/code&gt; file that tells you what regions it contains, which means the only way to find your region is to open interval lists until you find it.&lt;/p&gt;
&lt;p&gt;The internet has fancy solutions for finding your regions of choice, but if you only need one region and you&amp;#39;re willing to be a little scrappy, a manual binary search is often faster than building proper infrastructure: jump to the middle of the file range, check the interval list with gsutil, then halve the range again based on whether your region fell above or below. Looping through all 20,016 files programmatically would be painfully slow. Use the script below as a verification tool, not a search engine, unless you have a long weekend and a lot of patience.&lt;/p&gt;
&lt;details open&gt;
&lt;summary&gt;Code&lt;/summary&gt;

&lt;pre&gt;&lt;code class=&quot;language-python&quot;&gt;import pandas as pd
import subprocess
from tqdm import tqdm

# SET YOUR REGION OF INTEREST HERE
chrom = &amp;quot;11&amp;quot;
region_start = 47331406
region_end = 47352702

vcf_filelist = []
start = False
for i in tqdm(range(0, 20016)):
    file_int = str(i).zfill(10)
    interval_f = f&amp;quot;gs://fc-aou-datasets-controlled/v8/wgs/short_read/snpindel/exome/vcf/{file_int}.interval_list&amp;quot;
    cmd = f&amp;quot;gsutil -u $GOOGLE_PROJECT cat {interval_f} | grep &amp;#39;^chr&amp;#39;&amp;quot;
    p = subprocess.Popen(cmd, shell=True, stdout=subprocess.PIPE)
    range_df = pd.read_csv(p.stdout, header=None, sep=&amp;quot;\t&amp;quot;)
    range_df.columns = [&amp;#39;chrom&amp;#39;, &amp;#39;start&amp;#39;, &amp;#39;stop&amp;#39;, &amp;#39;strand&amp;#39;, &amp;#39;score&amp;#39;]
    if not start:
        region_count = (
            (range_df[&amp;#39;chrom&amp;#39;].str.contains(chrom)) &amp;amp;
            (range_df[&amp;#39;start&amp;#39;] &amp;gt;= region_start)
        ).sum()
        if region_count &amp;gt; 0:
            vcf_filelist.append(file_int)
            print(file_int)
            start = True
    else:
        region_count = (
            (range_df[&amp;#39;chrom&amp;#39;].str.contains(chrom)) &amp;amp;
            (range_df[&amp;#39;stop&amp;#39;] &amp;lt;= region_end)
        ).sum()
        vcf_filelist.append(file_int)
        if region_count &amp;gt; 0:
            print(file_int)
            break
&lt;/code&gt;&lt;/pre&gt;
&lt;/details&gt;

&lt;p&gt;If you&amp;#39;re planning to do this repeatedly, build an index of shard positions once and query that instead. Why one doesn&amp;#39;t already exist is a great question.&lt;/p&gt;
&lt;h2&gt;Extracting and Merging Your Region&lt;/h2&gt;
&lt;p&gt;Once you have your shard list, stream each one through bcftools to subset to your region, then merge:&lt;/p&gt;
&lt;details open&gt;
&lt;summary&gt;Code&lt;/summary&gt;

&lt;pre&gt;&lt;code class=&quot;language-bash&quot;&gt;# SET YOUR REGION OF INTEREST HERE
chrom=&amp;quot;11&amp;quot;
region_start=47331406
region_end=47352702

shard1=0000002366  # FIRST_SHARD_NUMBER
shard2=0000002367  # LAST_SHARD_NUMBER

BASE=&amp;quot;gs://fc-aou-datasets-controlled/v8/wgs/short_read/snpindel/exome/vcf&amp;quot;

# 1. Subset each shard to your region
gsutil -u $GOOGLE_PROJECT cat ${BASE}/${shard1}.vcf.bgz \
  | bcftools view -t ${chrom}:${region_start}-${region_end} -O z -o ${shard1}_region.vcf.gz

gsutil -u $GOOGLE_PROJECT cat ${BASE}/${shard2}.vcf.bgz \
  | bcftools view -t ${chrom}:${region_start}-${region_end} -O z -o ${shard2}_region.vcf.gz

# 2. Index the filtered VCFs
tabix -p vcf ${shard1}_region.vcf.gz
tabix -p vcf ${shard2}_region.vcf.gz

# 3. Concatenate
bcftools concat -a -O z \
  -o merged_AOU_v8_unphased.vcf.gz \
  ${shard1}_region.vcf.gz \
  ${shard2}_region.vcf.gz

# 4. Index the merged VCF
tabix -p vcf merged_AOU_v8_unphased.vcf.gz

# 5. Save to your workspace bucket
gsutil -u $GOOGLE_PROJECT cp merged_AOU_v8_unphased.vcf.gz $WORKSPACE_BUCKET/data/
&lt;/code&gt;&lt;/pre&gt;
&lt;/details&gt;

&lt;h2&gt;What About Hail?&lt;/h2&gt;
&lt;p&gt;As I mentioned in &lt;a href=&quot;../02-hardwareOnUKBandAoU&quot;&gt;post 02&lt;/a&gt;, Hail requires an expensive Spark cluster and is overkill for single-gene or single-region work. For those cases, bcftools is faster and cheaper. If you&amp;#39;re doing something genuinely genome-wide that needs distributed computing, the MatrixTable is available at &lt;code&gt;$WGS_EXOME_MULTI_HAIL_PATH&lt;/code&gt;. If you go that route, always filter at read time using the &lt;code&gt;_intervals&lt;/code&gt; parameter. Loading the full genome MatrixTable on the default cluster will crash it.&lt;/p&gt;
&lt;details open&gt;
&lt;summary&gt;Code&lt;/summary&gt;

&lt;pre&gt;&lt;code class=&quot;language-python&quot;&gt;import hail as hl
import os

chrom = &amp;quot;11&amp;quot;
region_start = 47331406
region_end = 47352702

aou_wgs_mt = os.getenv(&amp;quot;WGS_EXOME_MULTI_HAIL_PATH&amp;quot;)

# Filter at READ time, not after loading
your_interval = hl.eval(hl.parse_locus_interval(f&amp;#39;{chrom}:{region_start}-{region_end}&amp;#39;))
mt = hl.read_matrix_table(
    aou_wgs_mt,
    _intervals=[your_interval]  # Only load this region!
)
&lt;/code&gt;&lt;/pre&gt;
&lt;/details&gt;

&lt;h2&gt;Bonus: The Variant Annotation Table (VAT)&lt;/h2&gt;
&lt;p&gt;If you need variant annotations like gene names and predicted consequences, All of Us provides a massive TSV file you can grep by gene name:&lt;/p&gt;
&lt;details open&gt;
&lt;summary&gt;Code&lt;/summary&gt;

&lt;pre&gt;&lt;code class=&quot;language-bash&quot;&gt;#!/bin/bash

# SET YOUR GENE OF INTEREST HERE
gene=&amp;quot;MYBPC3&amp;quot;

project=$GOOGLE_PROJECT
bucket=$WORKSPACE_BUCKET
vat_f=&amp;quot;gs://fc-aou-datasets-controlled/v8/wgs/short_read/snpindel/aux/vat/vat_complete.bgz.tsv.gz&amp;quot;

echo &amp;quot;Starting ${gene} extraction at $(date)&amp;quot;
gsutil -u $project cat $vat_f | gunzip | head -1 &amp;gt; ${gene}_vat_v8.tsv
echo &amp;quot;Header retrieved at $(date)&amp;quot;

echo &amp;quot;Extracting ${gene} variants...&amp;quot;
gsutil -u $project cat $vat_f | gunzip | grep $&amp;#39;\t&amp;#39;&amp;quot;${gene}&amp;quot;$&amp;#39;\t&amp;#39; &amp;gt;&amp;gt; ${gene}_vat_v8.tsv
echo &amp;quot;Done at $(date)&amp;quot;

wc -l ${gene}_vat_v8.tsv

echo &amp;quot;Uploading to workspace bucket...&amp;quot;
gsutil -u $project cp ${gene}_vat_v8.tsv ${bucket}/data/${gene}_vat_v8.tsv
echo &amp;quot;Upload complete at $(date)&amp;quot;
&lt;/code&gt;&lt;/pre&gt;
&lt;/details&gt;

&lt;p&gt;&lt;strong&gt;Before you run this:&lt;/strong&gt; set your idle timeout to at least 8 hours. The file is ~150 GB compressed, grepping through it takes 45 minutes or more, and if your instance sleeps halfway through you&amp;#39;re starting over. The output is human-readable and easy to work with, but it&amp;#39;s not indexed, so tabix isn&amp;#39;t an option. What you see above is about as fast as it gets without an index.&lt;/p&gt;
&lt;h2&gt;Putting It All Together&lt;/h2&gt;
&lt;p&gt;If there&amp;#39;s a theme to this post, it&amp;#39;s this: All of Us has enormous data and almost no indexes to help you navigate it. The unphased VCFs are sharded into 20,016 files with no chromosome in the filename. The VAT is 150 GB compressed with no tabix support. The documentation points you to environment variables and leaves the rest to you. You&amp;#39;re not doing something wrong when it takes a while. That&amp;#39;s just the deal. Start the script, set your idle timeout, and go do something else.&lt;/p&gt;
&lt;p&gt;Could someone build proper indexes for all of this? Yes. Should they? Absolutely. Is that a job for a skilled computational biologist who understands the data well enough to do it right? Also yes. If you&amp;#39;re reading this and thinking &amp;quot;someone should fix that,&amp;quot; that someone could be you.&lt;/p&gt;
&lt;p&gt;Go forth and blob on.&lt;/p&gt;
</content:encoded><category>biobank</category><category>all-of-us</category><category>genotype-data</category><category>variant-calls</category><category>variant-annotation</category></item><item><title>Biobank Intro Series: UK Biobank Genetic Data</title><link>https://boston-wib.org/blog/biobank-intro-series/07-ukb-genetic-data</link><guid isPermaLink="true">https://boston-wib.org/blog/biobank-intro-series/07-ukb-genetic-data</guid><pubDate>Tue, 07 Apr 2026 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/biobank1/ukb_genetic_timeline.png&quot; alt=&quot;A tree ring visualization where each ring is a relevant UKB release&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Traversing genetic data on the UK Biobank RAP (Research Analysis Platform) environment&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Genetic data. We made it. Coming from my Ph.D. in plant epigenetics, I was used to raw sequencing data from one replicate at a time. In contrast, the UK Biobank (UKB) has around 500,000 participants. The enormity of it can become an unbearable dead weight if you let it. The fix is straightforward: fit the data to your analysis, not the other way around. In my work, I&amp;#39;ve focused on a single region or a handful of variants at a time. That means streaming raw files from the data repository and filtering on the fly, rather than pulling them down in full with DNAnexus (see &lt;a href=&quot;../02-hardwareOnUKBandAoU&quot;&gt;my previous blog on UKB hardware&lt;/a&gt;). The first step is knowing what genetic data UKB actually has, where it lives, and how to filter it before it becomes your problem.&lt;/p&gt;
&lt;h2&gt;The Showcase Knows About Genetics Too&lt;/h2&gt;
&lt;p&gt;In my previous post about the &lt;a href=&quot;../03-ukb-showcase&quot;&gt;UKB Showcase&lt;/a&gt;, I focused on observational fields, but the Showcase also catalogs &lt;a href=&quot;https://biobank.ctsu.ox.ac.uk/crystal/label.cgi?id=100314&quot;&gt;genetic data resources&lt;/a&gt;, including:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Microarray genotyping data&lt;/strong&gt; — directly measured variants at selected positions across the genome&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Imputed genotypes&lt;/strong&gt; — variants inferred by combining array data with full-genome reference panels&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Exome sequencing data&lt;/strong&gt; — sequenced protein-coding regions (~2% of the genome)&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Whole genome sequencing&lt;/strong&gt; — full genome coverage&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Phased haplotypes&lt;/strong&gt; — estimates of which alleles sit on the same chromosome (i.e., are inherited together)&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Depending on the research question, you may need multiple data types. For example, phased haplotype data provides haplotype structure but excludes rare variants, so you might pair it with whole genome sequencing calls to get both. These data types were generated using different tools and released at
different times, and I find it helpful to know that context to not only
pick the right data, but also to understand the publications that
used UKB genetic data before me. UKB provides a &lt;a href=&quot;https://community.ukbiobank.ac.uk/hc/en-gb/articles/26655145866269-Past-data-releases&quot;&gt;comprehensive timeline&lt;/a&gt; of their data releases, but coming from a sequencing background, the terminology didn&amp;#39;t sit well with me. For example, &amp;quot;genotyping data&amp;quot; actually means microarray data — not sequencing. Here are the current releases you&amp;#39;ll most likely use:&lt;/p&gt;
&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Year&lt;/th&gt;
&lt;th&gt;Data Type&lt;/th&gt;
&lt;th&gt;Field ID&lt;/th&gt;
&lt;th&gt;Participants&lt;/th&gt;
&lt;th&gt;Notes&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;&lt;tr&gt;
&lt;td&gt;2017&lt;/td&gt;
&lt;td&gt;Microarray + imputation&lt;/td&gt;
&lt;td&gt;22418&lt;/td&gt;
&lt;td&gt;488,377&lt;/td&gt;
&lt;td&gt;~800K measured; ~96M imputed [^1]&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;2022&lt;/td&gt;
&lt;td&gt;Exome sequencing (final)&lt;/td&gt;
&lt;td&gt;23141&lt;/td&gt;
&lt;td&gt;~470,000&lt;/td&gt;
&lt;td&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;2022&lt;/td&gt;
&lt;td&gt;Imputation (new panels)&lt;/td&gt;
&lt;td&gt;21007&lt;/td&gt;
&lt;td&gt;488,377&lt;/td&gt;
&lt;td&gt;GEL and TOPMed; now in GRCh38&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;2023&lt;/td&gt;
&lt;td&gt;Whole genome sequencing&lt;/td&gt;
&lt;td&gt;24311&lt;/td&gt;
&lt;td&gt;~500,000&lt;/td&gt;
&lt;td&gt;ML corrections; DRAGEN; unphased&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;2025&lt;/td&gt;
&lt;td&gt;WGS (updated)&lt;/td&gt;
&lt;td&gt;30108&lt;/td&gt;
&lt;td&gt;~500,000&lt;/td&gt;
&lt;td&gt;phased VCFs&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;&lt;/table&gt;
&lt;p&gt;And here are the earlier releases you may encounter in older publications:&lt;/p&gt;
&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Year&lt;/th&gt;
&lt;th&gt;Data Type&lt;/th&gt;
&lt;th&gt;Participants&lt;/th&gt;
&lt;th&gt;Notes&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;&lt;tr&gt;
&lt;td&gt;2019&lt;/td&gt;
&lt;td&gt;Exome sequencing&lt;/td&gt;
&lt;td&gt;50,000&lt;/td&gt;
&lt;td&gt;Alignment errors identified [^2]&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;2020&lt;/td&gt;
&lt;td&gt;Exome sequencing (reprocessed)&lt;/td&gt;
&lt;td&gt;200,000&lt;/td&gt;
&lt;td&gt;New OQFE pipeline [^3]&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;2021&lt;/td&gt;
&lt;td&gt;Exome sequencing&lt;/td&gt;
&lt;td&gt;~454,000&lt;/td&gt;
&lt;td&gt;[^4]&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;2021&lt;/td&gt;
&lt;td&gt;Whole genome sequencing&lt;/td&gt;
&lt;td&gt;200,000&lt;/td&gt;
&lt;td&gt;[^5]&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;&lt;/table&gt;
&lt;h2&gt;Where Are the Files? (The Practical Part)&lt;/h2&gt;
&lt;figure class=&quot;my-8 !max-w-none&quot;&gt;
&lt;img src=&quot;https://boston-wib.org//blog_images/biobank1/dx_jupyternotebook.png&quot; alt=&apos;Screenshot of the DNAnexus JupyterLab launcher interface. Two arrows are overlaid: &quot;Local Working Directory&quot; points to the top section of the left sidebar, and &quot;Data Storage&quot; points to the DNAnexus tab below it. The main panel shows kernel options for notebooks (DNAnexus Notebook, Python 3, Bash, R), console, and other file types.&apos; style=&quot;max-height: 600px; width: auto;&quot; /&gt;
&lt;figcaption class=&quot;text-center text-sm opacity-80 mt-2&quot;&gt;
   &lt;em&gt; The JupyterLab launcher on UKB RAP. The left sidebar gives you access to both storage spaces: your local working directory at the top (temporary, session-only) and the DNAnexus project storage below it (persistent, where Bulk data lives). Clicking &quot;DNAnexus Notebook&quot; in the launcher (boxed in red) opens a notebook that saves directly to project storage; no manual upload needed.&lt;/em&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;If you need a refresher on the two storage spaces, see the &lt;a href=&quot;../02-hardwareOnUKBandAoU&quot;&gt;hardware post&lt;/a&gt;. The genetic data lives in data storage under a directory called &amp;quot;Bulk&amp;quot;. Filenames contain field ID numbers, so you can cross-reference them with the UK Biobank Showcase. For WGS data, field ID 24311 corresponds to the ML-corrected DRAGEN release. As of the latest release, this is often the one you want.&lt;/p&gt;
&lt;p&gt;Start broad to learn the file structure, then narrow down. Running&lt;/p&gt;
&lt;pre&gt;&lt;code class=&quot;language-{bash}&quot;&gt;dx find data --name &amp;quot;ukb24311*.vcf.gz&amp;quot; --folder /Bulk | head
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;reveals that files follow the pattern:&lt;/p&gt;
&lt;p&gt;&lt;code&gt;/Bulk/DRAGEN WGS/ML-corrected DRAGEN population level WGS variants, pVCF format [500k release]/chr{CHROMOSOME}/ukb24311_c{CHROMOSOME}_b{BATCH}_v1.vcf.gz&lt;/code&gt;&lt;/p&gt;
&lt;p&gt;Yes, the path has spaces. Always quote it. Once you know that, you can narrow the search to a specific chromosome. Even then, WGS data is split across many batch files, so &lt;code&gt;| head&lt;/code&gt; is still warranted:&lt;/p&gt;
&lt;pre&gt;&lt;code class=&quot;language-{bash}&quot;&gt;dx find data --name &amp;quot;ukb24311_c11_*.vcf.gz&amp;quot; --folder &amp;quot;/Bulk/DRAGEN WGS/ML-corrected DRAGEN population level WGS variants, pVCF format [500k release]/chr11&amp;quot; | head
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Unfortunately the official documentation is no help here. The best I found was a
&lt;a href=&quot;https://community.ukbiobank.ac.uk/hc/en-gb/community/posts/16790347396253-Question-regarding-batches&quot;&gt;two-year-old community forum reply&lt;/a&gt; suggesting each batch covers roughly 20,000 bp. Thanks, George F. You saved me more time than UKB&amp;#39;s own docs did. Take the estimate as a ballpark, not a guarantee.&lt;/p&gt;
&lt;p&gt;Here is the approach in practice. To find positions 47,331,406 - 47,352,702 on chromosome 11, you can divide 47,331,406 by 20,000 to derive batch 2366 as a reasonable first guess. I wrote a small script (&lt;code&gt;firstpos.sh&lt;/code&gt;) to check the first position of any candidate file:&lt;/p&gt;
&lt;pre&gt;&lt;code class=&quot;language-bash&quot;&gt;#!/bin/bash

CHR=$1
BATCH=$2
FILE=&amp;quot;/Bulk/DRAGEN WGS/ML-corrected DRAGEN population level WGS variants, pVCF format [500k release]/chr${CHR}/ukb24311_c${CHR}_b${BATCH}_v1.vcf.gz&amp;quot;
URL=$(dx make_download_url &amp;quot;$FILE&amp;quot; --duration 1h)
bcftools query -f &amp;#39;%POS\n&amp;#39; &amp;quot;$URL&amp;quot; | head -1
&lt;/code&gt;&lt;/pre&gt;
&lt;pre&gt;&lt;code class=&quot;language-bash&quot;&gt;bash firstpos.sh 11 2366
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;returns 47,319,031. This is just below my target region, which is a good sign.&lt;/p&gt;
&lt;p&gt;Checking the next batch:&lt;/p&gt;
&lt;pre&gt;&lt;code class=&quot;language-bash&quot;&gt;bash firstpos.sh 11 2367
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;returns 47,339,029, meaning batch 2366 covers 47,319,031 - 47,339,028. My region ends at 47,352,702, so I check batch 2368 to find where that&amp;#39;s covered. Once you&amp;#39;ve identified which batches bracket your full region, you have your files. If the 20,000 bp estimate puts you nowhere near the right chromosome positions, jump by a larger increment and binary search from there. Eventually you&amp;#39;ll find your file. Then comes the next question: what do you do with it?&lt;/p&gt;
&lt;h2&gt;What&amp;#39;s Actually in These Files?&lt;/h2&gt;
&lt;p&gt;A VCF is a giant matrix. Lines starting with &lt;code&gt;#&lt;/code&gt; are metadata: pipeline details, descriptions for variant annotations in the INFO column, chromosome descriptions, and so on. The last &lt;code&gt;#&lt;/code&gt; line is the column header for the data rows below it. Each data row is one variant. The first nine columns describe that variant: chromosome, position, ID, reference allele, alternate allele, quality score, filter status, info fields, and format. Everything after column nine is per-sample genotype data. At biobank scale that&amp;#39;s 500,000 columns. The good news is that bcftools can filter by position without loading the entire matrix into memory, which is the whole reason streaming works at scale.&lt;/p&gt;
&lt;h2&gt;Code to Stream Genetic Data in UK Biobank&lt;/h2&gt;
&lt;p&gt;That&amp;#39;s exactly what the &lt;code&gt;--regions&lt;/code&gt; flag does.&lt;/p&gt;
&lt;pre&gt;&lt;code class=&quot;language-{bash}&quot;&gt;FILE=&amp;quot;/Bulk/DRAGEN WGS/ML-corrected DRAGEN population level WGS variants, pVCF format [500k release]/chr11/ukb24311_c11_b2366_v1.vcf.gz&amp;quot;
URL=$(dx make_download_url &amp;quot;$FILE&amp;quot; --duration 1h)
bcftools view &amp;quot;$URL&amp;quot; \
  --regions 11:47331406-47352702 \
  -O z -o my_region.vcf.gz
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Bcftools reads only what it needs and stops. That&amp;#39;s the whole game: 500,000 participants, one small region, no downloading required.&lt;/p&gt;
&lt;h2&gt;The Short Version&lt;/h2&gt;
&lt;p&gt;The release timeline is worth knowing for reading older papers, but you don&amp;#39;t need to memorize it. For most analyses, start with the DRAGEN WGS release (field ID 24311). If your analysis requires haplotype structure, reach for the phased release (field ID 30108) instead, or use both if you need rare variant calls in haplotype context. Once you&amp;#39;ve found the right batch file, &lt;code&gt;dx make_download_url&lt;/code&gt;, hands bcftools a URL to a file that may be half a terabyte in size and lets it take only what it needs.&lt;/p&gt;
&lt;p&gt;In the next post, I&amp;#39;ll walk through how All of Us handles its genetic data. Spoiler: the documentation isn&amp;#39;t better.&lt;/p&gt;
&lt;h2&gt;References&lt;/h2&gt;
&lt;p&gt;[^1]: Bycroft, C., Freeman, C., Petkova, D. et al. (2018). The UK Biobank resource with deep phenotyping and genomic data. &lt;em&gt;Nature&lt;/em&gt; 562, 203–209 &lt;a href=&quot;https://doi.org/10.1038/s41586-018-0579-z&quot;&gt;https://doi.org/10.1038/s41586-018-0579-z&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;[^2]: Van Hout, C. V., Tachmazidou, I., Backman, J. D., Hoffman, J. X., Ye, B., Pandey, A. K., et al. (2019). Whole exome sequencing and characterization of coding variation in 49,960 individuals in the UK Biobank. &lt;em&gt;BioRxiv&lt;/em&gt;, 572347, &lt;a href=&quot;https://doi.org/10.1101/572347&quot;&gt;https://doi.org/10.1101/572347&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;[^3]: Szustakowski, J. D., Balasubramanian, S., Kvikstad, E., Khalid, S., Bronson, P. G., Sasson, A., ... &amp;amp; Reid, J. G. (2021). Advancing human genetics research and drug discovery through exome sequencing of the UK Biobank. Nature genetics, 53(7), 942-948.&lt;/p&gt;
&lt;p&gt;[^4]: Backman, J.D., Li, A.H., Marcketta, A. et al. (2021). Exome sequencing and analysis of 454,787 UK Biobank participants. Nature 599, 628–634, &lt;a href=&quot;https://doi.org/10.1038/s41586-021-04103-z&quot;&gt;https://doi.org/10.1038/s41586-021-04103-z&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;[^5]: Halldorsson, B.V., Eggertsson, H.P., Moore, K.H.S. et al. (2022). The sequences of 150,119 genomes in the UK Biobank. Nature 607, 732–740, &lt;a href=&quot;https://doi.org/10.1038/s41586-022-04965-x&quot;&gt;https://doi.org/10.1038/s41586-022-04965-x&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;[^6]: The UK Biobank Whole-Genome Sequencing Consortium. (2025).Whole-genome sequencing of 490,640 UK Biobank participants. Nature 645, 692–701 &lt;a href=&quot;https://doi.org/10.1038/s41586-025-09272-9&quot;&gt;https://doi.org/10.1038/s41586-025-09272-9&lt;/a&gt;&lt;/p&gt;
</content:encoded><category>biobank</category><category>uk-biobank</category><category>genotype-data</category><category>variant-calls</category><category>variant-annotation</category></item><item><title>The Rite of Process</title><link>https://boston-wib.org/blog/quicktake/wgs-analysis-poem</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/wgs-analysis-poem</guid><pubDate>Thu, 02 Apr 2026 14:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2026-03-25-the_rite_of_process.png&quot; alt=&quot;DNA sequencing&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;A Poem on Whole Genome Sequencing Analysis&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;You sat there, together yet all alone,&lt;/p&gt;
&lt;p&gt;in the endless cold and dark —&lt;/p&gt;
&lt;p&gt;waiting, for days, with bated breath,&lt;/p&gt;
&lt;p&gt;to be rescued.&lt;/p&gt;
&lt;p&gt;They came, promising an escape — a quick way out —&lt;/p&gt;
&lt;p&gt;the ones in white coats made many promises,&lt;/p&gt;
&lt;p&gt;but kept only a few.&lt;/p&gt;
&lt;p&gt;Would you say you were rescued&lt;/p&gt;
&lt;p&gt;if you are no longer who you used to be?&lt;/p&gt;
&lt;p&gt;Or would you say you were elevated,&lt;/p&gt;
&lt;p&gt;because that sounds more convincing?&lt;/p&gt;
&lt;p&gt;They told you, when they came to save you,&lt;/p&gt;
&lt;p&gt;that you’d be better off alone.&lt;/p&gt;
&lt;p&gt;The journey ahead demanded individuality —&lt;/p&gt;
&lt;p&gt;even if it meant tearing you into two,&lt;/p&gt;
&lt;p&gt;before putting the entire picture together.&lt;/p&gt;
&lt;p&gt;And so you agreed,&lt;/p&gt;
&lt;p&gt;because you knew that there was only so much you could control,&lt;/p&gt;
&lt;p&gt;especially once they began stripping you down&lt;/p&gt;
&lt;p&gt;to see if you were fit to continue,&lt;/p&gt;
&lt;p&gt;even if you were already in parts.&lt;/p&gt;
&lt;p&gt;You didn’t say a word —&lt;/p&gt;
&lt;p&gt;you just silently prayed&lt;/p&gt;
&lt;p&gt;that you would still have enough of yourself left&lt;/p&gt;
&lt;p&gt;when they arrogantly came to assemble you.&lt;/p&gt;
&lt;p&gt;The white coats put you together&lt;/p&gt;
&lt;p&gt;in a haphazard fashion —&lt;/p&gt;
&lt;p&gt;brazen, annoyed, careless —&lt;/p&gt;
&lt;p&gt;you weren’t perfect,&lt;/p&gt;
&lt;p&gt;but you were suitable for their purpose.&lt;/p&gt;
&lt;p&gt;The journey ahead demanded identity and purity.&lt;/p&gt;
&lt;p&gt;Where you came from mattered&lt;/p&gt;
&lt;p&gt;only in the context of who you were now.&lt;/p&gt;
&lt;p&gt;You had started, together yet all alone,&lt;/p&gt;
&lt;p&gt;a dozen of you,&lt;/p&gt;
&lt;p&gt;in the endless cold and dark.&lt;/p&gt;
&lt;p&gt;And after a sequence of events,&lt;/p&gt;
&lt;p&gt;here you were, at the end of the road,&lt;/p&gt;
&lt;p&gt;back with the dozen others,&lt;/p&gt;
&lt;p&gt;hoping to be remembered&lt;/p&gt;
&lt;p&gt;more than you waited to be rescued.&lt;/p&gt;
&lt;p&gt;For what else was the point&lt;/p&gt;
&lt;p&gt;of making you whole again,&lt;/p&gt;
&lt;p&gt;if only to leave you&lt;/p&gt;
&lt;p&gt;until you become a statistic mentioned in passing,&lt;/p&gt;
&lt;p&gt;in the endless cold and dark.&lt;/p&gt;
</content:encoded><category>bioinformatics</category><category>whole-genome-sequencing</category><category>whole-genome-sequencing-analysis</category><category>wgs</category><category>wgs-analysis</category><category>poem</category><category>poetry</category><category>computational-biology</category><category>comp-bio</category></item><item><title>Biobank Intro Series: All of Us Observational Data (Part II)</title><link>https://boston-wib.org/blog/biobank-intro-series/06-aou-observational-data-partII</link><guid isPermaLink="true">https://boston-wib.org/blog/biobank-intro-series/06-aou-observational-data-partII</guid><pubDate>Tue, 31 Mar 2026 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/biobank1/omop_table_relationships_basic.png&quot; alt=&quot;Diagram showing the four-step OMOP workflow: look up concept IDs, query a clinical table (e.g., measurement, observation), label values with concept names, and join with person_id to build a cohort table.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Loading observational data in the All of Us Researcher Workbench&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;The UK has a single national health system, and it shows: one table, one field ID, a coding dictionary if you needed it. The US has hundreds of hospital systems, dozens of billing standards, and its data model matches this beloved chaotic energy. In All of Us, your data is spread across clinical tables, the coded values in those tables are translated via a separate concept table, and everything traces back to &lt;code&gt;person_id&lt;/code&gt;. More joins, more steps, but it&amp;#39;s just what a data model built on real EHR data looks like.&lt;/p&gt;
&lt;p&gt;With your concept IDs in hand (&lt;a href=&quot;../05-aou-omop&quot;&gt;Part I&lt;/a&gt;), the workflow has three moving parts: knowing which clinical table holds your concept, joining to the &lt;code&gt;concept&lt;/code&gt; table to decode any coded values, and merging everything back to &lt;code&gt;person&lt;/code&gt; to build your cohort. More pieces than UKB, but engineered for the diverse US healthcare system.&lt;/p&gt;
&lt;h2&gt;The OMOP-CDM Schema&lt;/h2&gt;
&lt;p&gt;The full &lt;a href=&quot;https://ohdsi.github.io/CommonDataModel/cdm53.html&quot;&gt;OMOP CDM schema&lt;/a&gt; is worth bookmarking. It documents every available table and its fields. Clinical tables contain records of observed medical and clinical data, where each row represents a single clinical event. In my experience, most analyses only touch a handful of clinical tables:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;code&gt;condition_occurrence&lt;/code&gt;: diagnoses observed by a provider or reported by a patient&lt;/li&gt;
&lt;li&gt;&lt;code&gt;measurement&lt;/code&gt;: lab values and test results (e.g., BMI, blood pressure)&lt;/li&gt;
&lt;li&gt;&lt;code&gt;observation&lt;/code&gt;: clinical facts that cannot be measured with a standardized test, such as medical history, family history, and lifestyle choices&lt;/li&gt;
&lt;li&gt;&lt;code&gt;drug_exposure&lt;/code&gt;: medication records, useful for validating diagnoses or identifying undiagnosed conditions&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The &lt;code&gt;person&lt;/code&gt; table is the anchor table of the schema. It holds one row per participant, and every clinical table links back to it via &lt;code&gt;person_id&lt;/code&gt;. Like clinical tables, it also uses &lt;code&gt;concept_id&lt;/code&gt; columns as foreign keys to the &lt;code&gt;concept&lt;/code&gt; table for decoding coded values.&lt;/p&gt;
&lt;h3&gt;Query a Clinical Table&lt;/h3&gt;
&lt;p&gt;The OMOP CDM tables live in Google BigQuery. The examples below use Python&amp;#39;s &lt;code&gt;pandas.read_gbq&lt;/code&gt; function, which takes a SQL string and returns a dataframe. R users can likely query them with the &lt;code&gt;bigrquery&lt;/code&gt; package. The examples assume the following setup:&lt;/p&gt;
&lt;details&gt;
&lt;summary&gt;Environment versions&lt;/summary&gt;
Python 3.10.16, pandas 2.0.3, google-cloud-bigquery 2.34.4, CDR C2024Q3R9
&lt;/details&gt;

&lt;pre&gt;&lt;code class=&quot;language-python&quot;&gt;import pandas as pd
import os
CDR = os.environ[&amp;#39;WORKSPACE_CDR&amp;#39;]
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;In Part I, we found that systolic blood pressure maps to concept ID 3004249 in the &lt;code&gt;measurement&lt;/code&gt; table. Let&amp;#39;s use that as our worked example:&lt;/p&gt;
&lt;pre&gt;&lt;code class=&quot;language-python&quot;&gt;sbp_query = f&amp;#39;&amp;#39;&amp;#39;
    SELECT
        measurement_id,
        person_id,
        measurement_concept_id,
        measurement_type_concept_id,
        measurement_date,
        measurement_datetime,
        value_as_number
    FROM `{CDR}.measurement`
    WHERE measurement_concept_id = 3004249&amp;#39;&amp;#39;&amp;#39;
sbp_df = pd.io.gbq.read_gbq(sbp_query, dialect=&amp;#39;standard&amp;#39;)
sbp_df
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;The SELECT statement exports these columns:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;measurement_id&lt;/strong&gt;: unique row identifier&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;person_id&lt;/strong&gt;: foreign key linking the &lt;code&gt;person&lt;/code&gt; and clinical tables&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;measurement_concept_id&lt;/strong&gt;: ID of the data field&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;measurement_type_concept_id&lt;/strong&gt;: ID of the data source (eg. EHR, Lab result, etc.)&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;measurement_date&lt;/strong&gt; / &lt;strong&gt;measurement_datetime&lt;/strong&gt;: date and time of the record&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;value_as_number&lt;/strong&gt;: numeric systolic blood pressure value&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The WHERE clause limits the measurement table to this field. Since we will stack multiple measurement types into a single cohort table, human-readable labels for &lt;code&gt;measurement_concept_id&lt;/code&gt; and &lt;code&gt;measurement_type_concept_id&lt;/code&gt; are essential for distinguishing which rows belong to which measurement.&lt;/p&gt;
&lt;h3&gt;Decode Coded Values via the &lt;code&gt;concept&lt;/code&gt; Table&lt;/h3&gt;
&lt;p&gt;Coded fields across all OMOP tables store numeric IDs. To translate them, join to the &lt;code&gt;concept&lt;/code&gt; table. The query below decodes the systolic blood pressure measurements:&lt;/p&gt;
&lt;pre&gt;&lt;code class=&quot;language-python&quot;&gt;sbp_query = f&amp;#39;&amp;#39;&amp;#39;SELECT
  m.person_id,
  mc.concept_name AS measurement_name,
  mtc.concept_name AS measurement_type_name,
  m.value_as_number,
ROW_NUMBER() OVER (
    PARTITION BY m.person_id, m.measurement_concept_id
    ORDER BY m.measurement_date DESC, m.measurement_datetime DESC, m.measurement_id DESC
  ) AS recency_rank
FROM
  `{CDR}.measurement` m
LEFT JOIN `{CDR}.concept` mc ON mc.concept_id = m.measurement_concept_id
LEFT JOIN `{CDR}.concept` mtc ON mtc.concept_id = m.measurement_type_concept_id
WHERE measurement_concept_id = 3004249&amp;#39;&amp;#39;&amp;#39;
pd.io.gbq.read_gbq(sbp_query, dialect=&amp;#39;standard&amp;#39;)
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;This JOIN pattern repeats throughout OMOP and works the same way for any coded field across any clinical table, including the &lt;code&gt;person&lt;/code&gt; table. The query below decodes gender values:&lt;/p&gt;
&lt;pre&gt;&lt;code class=&quot;language-python&quot;&gt;gender_query = f&amp;#39;&amp;#39;&amp;#39;
  SELECT
    person.person_id,
    pgc.concept_name as gender,
    person.gender_source_value,
    pgsc.concept_name as gender_source_concept_name
  FROM `{CDR}.person` person
  LEFT JOIN `{CDR}.concept` pgc ON pgc.concept_id = person.gender_concept_id
  LEFT JOIN `{CDR}.concept` pgsc ON pgsc.concept_id = person.gender_source_concept_id
&amp;#39;&amp;#39;&amp;#39;
gender_df = pd.io.gbq.read_gbq(gender_query, dialect=&amp;#39;standard&amp;#39;)
gender_df
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;The &lt;code&gt;gender_source_value&lt;/code&gt; column is the shortcut that feels right but isn&amp;#39;t. It contains raw, unstandardized values from the original data source, think of it as the &amp;quot;close enough&amp;quot; column. For any real analysis, always use the decoded &lt;code&gt;gender&lt;/code&gt; column obtained through the &lt;code&gt;concept&lt;/code&gt; join. Keep &lt;code&gt;gender_source_value&lt;/code&gt; around for reference, but DO NOT let it anywhere near your results.&lt;/p&gt;
&lt;h3&gt;Merge Everything into a Cohort Table&lt;/h3&gt;
&lt;p&gt;Each query produces its own dataframe. Merge them on &lt;code&gt;person_id&lt;/code&gt; to build your cohort:&lt;/p&gt;
&lt;pre&gt;&lt;code class=&quot;language-python&quot;&gt;cohort_df = pd.merge(
  gender_df,
  sbp_df.sort_values(&amp;#39;recency_rank&amp;#39;).drop_duplicates(&amp;#39;person_id&amp;#39;),
  on=&amp;#39;person_id&amp;#39;,
  how=&amp;#39;left&amp;#39;
)
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;This pattern scales to any study: find your concept IDs (&lt;a href=&quot;../05-aou-omop&quot;&gt;Part I&lt;/a&gt;), query the relevant OMOP tables, join to &lt;code&gt;concept&lt;/code&gt; for readable labels, and merge the resulting dataframes in pandas.&lt;/p&gt;
&lt;p&gt;The pandas approach above is readable and easy to follow, but it loads each table into memory separately before joining. For larger cohorts, it&amp;#39;s more efficient to do the join entirely in SQL so BigQuery returns only the final result:&lt;/p&gt;
&lt;pre&gt;&lt;code class=&quot;language-python&quot;&gt;cohort_query = f&amp;#39;&amp;#39;&amp;#39;
-- Pull one row per participant from the person table
SELECT
  p.person_id,
  pgc.concept_name AS gender,
  p.gender_source_value AS raw_gender,
  m.value_as_number AS sbp,
  mtc.concept_name AS sbp_source
FROM `{CDR}.person` p

-- Decode gender_concept_id to a human-readable label
LEFT JOIN `{CDR}.concept` pgc ON pgc.concept_id = p.gender_concept_id

-- Get the most recent SBP measurement per person
LEFT JOIN (
  SELECT
    person_id,
    value_as_number,
    measurement_concept_id,
    measurement_type_concept_id,
    ROW_NUMBER() OVER (
      PARTITION BY person_id, measurement_concept_id
      ORDER BY measurement_date DESC, measurement_datetime DESC, measurement_id DESC
    ) AS recency_rank  -- 1 = most recent
  FROM `{CDR}.measurement`
  WHERE measurement_concept_id = 3004249  -- systolic blood pressure
) m ON m.person_id = p.person_id AND m.recency_rank = 1

-- Decode measurement_concept_id and measurement_type_concept_id
LEFT JOIN `{CDR}.concept` mc ON mc.concept_id = m.measurement_concept_id
LEFT JOIN `{CDR}.concept` mtc ON mtc.concept_id = m.measurement_type_concept_id
&amp;#39;&amp;#39;&amp;#39;
cohort_df = pd.io.gbq.read_gbq(cohort_query, dialect=&amp;#39;standard&amp;#39;)
cohort_df
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;In the above example, we are only exporting one measurement from the clinical table &lt;code&gt;measurement&lt;/code&gt;. However, you will likely need to export multiple data fields from a single clinical table. Unfortunately, the query grows linearly with the number of measurements. A UNION_ALL approach followed by a pivot will make your query a lot cleaner. The query below exports a cohort table with gender, systolic blood pressure, and BMI (Concept ID:3038553).&lt;/p&gt;
&lt;pre&gt;&lt;code class=&quot;language-{python}&quot;&gt;cohort_query = f&amp;#39;&amp;#39;&amp;#39;
WITH measurements AS (
  -- Stack all measurements into one table, keeping only the most recent per person
  SELECT
    m.person_id,
    m.measurement_concept_id,
    m.value_as_number,
    mtc.concept_name AS measurement_source,
    ROW_NUMBER() OVER (
      PARTITION BY m.person_id, m.measurement_concept_id
      ORDER BY m.measurement_date DESC, m.measurement_datetime DESC, m.measurement_id DESC
    ) AS recency_rank
  FROM `{CDR}.measurement` m
  -- Decode measurement_type_concept_id to a human-readable label
  LEFT JOIN `{CDR}.concept` mtc ON mtc.concept_id = m.measurement_type_concept_id
  WHERE m.measurement_concept_id IN (
    3004249,  -- systolic blood pressure
    3038553   -- BMI
  )
)
-- Pull one row per participant from the person table
SELECT
  p.person_id,
  pgc.concept_name AS gender,
  p.gender_source_value AS raw_gender,
  -- Pivot each measurement into its own column
  MAX(CASE WHEN measurement_concept_id = 3004249 THEN value_as_number END) AS sbp,
  MAX(CASE WHEN measurement_concept_id = 3038553 THEN value_as_number END) AS bmi,
  MAX(CASE WHEN measurement_concept_id = 3004249 THEN measurement_source END) AS sbp_source,
  MAX(CASE WHEN measurement_concept_id = 3038553 THEN measurement_source END) AS bmi_source
FROM `{CDR}.person` p

-- Decode gender_concept_id to a human-readable label
LEFT JOIN `{CDR}.concept` pgc ON pgc.concept_id = p.gender_concept_id

-- Join the pivoted measurements
LEFT JOIN measurements m ON m.person_id = p.person_id AND m.recency_rank = 1
GROUP BY p.person_id, pgc.concept_name, p.gender_source_value
&amp;#39;&amp;#39;&amp;#39;
cohort_df = pd.io.gbq.read_gbq(cohort_query, dialect=&amp;#39;standard&amp;#39;)
cohort_df
&lt;/code&gt;&lt;/pre&gt;
&lt;h2&gt;Cohort and Dataset Builders&lt;/h2&gt;
&lt;figure class=&quot;my-8 !max-w-none&quot;&gt;
&lt;img src=&quot;https://boston-wib.org//blog_images/biobank1/aou_observation_dataset_tools.png&quot; alt=&quot;Three illustrated scenes representing All of Us data tools: a meal kit for Cohort and Dataset Builders, a stocked pantry for OMOP SQL access, and a vegetable garden for CDR directory files.&quot; class=&quot;!max-w-none mx-auto w-full&quot; &gt;
&lt;figcaption class=&quot;text-center text-sm opacity-80 mt-2&quot;&gt;
   &lt;em&gt;Tools for building datasets in All of Us. Cohort and dataset builders are easy-to-use tools but they are limited similar to a meal kit box. OMOP CDM SQL queries are flexible but more complex, like cooking from a fully stocked pantry. The CDR and other external data sources expand available data but are separate, like ingredients from an accessible garden. Image generated by ChatGPT and text modified by the author.&lt;/em&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;If you just scrolled past three SQL blocks and felt your eyes glaze over, good news: there&amp;#39;s a meal kit version. The AoU &lt;a href=&quot;https://www.researchallofus.org/data-tools/workbench/&quot;&gt;Cohort and Dataset Builders&lt;/a&gt; are point-and-click tools that work in two steps: define your participant criteria (the Cohort Builder), then build your data features (the Dataset Builder). Easy to use, but limited — like a meal kit, you&amp;#39;re working with what&amp;#39;s in the box. I didn&amp;#39;t find them flexible enough for my project, but they&amp;#39;re great for exploration. And since they run SQL under the hood, you can click &amp;quot;Analyze&amp;quot; → &amp;quot;See Code Preview&amp;quot; to peek at the BigQuery query they generated. It&amp;#39;s a handy starting point if you want to adapt it yourself.&lt;/p&gt;
&lt;h2&gt;Beyond OMOP-CDM: CDR Directory Files&lt;/h2&gt;
&lt;p&gt;Beyond observational data, note that some pre-computed data in the Researcher Workbench lives outside of OMOP-CDM data tables. If you have access to the &lt;a href=&quot;https://support.researchallofus.org/hc/en-us/articles/29475233432212-Controlled-CDR-Directory&quot;&gt;Controlled CDR directory&lt;/a&gt;, you&amp;#39;ll find pre-computed genomic data (e.g., precomputed genetic ancestry, admixture estimates, relatedness, etc.) are made readily available in tab-delimited tables. These genetically-derived estimates can complement or replace the self-reported race and ethnicity values in the BigQuery tables.&lt;/p&gt;
&lt;h2&gt;Summary&lt;/h2&gt;
&lt;p&gt;That&amp;#39;s the full pipeline: find your concept IDs, query the right clinical table, label your coded values, and merge everything into a cohort. It takes more joins than UKB, but the logic is consistent once you&amp;#39;ve done it once. The next post moves from phenotype data into genotype data, where the complexity shifts from data models to file formats.&lt;/p&gt;
</content:encoded><category>biobank</category><category>all-of-us</category><category>omop</category><category>athena</category><category>ehr-data</category><category>observational-data</category></item><item><title>Biobank Intro Series: All of Us Observational Data (Part I)</title><link>https://boston-wib.org/blog/biobank-intro-series/05-aou-observational-data-partI</link><guid isPermaLink="true">https://boston-wib.org/blog/biobank-intro-series/05-aou-observational-data-partI</guid><pubDate>Tue, 24 Mar 2026 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/biobank1/source_to_standard_omop_square.png&quot; alt=&quot;Flowchart showing hospital data being extracted, transformed via OHDSI vocabularies, and loaded into OMOP relational tables.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Finding concept IDs for the All of Us Researcher Workbench&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;All of Us has returned to the chat (she made her first appearance in the &lt;a href=&quot;../02-hardwareOnUKBandAoU&quot;&gt;hardware post&lt;/a&gt;) and she&amp;#39;s bringing noble intentions and absolutely feral data provenance. UK Biobank is a well-organized remarkable resource, but with &amp;gt;93% European ancestry, it was never designed to represent global genetic diversity. On the other hand, All of Us was explicitly built to oversample underrepresented populations. &lt;a href=&quot;https://www.researchallofus.org/data-tools/data-snapshots/&quot;&gt;With almost 50% of participants identifying as a racial or ethnic minority&lt;/a&gt;, &amp;quot;All of Us&amp;quot; isn&amp;#39;t just a pun. It&amp;#39;s a mission statement. (I wrote more about why that matters &lt;a href=&quot;https://boston-wib.org/blog/deepdive/deigenomics&quot;&gt;here&lt;/a&gt;).&lt;/p&gt;
&lt;p&gt;That ambition comes with a data architecture to match. UK Biobank invited ~500,000 people to assessment centers, ran standardized measurements, and stored the results in a single curated dataset with clean field IDs. Even its hospital records flow from a single source: the UK&amp;#39;s publicly funded National Health Service (NHS). All of Us is working with a different reality entirely. It aggregates EHR records from 50+ independent US health systems, enrollment surveys, wearables data, and biosamples, and standardizes all of it after the fact using the OMOP-CDM (&lt;a href=&quot;https://ohdsi.github.io/CommonDataModel/cdm53.html&quot;&gt;Observational Medical Outcomes Partnership Common Data Model&lt;/a&gt;) developed by the Observational Health Data Sciences and Informatics (OHDSI) community. What gets recorded in any given participant&amp;#39;s file follows whatever combination of ICD, LOINC, CPT, or SNOMED their health system happened to use.&lt;/p&gt;
&lt;p&gt;This is why the workflow feels different. Instead of searching a Showcase and grabbing a field ID, you&amp;#39;re navigating concept IDs, vocabulary mappings, and relational tables: the machinery the OMOP-CDM uses to impose order on that heterogeneity. Concept IDs are the key to unlocking most of what&amp;#39;s available, so let&amp;#39;s start there.&lt;/p&gt;
&lt;figure class=&quot;my-8 !max-w-none&quot;&gt;
&lt;img src=&quot;https://boston-wib.org//blog_images/biobank1/source_to_standard_omop.png&quot; alt=&quot;Flowchart showing the process of transforming hospital EHR data into OMOP format.&quot; class=&quot;block dark:hidden !max-w-none mx-auto w-full&quot; &gt;
&lt;img src=&quot;https://boston-wib.org//blog_images/biobank1/source_to_standard_omop_dark.png&quot; alt=&quot;Flowchart showing the process of transforming hospital EHR data into OMOP format.&quot; class=&quot;hidden dark:block !max-w-none mx-auto w-full&quot;&gt;
&lt;figcaption class=&quot;text-center text-sm opacity-80 mt-2&quot;&gt;
   &lt;em&gt;Raw hospital data is transformed into standardized OHDSI vocabularies and loaded into OMOP-CDM relational tables.&lt;/em&gt;
 &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;h2&gt;Finding Concept IDs: Three Approaches&lt;/h2&gt;
&lt;p&gt;Similar to Field IDs, a concept ID is just a number until you know what it maps to. These three tools each give you a different way to look that up, depending on how much context you need.&lt;/p&gt;
&lt;h3&gt;1. All of Us Data Browser (fastest for quick lookups)&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;&lt;a href=&quot;https://databrowser.researchallofus.org&quot;&gt;All of Us Data Browser&lt;/a&gt;&lt;/strong&gt; shows what concepts exist and how often they&amp;#39;re used. Search &amp;quot;systolic blood pressure&amp;quot; and you&amp;#39;ll see results under &amp;quot;Conditions&amp;quot; and &amp;quot;Labs &amp;amp; Measurements&amp;quot; - it can be recorded multiple ways.&lt;/p&gt;
&lt;p&gt;Click a category to see participant counts and value distributions. For example, 225,000+ participants have concept ID 3004249 (Systolic blood pressure) with source code LOINC 8480-6.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;When to use:&lt;/strong&gt; Quick lookups, checking data availability, getting participant counts.&lt;/p&gt;
&lt;h3&gt;2. OHDSI Athena Vocabulary (richer metadata)&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;&lt;a href=&quot;https://athena.ohdsi.org/&quot;&gt;OHDSI Athena&lt;/a&gt;&lt;/strong&gt; provides concept context such as hierarchies, standard vs. non-standard mappings, and relationships. Searching for &amp;quot;systolic blood pressure&amp;quot; returns about 78,865 items at the time of writing. However, if you look up OMOP-CDM concept ID 3004249 (which you found with the All of Us Data Browser), you&amp;#39;ll see its vocabulary source (LOINC), domain (Measurement), and related concepts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;When to use:&lt;/strong&gt; Detailed concept information, exploring relationships, understanding vocabulary mappings.&lt;/p&gt;
&lt;h3&gt;3. Direct SQL queries (most flexible)&lt;/h3&gt;
&lt;p&gt;Query the &lt;code&gt;CONCEPT&lt;/code&gt; table directly to translate medical concepts into IDs:&lt;/p&gt;
&lt;pre&gt;&lt;code class=&quot;language-python&quot;&gt;import pandas as pd
import os

# Use environmental variable for the Controlled Tier
CDR = os.environ[&amp;#39;WORKSPACE_CDR&amp;#39;]

# Search by name
concept_query = f&amp;#39;&amp;#39;&amp;#39;SELECT *
    FROM `{CDR}.concept`
    WHERE LOWER(concept_name) LIKE &amp;#39;%systolic blood pressure%&amp;#39;
    &amp;#39;&amp;#39;&amp;#39;
sbp_query = pd.io.gbq.read_gbq(concept_query, dialect=&amp;#39;standard&amp;#39;)

# Or query a known concept ID directly
concept_id_query = pd.io.gbq.read_gbq(f&amp;#39;SELECT * FROM `{CDR}.concept` WHERE concept_id = 3004249&amp;#39;)
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Use &lt;code&gt;CONCEPT_RELATIONSHIP&lt;/code&gt; to explore how concepts relate - for example, which ICD-10 codes map to SNOMED concept 3004249.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;When to use:&lt;/strong&gt; Complex relationship queries, programmatic vocabulary exploration.&lt;/p&gt;
&lt;h2&gt;What&amp;#39;s Next&lt;/h2&gt;
&lt;p&gt;Finding concept IDs is the first step, but you need to navigate the relational tables to actually pull observational data. In Part II, we&amp;#39;ll get into the messier, more interesting part.&lt;/p&gt;
</content:encoded><category>biobank</category><category>all-of-us</category><category>omop</category><category>athena</category><category>ehr-data</category><category>observational-data</category></item><item><title>A Coffee with CompBio: Claude Code for Bioinformatics</title><link>https://boston-wib.org/blog/coffeewithcompbio/s2-e3</link><guid isPermaLink="true">https://boston-wib.org/blog/coffeewithcompbio/s2-e3</guid><pubDate>Mon, 23 Mar 2026 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/coffeeWithCompBio-logo2.png&quot; alt=&quot;Coffee with CompBio Podcast Logo: Four painted women under the podcast title&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Can Claude Code handle real bioinformatics? Saba and Amulya put it to the test.&lt;/em&gt;&lt;/p&gt;

&lt;iframe
  data-testid=&quot;embed-iframe&quot;
  style=&quot;border-radius:12px&quot;
  src=&quot;https://open.spotify.com/embed/episode/443kSvnJfU8Bubq5r3MnDR?utm_source=generator&quot;
  width=&quot;100%&quot;
  height=&quot;352&quot;
  frameBorder=&quot;0&quot;
  allowfullscreen=&quot;&quot;
  allow=&quot;autoplay; clipboard-write; encrypted-media; fullscreen; picture-in-picture&quot;
  loading=&quot;lazy&quot;
&gt;&lt;/iframe&gt;

&lt;p&gt;Claude code has taken over the tech world like a storm. But how good is Claude code when it comes to Bioinformatics and Computational Biology? In this episode of A Coffee with Compbio, Saba and Amulya take on the task of testing claude code on a simple task: performing QC on a Peripheral Blood Mononuclear Cells (PBMC) single cell dataset. They find out the dos and don’t with a tool like claude code while comparing it to how a real bioinformatician would analyze these results. Tune into this episode to learn more about Claude code for Bioinformatics.&lt;/p&gt;
&lt;p&gt;Bonus - We share all the results, markdown files and the single cell RNA QC skills markdown file that you can try on your own!&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Links from the Episode&lt;/strong&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;a href=&quot;https://github.com/Boston-area-Women-in-Bioinformatics/A_coffee_with_compbio_resources/tree/main&quot;&gt;All files related to this episode&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;a href=&quot;https://huggingface.co/spaces/Ashastry2/journalMatch&quot;&gt;JournalMatch&lt;/a&gt;: A vibe-coded tool for finding the right match for your publication by Amulya&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Listen On&lt;/strong&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaApple size={32} className=&quot;text-black dark:text-white&quot; /&gt;{&amp;#39; &amp;#39;}&lt;/p&gt;
  &lt;a href=&quot;https://podcasts.apple.com/us/podcast/claude-code-for-bioinformatics/id1817024741?i=1000756823979&quot;&gt;
    Apple
  &lt;/a&gt;
&lt;/span&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaSpotify size={32} color=&quot;#1DB954&quot; /&gt;{&amp;#39; &amp;#39;}
  &lt;a href=&quot;https://open.spotify.com/episode/443kSvnJfU8Bubq5r3MnDR&quot;&gt;Spotify&lt;/a&gt;&lt;/p&gt;
&lt;/span&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaBell size={32} className=&quot;text-yellow-600&quot; /&gt; &lt;a href=&quot;https://podcast.boston-wib.org/feed.xml&quot;&gt;RSS Feed&lt;/a&gt;&lt;/p&gt;
&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;More on this Episode&lt;/strong&gt;:&lt;/p&gt;
&lt;p&gt;Send us your comments, questions, and suggestions using &lt;a href=&quot;https://forms.gle/ncwo6HZeN4uA9gPg7&quot;&gt;this form&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Thanks &lt;a href=&quot;https://www.linkedin.com/in/amulya-shastry/&quot;&gt;Amulya Shastry&lt;/a&gt; for editing and management support and &lt;a href=&quot;https://www.linkedin.com/in/dinaissakova/&quot;&gt;Dina Issakova&lt;/a&gt; for the cover art and social media support.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Support the Podcast&lt;/strong&gt;:&lt;/p&gt;
&lt;p&gt;We are looking for sponsors! Please get in touch if you or your business would like to help support this podcast.&lt;/p&gt;
&lt;p&gt;Follow &lt;a href=&quot;https://www.linkedin.com/in/sharvarinarendra/&quot;&gt;Sharvari&lt;/a&gt; and &lt;a href=&quot;https://www.linkedin.com/in/saba-nafees/&quot;&gt;Saba&lt;/a&gt; on LinkedIn!&lt;/p&gt;
&lt;p&gt;If you enjoyed the episode, &lt;a href=&quot;https://podcast.ausha.co/a-coffee-with-compbio&quot;&gt;please subscribe and leave a review!&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Hosted by Ausha. See &lt;a href=&quot;https://ausha.co/privacy-policy&quot;&gt;ausha.co/privacy-policy&lt;/a&gt; for more information&lt;/p&gt;
</content:encoded><category>science-communication</category><category>computational-biology</category><category>professional-development</category><category>bioinformatics</category><category>networking</category><category>claude-code</category><category>single-cell-rna</category></item><item><title>Biobank Intro Series: UK Biobank Observational Data (Part II)</title><link>https://boston-wib.org/blog/biobank-intro-series/04-ukb-observational-data-partII</link><guid isPermaLink="true">https://boston-wib.org/blog/biobank-intro-series/04-ukb-observational-data-partII</guid><pubDate>Mon, 16 Mar 2026 05:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/biobank1/ukb_fieldid_scent.png&quot; alt=&quot;Cat following wafts of fresh kibble onto a table. The scent trails are labeled with UK Biobank Field IDs.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Loading phenotype data in the UK Biobank RAP (Research Analysis Platform) environment&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;When you get approved for a UK Biobank project you are gifted a VIP pass to a secure data wonderland called the UK Biobank Research Analysis Platform (UKB RAP). The UKB RAP is a cloud-based venue (built on DNAnexus infrastructure) where you can spin up coding environments (JupyterLab, RStudio, take your pick) and analyze data without the nightmare of downloading 500,000+ participant records to your poor laptop.&lt;/p&gt;
&lt;p&gt;I enjoy working in JupyterLab, but the concepts transfer regardless of your preferred environment. Opening JupyterLab on UKB RAP is straightforward: click &amp;quot;Tools&amp;quot; in the navigation menu and hit the teal &amp;quot;+ New JupyterLab&amp;quot; button. This opens a setup GUI where you can configure your compute specs. For most analyses, the defaults work just fine (see my tips on hardware in a &lt;a href=&quot;../02-hardwareonukbandaou&quot;&gt;previous post&lt;/a&gt;).&lt;/p&gt;
&lt;p&gt;In this post, we&amp;#39;ll focus on the phenotype data. Here I define phenotype data as essentially anything that is not genetic data. This includes not just questionnaire responses, physical measurements, and hospital records, but proteomics data as well.&lt;/p&gt;
&lt;p&gt;To query the data, you need a fully qualified dataset reference combining your project ID (your workspace on RAP) and your dispensed dataset ID (the UK Biobank data object provisioned to that project). Below I have cheatcodes for finding these values in python and BASH.&lt;/p&gt;
&lt;p&gt;python:&lt;/p&gt;
&lt;pre&gt;&lt;code class=&quot;language-{python}&quot;&gt;import dxpy
import glob

# Get your dataset identifier
dispensed_dataset_id = dxpy.find_one_data_object(
    typename=&amp;#39;Dataset&amp;#39;,
    name=&amp;#39;app*.dataset&amp;#39;,
    folder=&amp;#39;/&amp;#39;,
    name_mode=&amp;#39;glob&amp;#39;
)[&amp;#39;id&amp;#39;]

project_id = dxpy.find_one_project()[&amp;quot;id&amp;quot;]
dataset = f&amp;quot;{project_id}:{dispensed_dataset_id}&amp;quot;
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;bash:&lt;/p&gt;
&lt;pre&gt;&lt;code class=&quot;language-{bash}&quot;&gt;# Get project ID
project_id=$(dx env --json | jq -r &amp;#39;.project&amp;#39;)

# Get dispensed dataset ID
dispensed_dataset_id=$(dx find data --name &amp;quot;app*.dataset&amp;quot; --brief)

# Combine them
dataset=&amp;quot;${project_id}:${dispensed_dataset_id}&amp;quot;
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;With that in hand, let&amp;#39;s get some data.&lt;/p&gt;
&lt;h2&gt;Step 1: Find the field names for your data&lt;/h2&gt;
&lt;figure class=&quot;my-8 !max-w-none&quot;&gt;
&lt;img src=&quot;https://boston-wib.org//blog_images/biobank1/notebook_v_commandline.png&quot; alt=&quot;Cat peering over a table in delight at two plates of kibble: one labeled &apos;Jupyter Notebook&apos; and the other labeled &apos;Command Line&apos;.&quot; /&gt;
&lt;figcaption class=&quot;text-center text-sm opacity-80 mt-2&quot;&gt;
   &lt;em&gt;Two bowls, same feast: whether you import via Jupyter or the command line, you’re still getting the data you want.&lt;/em&gt;
&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;With a list of field IDs you gathered from the UKB Showcase, your next step is to figure out their exact field names on the RAP (which aren&amp;#39;t always identical to what the Showcase shows), and then extract the actual participant data for those fields. There are two different methods for this.&lt;/p&gt;
&lt;h3&gt;Quick lookup via terminal&lt;/h3&gt;
&lt;p&gt;When I&amp;#39;m in &amp;quot;just get it working&amp;quot; mode (which, let&amp;#39;s be honest, is most of research), I found that the command-line approach is faster for quick lookups. I simply list all the field names in the terminal and grep for the ones I need.&lt;/p&gt;
&lt;pre&gt;&lt;code class=&quot;language-bash&quot;&gt;dx extract_dataset ${dataset}  --entities participant --list-fields | grep &amp;quot;22420&amp;quot;
&lt;/code&gt;&lt;/pre&gt;
&lt;h3&gt;Dictionary approach&lt;/h3&gt;
&lt;p&gt;The UKB RAP documentation will steer you toward extracting dictionary files (&lt;code&gt;*.dataset.data_dictionary.csv &lt;/code&gt;), which map field IDs to field names, describe data coding schemes, and generally serve as the Rosetta Stone for the dataset. This approach is considered more &amp;quot;proper&amp;quot;, and for good reason: it&amp;#39;s reproducible, documentable, and plays nicely with notebooks.&lt;/p&gt;
&lt;p&gt;The dictionary approach requires more setup: extracting CSVs, loading them into pandas, and writing filter logic. When you&amp;#39;re exploring or need a quick answer, I recommend the quick and dirty command-line approach. That said, there are times when the dictionaries are nice to have. They make your code easier to comprehend and also are useful for making sense of these data tables after extraction.&lt;/p&gt;
&lt;h2&gt;Step 2: Extract the dataset&lt;/h2&gt;
&lt;p&gt;To extract the actual dataset values, use DNAnexus&amp;#39; &lt;code&gt;extract_dataset&lt;/code&gt; command with the &lt;code&gt;--fields&lt;/code&gt; flag set to the relevant field names:&lt;/p&gt;
&lt;pre&gt;&lt;code class=&quot;language-bash&quot;&gt;dx extract_dataset &amp;lt;project_id&amp;gt;:&amp;lt;dispensed_dataset_id&amp;gt; \
  --fields participant.eid,participant.p22420_i2,participant.p22420_i3 \
  --delimiter &amp;quot;,&amp;quot; \
  --output lvef_pheno.csv
&lt;/code&gt;&lt;/pre&gt;
&lt;h2&gt;Step 3: Translate the dataset values&lt;/h2&gt;
&lt;p&gt;The data you just extracted is often coded, meaning the raw values or numbers are not immediately interpretable.. Therefore, to make human analysis easier, it is sometimes helpful to translate these code to their definition.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Example: Filtering cardiomyopathy diagnoses from ICD10 codes&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Working on a project to understand the genetic architecture of cardiomyopathy, I needed to identify participants with cardiomyopathy diagnoses. The Showcase told me International Classification of Disease version 10 (ICD-10) diagnosis codes were Field ID 41270.&lt;/p&gt;
&lt;p&gt;I extracted the field names easily enough with &lt;code&gt;dx&lt;/code&gt; commands. However, the raw data returned value codes like &amp;quot;I42&amp;quot; or &amp;quot;I420&amp;quot;, meaningless without context.&lt;/p&gt;
&lt;p&gt;Fortunately, the UKB Showcase maintains various data coding tables for each of their coded data fields. Specifically ICD-10 diagnosis codes are given coding 19 in UKB. Similarly ICD9 codes are found in coding 87.&lt;/p&gt;
&lt;p&gt;You could also extract all the coding dictionaries once and have them ready as searchable dataframes. For example,&lt;/p&gt;
&lt;pre&gt;&lt;code class=&quot;language-python&quot;&gt;import subprocess
import pandas as pd

# Extract dictionaries once with -ddd flag
cmd = [&amp;quot;dx&amp;quot;, &amp;quot;extract_dataset&amp;quot;, dataset, &amp;quot;-ddd&amp;quot;, &amp;quot;--delimiter&amp;quot;, &amp;quot;,&amp;quot;]
subprocess.check_call(cmd)

# Load the codings dictionary
codings_df = pd.read_csv(glob.glob(&amp;quot;*.codings.csv&amp;quot;)[0])

# Find which ICD10 codes mean &amp;quot;cardiomyopathy&amp;quot;
icd10_coding = codings_df[codings_df[&amp;#39;coding_name&amp;#39;] == &amp;quot;data_coding_19&amp;quot;]
cardiomyopathy_codes = icd10_coding[
    icd10_coding[&amp;#39;meaning&amp;#39;].str.contains(&amp;#39;cardiomyopathy&amp;#39;, case=False)
][[&amp;quot;meaning&amp;quot;, &amp;quot;code&amp;quot;]]
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Now you can filter your extracted data to only participants with those specific codes, without tab-switching back to the Showcase every time you need to decode something.&lt;/p&gt;
&lt;h2&gt;The Gotchas Nobody Tells You&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Authentication weirdness:&lt;/strong&gt; Sometimes &lt;code&gt;dx&lt;/code&gt; commands work fine in terminal but throw mysterious errors when called through &lt;code&gt;subprocess&lt;/code&gt; in Jupyter. I&amp;#39;ve never pinned down exactly why. It possible that the Jupyter notebook does not inherit the same environment variables that your interactive terminal session has. Either way, when you hit this, just run the command in terminal instead.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Spark is not optional for big extractions:&lt;/strong&gt; If you&amp;#39;re pulling more than ~30 fields, you&amp;#39;ll need a Spark cluster. The &lt;a href=&quot;https://dnanexus.gitbook.io/uk-biobank-rap/working-on-the-research-analysis-platform/accessing-data/accessing-phenotypic-data&quot;&gt;UKB RAP documentation&lt;/a&gt; covers this, but fair warning: Spark uses lazy evaluation, which means errors can show up way downstream from where they actually originated. Fun times.&lt;/p&gt;
&lt;h2&gt;Wrapping Up&lt;/h2&gt;
&lt;p&gt;That&amp;#39;s the full pipeline: find your field names, extract the data, and decode any coded values. Once you&amp;#39;ve done it a couple of times, the whole process takes minutes.&lt;/p&gt;
&lt;p&gt;In the next post, we&amp;#39;ll look at how to do the same thing on the All of Us Researcher Workbench.&lt;/p&gt;
</content:encoded><category>biobank</category><category>sql</category><category>ukb-rap</category><category>ehr-data</category><category>phenotype-data</category></item><item><title>Biobank Intro Series: UK Biobank Observational Data (Part I)</title><link>https://boston-wib.org/blog/biobank-intro-series/03-ukb-observational-data-partI</link><guid isPermaLink="true">https://boston-wib.org/blog/biobank-intro-series/03-ukb-observational-data-partI</guid><pubDate>Tue, 10 Mar 2026 05:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/biobank1/ukbShowcaseGraphic.png&quot; alt=&quot;Papers tell you WHAT was used. Showcase tells you what&apos;s AVAILABLE NOW.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;An ode to the UK Biobank Showcase&lt;/em&gt;&lt;/p&gt;

&lt;figcaption class=&quot;text-center text-sm opacity-80 mt-2&quot;&gt;
   &lt;em&gt;The showcase returns at least three different field IDs for BMI. It is difficult to find this information in any publication.&lt;/em&gt;
&lt;/figcaption&gt;

&lt;p&gt;Consider this post a love letter to the UK Biobank Showcase.&lt;/p&gt;
&lt;p&gt;When I first started working with UK Biobank, I fell back on what I knew from graduate school. I dug through methods sections and supplemental materials to track down the features used in the study. It worked, but it was slow. Part of the problem is that papers rarely cite &lt;a href=&quot;https://biobank.ndph.ox.ac.uk/showcase/&quot;&gt;The UK Biobank Showcase&lt;/a&gt; directly. It is so foundational to the field that experienced researchers treat it as assumed knowledge. Coming from a different domain, I had no idea it existed. Once my manager pointed me to the Showcase, I discovered measurements beyond what had been published and no longer had to spend hours on detective work.&lt;/p&gt;
&lt;p&gt;To understand why the showcase is so useful, it helps to know the scale of what UK Biobank actually contains. The UK Biobank as a resource spans clinical measurements, survey data, genomics, imaging, proteomics, metabolomics, and more. For a comprehensive overview of all available data types, see &lt;a href=&quot;https://community.ukbiobank.ac.uk/hc/en-gb/articles/23472796568861-What-types-of-data-are-available-in-UK-Biobank&quot;&gt;What types of data are available in UK Biobank?&lt;/a&gt; on the UK Biobank Community site. The Showcase is your tool for navigating all of it.&lt;/p&gt;
&lt;h2&gt;Reading Between the Data Fields: Arrays, Instances, and Codes&lt;/h2&gt;
&lt;p&gt;The Showcase doesn&amp;#39;t just catalog measurements. It lovingly documents the shape of the data itself. On the main page for each field, the &lt;strong&gt;Data&lt;/strong&gt; tab provides key details about coding, instances, and array indices that will save you real headaches downstream.&lt;/p&gt;
&lt;h3&gt;Coding&lt;/h3&gt;
&lt;p&gt;Coding is how categorical responses are stored. Rather than storing &amp;quot;Yes&amp;quot; or &amp;quot;No&amp;quot;, many fields store numeric codes: &lt;code&gt;1&lt;/code&gt; for &amp;quot;Yes&amp;quot;, &lt;code&gt;0&lt;/code&gt; for &amp;quot;No&amp;quot;, and &lt;code&gt;-3&lt;/code&gt; for &amp;quot;Prefer not to answer&amp;quot;. The Showcase provides a data coding table for each such field. More on working with complex codes in the next post.&lt;/p&gt;
&lt;h3&gt;Instances&lt;/h3&gt;
&lt;p&gt;Instances are timepoints. For example, if a measurement reports using instancing type 2, it will report measurements collected at four visits:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;p&gt;instance &lt;code&gt;0&lt;/code&gt;: the initial assessment&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;instance &lt;code&gt;1&lt;/code&gt;: the first repeat visit&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;instance &lt;code&gt;2&lt;/code&gt;: the imaging visit&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;instance &lt;code&gt;3&lt;/code&gt;: the first repeat imaging visit&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;For most phenotypes, Instance &lt;code&gt;0&lt;/code&gt; has the largest sample size. If you need longitudinal data, expect much smaller numbers at later instances.&lt;/p&gt;
&lt;h3&gt;Arrays&lt;/h3&gt;
&lt;p&gt;Arrays are repeated measurements within a single visit. Diastolic and systolic blood pressure (&lt;code&gt;4079&lt;/code&gt;, &lt;code&gt;4080&lt;/code&gt;), for example, are taken twice in one sitting. Each repeat is stored as a separate array index (&lt;code&gt;0&lt;/code&gt;, &lt;code&gt;1&lt;/code&gt;). The Showcase tells you how many array values a field has so you can plan how to handle them.&lt;/p&gt;
&lt;p&gt;The Data tab gives you the architecture of a field: how its values are structured, repeated, and encoded. What it does not tell you is whether those values are reliable, comparable, or the best available option for your phenotype. For that, you need the category-level context, and the Showcase delivers it.&lt;/p&gt;
&lt;h2&gt;Example: Not All Field IDs Are Created Equal&lt;/h2&gt;
&lt;p&gt;For example, searching &amp;quot;Left Ventricular Ejection Fraction&amp;quot; returns &lt;a href=&quot;https://biobank.ndph.ox.ac.uk/showcase/search.cgi?wot=0&amp;srch=Left+Ventricular+Ejection+Fraction&amp;yfirst=2000&amp;ylast=2025&quot;&gt;multiple relevant fields&lt;/a&gt; and three with the exact correct description (22420, 24103, and 31060). But which one should you use? This is where the Showcase becomes essential.&lt;/p&gt;
&lt;p&gt;&lt;img src=&quot;https://boston-wib.org//blog_images/biobank1/lvef_search.png&quot; alt=&quot;UK Biobank Showcase search interface showing search results for left ventricular ejection fraction, with three data fields highlighted: 22420 (Left ventricular size and function category), 24103 (Cardiac and aortic function #1 category), and 31060 (Cardiac and aortic function #2 category)&quot;&gt;&lt;/p&gt;
&lt;figcaption class=&quot;text-center text-sm opacity-80 mt-2&quot;&gt;
   UK Biobank Showcase search results showing three LVEF measurement fields in different categories
&lt;/figcaption&gt;

&lt;p&gt;Notice the three highlighted fields measure the same thing but belong to different categories. Clicking into each field reveals why this matters:&lt;/p&gt;
&lt;p&gt;&lt;img src=&quot;https://boston-wib.org//blog_images/biobank1/lvef_fields_long.png&quot; alt=&quot;Three-panel comparison showing data field headers and category descriptions for fields 22420, 24103, and 31060. Field 22420 shows 39,645 participants with a quality warning. Field 24103 shows 80,974 participants with methodology references. Field 31060 shows 4,868 participants with publication citations. Each panel includes category warnings about data quality and compatibility.&quot;&gt;&lt;/p&gt;
&lt;figcaption class=&quot;text-center text-sm opacity-80 mt-2&quot;&gt;
   Comparison of three LVEF fields showing participant counts and category quality warnings
&lt;/figcaption&gt;

&lt;p&gt;Field &lt;code&gt;22420&lt;/code&gt; (&lt;a href=&quot;https://biobank.ndph.ox.ac.uk/showcase/label.cgi?id=133&quot;&gt;category 133&lt;/a&gt;) has 39,649 measurements but includes a warning: &amp;quot;Quality issues may exist in this data. Researchers may wish to consider using data available in Category 157 or Category 162 as an alternative.&amp;quot; Field &lt;code&gt;24103&lt;/code&gt; (&lt;a href=&quot;https://biobank.ndph.ox.ac.uk/showcase/label.cgi?id=157&quot;&gt;category 157&lt;/a&gt;) contains 80,974 measurements and references a published methodology, but warns these fields &amp;quot;should not be considered together&amp;quot; with Category 162 without quality assessment. Field &lt;code&gt;31060&lt;/code&gt; (&lt;a href=&quot;https://biobank.ndph.ox.ac.uk/showcase/label.cgi?id=162&quot;&gt;category 162&lt;/a&gt;) has only 4,868 participants, fewer than the flagged field &lt;code&gt;22420&lt;/code&gt;.&lt;/p&gt;
&lt;p&gt;For my cardiomyopathy work (&lt;a href=&quot;https://www.cell.com/hgg-advances/fulltext/S2666-2477(25)00063-6&quot;&gt;Klasfeld &lt;em&gt;et al&lt;/em&gt; 2025&lt;/a&gt;), I chose field &lt;code&gt;24103&lt;/code&gt; for its sample size and data quality. However, other practical information provided by the showcase includes the date of which the data was reported (Debut) and the distribution of the data (shown in the data tab in the second section of the Field ID entry).&lt;/p&gt;
&lt;h2&gt;UK Biobank Showcase Tips&lt;/h2&gt;
&lt;p&gt;After working with the Showcase on multiple projects, I&amp;#39;ve developed a workflow that catches issues before they become problems. Here&amp;#39;s my list:&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Before selecting a field:&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Always check the category warnings, not just the field description&lt;/li&gt;
&lt;li&gt;Look at the data distribution tab: Is it normally distributed? Heavy missingness? Homogeneous values? Sampling bias?&lt;/li&gt;
&lt;li&gt;Check the total participants to plan your sample size accordingly&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;For reproducibility:&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Note the debut date (when data became available)&lt;/li&gt;
&lt;li&gt;Record the version date (last import/update)&lt;/li&gt;
&lt;li&gt;Check stability rating (how data may change in future releases)&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Watch out:&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Like any great love, the Showcase is not perfect. Sometimes a data field has the status set to &amp;quot;Not available&amp;quot;, meaning it is listed before release. If the release date is listed and it is not set in the future, reach out to UK Biobank support for clarification.&lt;/li&gt;
&lt;li&gt;Sometimes data that appears in the Showcase is missing from your RAP workspace entirely. This can happen if your project is running an outdated version of the UK Biobank data release.&lt;ul&gt;
&lt;li&gt;If you are the project admin, go to the &lt;code&gt;Settings&lt;/code&gt; page of your dispensed project and click &lt;code&gt;Check for Updates&lt;/code&gt; in the UK Biobank section.&lt;/li&gt;
&lt;li&gt;If you are not the admin or the update does not resolve it, reach out to the UK Biobank support team directly. Tell them upfront if you have already searched the community forums. They are genuinely helpful and worth contacting.&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The more time I spend with the Showcase, the more I appreciate what it actually is: not just a catalog, but a guide to making good decisions about your data. For features not covered here, Part III of the &lt;a href=&quot;https://biobank.ndph.ox.ac.uk/showcase/ukb/exinfo/ShowcaseUserGuide.pdf&quot;&gt;Showcase user guide&lt;/a&gt; (page 4) is worth bookmarking. In the next post, we&amp;#39;ll dive into actually loading this data for analysis.&lt;/p&gt;
</content:encoded><category>biobank</category><category>uk-biobank</category><category>ukb-showcase</category><category>phenotype-data</category><category>field-selection</category><category>data-quality</category><category>reproducibility</category></item><item><title>Biobank Intro Series: Hardware Settings</title><link>https://boston-wib.org/blog/biobank-intro-series/02-hardware-settings</link><guid isPermaLink="true">https://boston-wib.org/blog/biobank-intro-series/02-hardware-settings</guid><pubDate>Tue, 03 Mar 2026 05:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/biobank1/commandlineinterfaceBridge.png&quot; alt=&quot;Two islands labeled &quot;Your Workspace&quot; and &quot;Data Storage&quot; are connected by a tiny, rickety wooden footbridge. Crossing the bridge is a terminal carrying a folder.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Hardware setup lessons for UK Biobank Research Analysis Platform and All of Us Workbench&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;In 2026 the most expensive resource isn&amp;#39;t compute time or storage. It&amp;#39;s your time rerunning failed analyses. After months of working across the UK Biobank RAP and All of Us Researcher Workbench, I&amp;#39;ve collected some hard-won lessons about resource management.&lt;/p&gt;
&lt;p&gt;This post is more technical. Consider yourself warned.&lt;/p&gt;
&lt;h2&gt;Tip #1: Each Platform Has Its Own Command-Line Interface (CLI)&lt;/h2&gt;
&lt;figure class=&quot;my-8 !max-w-none&quot;&gt;
&lt;img src=&quot;https://boston-wib.org//blog_images/biobank1/commandlineinterfaceBridge.png&quot; alt=&apos;Two islands labeled &quot;Your Workspace&quot; and &quot;Data Storage&quot; are connected by a tiny, rickety wooden footbridge. Crossing the bridge is a terminal carrying a folder.&apos; class=&quot;!max-w-none mx-auto w-full&quot;&gt;
&lt;figcaption class=&quot;text-center text-sm opacity-80 mt-2&quot;&gt;
   &lt;em&gt;The CLI: a small bridge between two very different worlds.  Image generated by Gemini AI.&lt;/em&gt;
&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Think of your workspace and the platform&amp;#39;s data storage as two separate floating islands. Your code lives on one island. The massive biobank files live on the other. The CLI is the bridge between them, and each platform has its own.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;UK Biobank RAP: The dx toolkit&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The dx CLI is your friend for navigating the RAP filesystem:&lt;/p&gt;
&lt;pre&gt;&lt;code class=&quot;language-bash&quot;&gt;# List files in a directory within the platform&amp;#39;s data storage
dx ls

# Stream file contents (don&amp;#39;t download!) from data storage
dx cat file-xxxx | bcftools view

# Upload local files from your workspace to data storage
dx upload local_file.txt --path /file/path/in/workspace/

# Download files (only if absolutely necessary) to your workspace
dx download file-xxxx --output local_file.txt
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;&lt;strong&gt;All of Us: gsutil for Google Cloud Storage&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;All of Us data lives in Google Cloud Storage (GCS) buckets and uses &lt;code&gt;gsutil&lt;/code&gt; to identify, stream, and move data between your workspace and these buckets.&lt;/p&gt;
&lt;pre&gt;&lt;code class=&quot;language-bash&quot;&gt;# List files in the controlled data-repository (CDR) bucket
gsutil ls gs://fc-aou-datasets-controlled/

# Find VCF files in the CDR bucket
gsutil ls gs://fc-aou-datasets-controlled/v7/wgs/short_read/snpindel/

# Stream directly (the right way)
gsutil cat gs://path/to/file.vcf.gz | bcftools view
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;&lt;strong&gt;Important:&lt;/strong&gt; Add a &lt;code&gt;-u&lt;/code&gt; flag to &lt;code&gt;gsutil&lt;/code&gt; commands to attribute the operation to your project for proper billing and access control:&lt;/p&gt;
&lt;pre&gt;&lt;code class=&quot;language-bash&quot;&gt;gsutil -u $GOOGLE_PROJECT [command]
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;Use the environment variables &lt;code&gt;$GOOGLE_PROJECT&lt;/code&gt; and &lt;code&gt;$WORKSPACE_BUCKET&lt;/code&gt; to avoid hardcoding paths:&lt;/p&gt;
&lt;pre&gt;&lt;code class=&quot;language-bash&quot;&gt;# Upload local file from your workspace to your storage bucket
gsutil -u $GOOGLE_PROJECT cp local_file.txt $WORKSPACE_BUCKET/

# Download files (only if absolutely necessary) to your workspace
gsutil -u $GOOGLE_PROJECT cp gs://path/to/file.txt local_file.txt
&lt;/code&gt;&lt;/pre&gt;
&lt;h2&gt;Tip #2: Don&amp;#39;t Bring the Cloud Home With You&lt;/h2&gt;
&lt;figure class=&quot;my-8 !max-w-none&quot;&gt;
&lt;img src=&quot;https://boston-wib.org//blog_images/biobank1/stream_not_copy.png&quot; alt=&quot;Comparison of downloading vs streaming biobank data: wrong way (slow download) versus right way (fast streaming).&quot; class=&quot;!max-w-none mx-auto w-full&quot;&gt;
&lt;figcaption class=&quot;text-center text-sm opacity-80 mt-2&quot;&gt;
   &lt;em&gt;Don&apos;t download massive biobank files — stream and filter directly on the platform.&lt;/em&gt;
&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Now that you know how to upload and download files, I must restate that &lt;strong&gt;you should not use those download commands on biobank data files&lt;/strong&gt;. Yes, &lt;code&gt;dx download&lt;/code&gt; and &lt;code&gt;gsutil cp&lt;/code&gt; exist, but the data is already where it needs to be. Your job is to meet it there, not drag it to you.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Don&amp;#39;t do this:&lt;/strong&gt;&lt;/p&gt;
&lt;pre&gt;&lt;code class=&quot;language-bash&quot;&gt;# UK Biobank: Copying a 500GB VCF locally
dx download file-xxxx

# All of Us: Same mistake, different platform
gsutil cp gs://path/to/huge.vcf.gz .
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;&lt;strong&gt;Do this instead:&lt;/strong&gt;&lt;/p&gt;
&lt;pre&gt;&lt;code class=&quot;language-bash&quot;&gt;# UK Biobank: Stream with dx
dx cat file-xxxx | bcftools view | your_analysis

# All of Us: Stream with gsutil
gsutil cat gs://path/to/file.vcf.gz | bcftools view | your_analysis
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;The data is already where it needs to be. It sits in the cloud, on fast storage, ready to be streamed. Copying wastes time, burns through storage quotas, and risks running out of disk space mid-analysis. Both platforms are designed for streaming access. Use it.&lt;/p&gt;
&lt;h2&gt;Tip #3: Know Your Tools and Your Files&lt;/h2&gt;
&lt;p&gt;Hail is prominently featured in All of Us documentation, which makes it tempting to reach for first. Resist that instinct and match your tool to your actual problem, not the first one you find or the most impressive-sounding one.&lt;/p&gt;
&lt;p&gt;Why avoid Hail when possible:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Requires expensive Spark clusters&lt;/li&gt;
&lt;li&gt;Memory-intensive operations that crash your instance&lt;/li&gt;
&lt;li&gt;Adds complexity when simpler tools work fine&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;If you can use standard tools (pandas, bcftools, plink), do that instead. Save Hail for genuinely distributed computing tasks.&lt;/p&gt;
&lt;p&gt;Regardless of which tool you choose, make sure your files are indexed (.tbi, .csi) before querying them. Without an index, tools like bcftools have to read the entire file from start to finish — region queries are no faster than loading everything.&lt;/p&gt;
&lt;h2&gt;Tip #4: Don&amp;#39;t Let Long Jobs Catch You Off Guard&lt;/h2&gt;
&lt;p&gt;Picture this: You start a 2-hour variant annotation job, grab lunch, and return to... nothing. Just a terminated instance.&lt;/p&gt;
&lt;p&gt;Three habits will save you from rerunning everything from scratch.&lt;/p&gt;
&lt;p&gt;First, check your idle timeout limit before running any long job. By default, All of Us shuts down after 15 minutes of inactivity.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;All of Us: Go to workspace settings → increase idle timeout to 8 hours (or your preferred duration)&lt;/li&gt;
&lt;li&gt;UK Biobank RAP: Check instance auto-pause settings&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Note: &lt;code&gt;nohup&lt;/code&gt; or &lt;code&gt;screen&lt;/code&gt;/&lt;code&gt;tmux&lt;/code&gt; can keep jobs running but won&amp;#39;t survive an instance shutdown so adjusting your timeout is still necessary.&lt;/p&gt;
&lt;p&gt;Second, filter as early in your pipeline as possible. The less data you&amp;#39;re carrying through each step,
the faster and cheaper each step is.&lt;/p&gt;
&lt;p&gt;Third, checkpoint intermediate results. Save outputs at meaningful stages so a crash at step 5 doesn&amp;#39;t send you back to step 1.&lt;/p&gt;
&lt;p&gt;None of these take more than a minute to set up. The rerun will.&lt;/p&gt;
&lt;h2&gt;The Bottom Line&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;When things crash:&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;You ran out of memory&lt;/li&gt;
&lt;li&gt;Filter earlier in your pipeline&lt;/li&gt;
&lt;li&gt;Checkpoint intermediate results&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;When things are slow:&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Check if files are indexed (.tbi, .csi)&lt;/li&gt;
&lt;li&gt;Use region queries instead of full chromosomes&lt;/li&gt;
&lt;li&gt;Stream instead of copying&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Set yourself up for success: learn your CLIs, stream your data, pick the right tool for the job, and make sure your long jobs have safety nets.&lt;/p&gt;
</content:encoded><category>biobank</category><category>uk-biobank</category><category>all-of-us</category><category>cloud-computing</category><category>bioinformatics</category><category>genetics</category><category>best-practices</category><category>resource-management</category><category>cli-tools</category></item><item><title>Biobank Intro Series: Getting Started</title><link>https://boston-wib.org/blog/biobank-intro-series/01-biobank-iceberg</link><guid isPermaLink="true">https://boston-wib.org/blog/biobank-intro-series/01-biobank-iceberg</guid><pubDate>Tue, 24 Feb 2026 13:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/biobank1/biobank_iceberg.png&quot; alt=&quot;Iceberg diagram illustrating hidden complexities in biobank data: visible raw data above the waterline, and three categories of hidden challenges below: batch effects, population stratification, and inaccurate phenotype data.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;How to do biobank analysis without losing your mind&lt;/em&gt;&lt;/p&gt;

&lt;figure&gt;
&lt;img src=&quot;https://boston-wib.org//blog_images/biobank1/biobank_iceberg.png&quot; alt=&quot;Iceberg diagram illustrating hidden complexities in biobank data: visible raw data above the waterline, and three categories of hidden challenges below: batch effects, population stratification, and inaccurate phenotype data.&quot; class=&quot;block dark:hidden&quot;&gt;
&lt;img src=&quot;https://boston-wib.org//blog_images/biobank1/biobank_iceberg_dark.png&quot; alt=&quot;Iceberg diagram illustrating hidden complexities in biobank data: visible raw data above the waterline, and three categories of hidden challenges below: batch effects, population stratification, and inaccurate phenotype data.&quot; class=&quot;hidden dark:block&quot;&gt;
&lt;figcaption class=&quot;text-center text-sm opacity-80 mt-2&quot;&gt;
    &lt;em&gt;The Biobank Iceberg. Everyone knows there&apos;s more beneath the surface. You can&apos;t ignore the complexities, but you don&apos;t need to solve them all.&lt;/em&gt;
  &lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;If I learned anything during my first few months as a bioinformatics contractor, it&amp;#39;s that you need to know your question before you worry about the hidden complexities. Like icebergs, biobanks have immense power—and like icebergs, their foundation is made up of massive complexities beneath the surface that can sink your analysis if you&amp;#39;re not prepared. (The Titanic had better odds.)&lt;/p&gt;
&lt;p&gt;The first step to any biobank analysis is strategic rather than technical.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;What specific research question are you trying to answer?&lt;/li&gt;
&lt;li&gt;Who is your audience?&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;That&amp;#39;s it. Two questions. Answer these before you touch a single line of code.&lt;/p&gt;
&lt;p&gt;Consider my own work as an example:&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;During my postdoc:&lt;/strong&gt; I was investigating the genetic architecture of cardiomyopathy for publication. My goal was to understand how genetic variants contribute to disease risk. A large challenge, working within the context of rare disease, is overcoming limited statistical power. Therefore, to strengthen the association signal, I restricted analyses to individuals of
White British ancestry (UK Biobank data field 22006, coding 1). The healthy volunteer recruitment biases and population structure weren&amp;#39;t just footnotes in the discussion section—they fundamentally shaped what I could and couldn&amp;#39;t claim about cardiomyopathy genetics. I needed to understand the people making up the dataset as deeply as I understood the variants themselves.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;As a contractor:&lt;/strong&gt; I provide bioinformatics support for hypothesis-driven study design that could lead to drug development. The client wants to find out if a genetic signal associates with a phenotype strongly enough to justify further investigation. In other words, I&amp;#39;m running pilot analyses. I need clean cohort definitions, appropriate statistical methods, and enough power to detect effects. The recruitment biases? Important to document, but they don&amp;#39;t invalidate finding a biological association in this specific dataset. My job is to answer the client&amp;#39;s question efficiently and move the project forward.&lt;/p&gt;
&lt;p&gt;In my contractor role, I&amp;#39;ve learned that curiosity must be disciplined. My job is to hit project checkpoints and answer the client&amp;#39;s specific question, not to chase down every interesting technical rabbit hole. The data complexity matters, but staying on track isn&amp;#39;t just good project management; it&amp;#39;s professional integrity.&lt;/p&gt;
&lt;p&gt;Only after you&amp;#39;ve defined your question and audience can you identify which data complexities actually matter. Every dataset comes with biases and baggage, and I fight two competing urges:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;The paranoid perfectionist&lt;/strong&gt;: Document every limitation, explore every confounding variable, and fall into a six-month rabbit hole about batch effects&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;The YOLO analyst:&lt;/strong&gt; Ignore all complexity, run the analysis, get wildly misleading results, and have to redo everything anyway.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Neither instinct serves the project. The discipline I&amp;#39;ve had to learn is this: acknowledge the complexity but focus only on what affects your specific research question.&lt;/p&gt;
&lt;p&gt;The good news is that this complexity isn&amp;#39;t insurmountable. It just requires understanding the data first. I&amp;#39;ve already fallen into some of these rabbit holes, so hopefully you won&amp;#39;t have to. In a series of posts, I hope to demystify UK Biobank and All of Us so you can navigate them confidently. But let me be clear, I cannot cover every nuance of these massive resources. Your specific research question may surface complexities I haven&amp;#39;t encountered. What I can give you are foundational issues that matter across most use cases, the patterns to watch for, and questions to ask.&lt;/p&gt;
</content:encoded><category>biobank</category><category>research-strategy</category><category>project-management</category><category>consulting</category><category>bioinformatics</category><category>data-science</category><category>computational-biology</category><category>biotech</category><category>genetics</category><category>best-practices</category></item><item><title>A Coffee with CompBio: Hacking your way into computational biology</title><link>https://boston-wib.org/blog/coffeewithcompbio/s2-e2</link><guid isPermaLink="true">https://boston-wib.org/blog/coffeewithcompbio/s2-e2</guid><pubDate>Tue, 24 Feb 2026 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/coffeeWithCompBio-logo2.png&quot; alt=&quot;Coffee with CompBio Podcast Logo: Four painted women under the podcast title&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Hackathons can be a great way to understand the trends in your field, meet new people, and network.&lt;/em&gt;&lt;/p&gt;

&lt;iframe
  allow=&quot;autoplay *; encrypted-media *; fullscreen *; clipboard-write&quot;
  frameborder=&quot;0&quot;
  height=&quot;175&quot;
  style=&quot;width:100%;max-width:660px;overflow:hidden;border-radius:10px;&quot;
  sandbox=&quot;allow-forms allow-popups allow-same-origin allow-scripts allow-storage-access-by-user-activation allow-top-navigation-by-user-activation&quot;
  src=&quot;https://embed.podcasts.apple.com/us/podcast/hacking-your-way-into-computational-biology/id1817024741?i=1000751241489&quot;
&gt;&lt;/iframe&gt;

&lt;p&gt;Hackathons are not just for coders anymore — computational biologists have made it their own with data, models, and insights! Hackathons can be a great way to understand the trends in your field, meet new people, and network. Do you want to try and participate in a hackathon this year and feel like a true hacker? The wait is over — in this episode we give you all the tea about hackathons, over a cup of coffee! Tune into our latest episode of &amp;quot;A Coffee with CompBio&amp;quot; where Sharvari Narendra and Saba Nafees talk about hackathons and more!&lt;/p&gt;
&lt;p&gt;If you think you know some more hackathon-related resources, let us know in the comments and we will give you a shoutout in the next episode!&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Links from the Episode&lt;/strong&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;http://podcast.boston-wib.org/&quot;&gt;Season 1 Archive&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://biohackathons.github.io/&quot;&gt;BioHackathons&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://nf-co.re/events/hackathon&quot;&gt;nf-core Hackathon Events&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&amp;lt;strong style={{ color: &amp;#39;red&amp;#39; }}&amp;gt;UPCOMING EVENT -&amp;gt;&lt;/strong&gt; &lt;a href=&quot;https://boston-wib.org/events/20260311_nextflow&quot;&gt;nf-core Hackathon March
2026&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://college.harvard.edu/student-life/student-stories/how-i-organized-hackathon-harvard&quot;&gt;How I Organized a Hackathon at Harvard&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://www.bio-itworldexpo.com/fair-data-hackathon&quot;&gt;BIO-IT World FAIR Data Hackathon&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://www.openhackathons.org/s/&quot;&gt;OpenHackathons&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://www.mlh.com/seasons/2026/events&quot;&gt;MLH 2026 Events&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://www.codeday.org/&quot;&gt;CodeDay&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://devpost.com/&quot;&gt;Devpost&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;https://ncbi-codeathons.github.io/&quot;&gt;NCBI Codeathons&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Listen On&lt;/strong&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaApple size={32} className=&quot;text-black dark:text-white&quot; /&gt;{&amp;#39; &amp;#39;}&lt;/p&gt;
  &lt;a href=&quot;https://podcasts.apple.com/us/podcast/hacking-your-way-into-computational-biology/id1817024741?i=1000751241489&quot;&gt;
    Apple
  &lt;/a&gt;
&lt;/span&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaSpotify size={32} color=&quot;#1DB954&quot; /&gt;{&amp;#39; &amp;#39;}
  &lt;a href=&quot;https://open.spotify.com/episode/5r965ssFfE4O91w1xxxX4x&quot;&gt;Spotify&lt;/a&gt;&lt;/p&gt;
&lt;/span&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaBell size={32} className=&quot;text-yellow-600&quot; /&gt; &lt;a href=&quot;https://podcast.boston-wib.org/feed.xml&quot;&gt;RSS Feed&lt;/a&gt;&lt;/p&gt;
&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;More on this Episode&lt;/strong&gt;:&lt;/p&gt;
&lt;p&gt;Send us your comments, questions, and suggestions using &lt;a href=&quot;https://forms.gle/ncwo6HZeN4uA9gPg7&quot;&gt;this form&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Thanks &lt;a href=&quot;https://www.linkedin.com/in/amulya-shastry/&quot;&gt;Amulya Shastry&lt;/a&gt; for editing and management support and &lt;a href=&quot;https://www.linkedin.com/in/dinaissakova/&quot;&gt;Dina Issakova&lt;/a&gt; for the cover art and social media support.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Support the Podcast&lt;/strong&gt;:&lt;/p&gt;
&lt;p&gt;We are looking for sponsors! Please get in touch if you or your business would like to help support this podcast.&lt;/p&gt;
&lt;p&gt;Follow &lt;a href=&quot;https://www.linkedin.com/in/sharvarinarendra/&quot;&gt;Sharvari&lt;/a&gt; and &lt;a href=&quot;https://www.linkedin.com/in/saba-nafees/&quot;&gt;Saba&lt;/a&gt; on LinkedIn!&lt;/p&gt;
&lt;p&gt;If you enjoyed the episode, &lt;a href=&quot;https://podcast.ausha.co/a-coffee-with-compbio&quot;&gt;please subscribe and leave a review!&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Hosted by Ausha. See &lt;a href=&quot;https://ausha.co/privacy-policy&quot;&gt;ausha.co/privacy-policy&lt;/a&gt; for more information&lt;/p&gt;
</content:encoded><category>science-communication</category><category>computational-biology</category><category>professional-development</category><category>bioinformatics</category><category>hackathons</category><category>nfcore</category><category>nextflow</category><category>networking</category></item><item><title>ctDNA as an oncology endpoint</title><link>https://boston-wib.org/blog/quicktake/ctdna-as-an-oncology-endpoint</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/ctdna-as-an-oncology-endpoint</guid><pubDate>Mon, 23 Feb 2026 13:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2026-02-10-ctdna-as-a-clinical-endpoint.jpeg&quot; alt=&quot;A research figure compares different brain regions to see where information about what someone is consciously seeing can be decoded most accurately, showing stronger decoding in posterior regions than in prefrontal areas&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Takeaways from the FoCR symposium&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;I recently attended the virtual &lt;strong&gt;&lt;a href=&quot;https://www.linkedin.com/company/friends-of-cancer-research/&quot;&gt;Friends of Cancer Research&lt;/a&gt; (FoCR)&lt;/strong&gt; symposium on &lt;strong&gt;Modernizing Oncology Endpoints&lt;/strong&gt;, focused on &lt;strong&gt;circulating tumor DNA (ctDNA)&lt;/strong&gt; and &lt;strong&gt;AI-enabled imaging&lt;/strong&gt;. The discussion brought together FoCR, industry, FDA, and academia, and it was energizing to see the progress so far, plus a clear path forward.&lt;/p&gt;
&lt;p&gt;A &lt;strong&gt;biomarker&lt;/strong&gt; is a measurable biological signal that helps indicate what’s happening in the body—such as the presence of disease, how aggressive it is, or whether a treatment is working. In oncology, &lt;strong&gt;ctDNA (circulating tumor DNA)&lt;/strong&gt; is especially compelling because it can be measured from a simple blood draw (non-invasive) and repeated over time, enabling &lt;strong&gt;longitudinal monitoring&lt;/strong&gt; of tumor burden and molecular response without relying solely on imaging or repeated tissue biopsies. That potential is exactly why ctDNA keeps coming up in conversations about &lt;strong&gt;endpoints&lt;/strong&gt;: it can capture molecular change earlier, more frequently, and often more directly than traditional approaches—if we standardize how we collect, interpret, and validate it.&lt;/p&gt;
&lt;h2&gt;Key takeaways&lt;/h2&gt;
&lt;h4&gt;The ctMoniTR Project showed ctDNA’s clinical signal at scale and what’s needed next.&lt;/h4&gt;
&lt;p&gt;With 20+ sponsors pooling heterogeneous trial datasets, ctMoniTR found consistent patient-level links between &lt;strong&gt;ctDNA reduction/clearance&lt;/strong&gt; and &lt;strong&gt;patient outcomes&lt;/strong&gt;. The next challenge is turning these signals into trial-ready and regulator-ready endpoints, especially in &lt;strong&gt;early-stage MRD&lt;/strong&gt; where ctDNA levels are lowest.&lt;/p&gt;
&lt;h4&gt;Tissue availability is becoming a limiting factor.&lt;/h4&gt;
&lt;p&gt;&lt;strong&gt;Tumor-informed MRD assays&lt;/strong&gt; are often preferred, but tissue is not always accessible or usable. A practical approach is to design &lt;strong&gt;dual-assay strategy&lt;/strong&gt; up front: tumor-informed when feasible, with a tumor-naive backup, plus clear decision rules and an analysis plan that handles both.&lt;/p&gt;
&lt;h4&gt;Scaling requires prospective standardization.&lt;/h4&gt;
&lt;p&gt;Define clinical landmark timepoints and sampling schedules in advance, not retroactively. &lt;strong&gt;Real-world data&lt;/strong&gt; can help fill natural history gaps (lead time, kinetics), but only with careful tracking of assay versions and the treatment time period.&lt;/p&gt;
&lt;h2&gt;Looking Ahead&lt;/h2&gt;
&lt;p&gt;As ctDNA endpoints evolve, they can enable &lt;strong&gt;earlier interventions&lt;/strong&gt; by detecting molecular change sooner and supporting faster decisions in both &lt;strong&gt;clinical trials&lt;/strong&gt; and &lt;strong&gt;patient care&lt;/strong&gt;. Teams that prioritize &lt;strong&gt;harmonization up&lt;/strong&gt; front will be best positioned to move quickly and confidently.&lt;/p&gt;
</content:encoded><category>data-science</category><category>computational-biology</category><category>dataStrategy</category><category>clinicalTrials</category><category>biomarkers</category><category>real-world-data</category><category>real-world-evidence</category><category>ctDNA</category><category>oncology</category><category>precision-medicine</category><category>drug-development</category></item><item><title>Beyond behavior: How machine learning decodes consciousness and forecasts seizures from brain activity</title><link>https://boston-wib.org/blog/deepdive/seminar-reflections-beyond-behavior</link><guid isPermaLink="true">https://boston-wib.org/blog/deepdive/seminar-reflections-beyond-behavior</guid><pubDate>Wed, 11 Feb 2026 13:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2026-02-02_Talk_UMass_Figures_Dana_CoverFigure.png&quot; alt=&quot;A research figure compares different brain regions to see where information about what someone is consciously seeing can be decoded most accurately, showing stronger decoding in posterior regions than in prefrontal areas&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Reflections on a seminar exploring how AI and brain network analysis can transform patient care&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Image credit: Source: &lt;a href=&quot;https://www.nature.com/articles/s41586-025-08888-1&quot;&gt;Nature (2025)&lt;/a&gt;. Adversarial testing of global neuronal workspace and integrated information theories of consciousness.&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;There is something genuinely exciting about starting a new semester by being reminded how much we still do not know, and how many people are actively working to figure it out.&lt;/p&gt;
&lt;p&gt;At the first Biomedical Engineering Seminar of the spring semester at UMass Amherst&amp;#39;s &lt;a href=&quot;https://www.umass.edu/engineering/&quot;&gt;Riccio College of Engineering&lt;/a&gt;, &lt;a href=&quot;https://www.linkedin.com/in/aya-khalaf-528b5a317/&quot;&gt;Aya Khalaf, PhD&lt;/a&gt;, from the &lt;a href=&quot;https://medicine.yale.edu/&quot;&gt;Yale School of Medicine&lt;/a&gt; presented &lt;em&gt;Decoding Brain Networks for Improved Patient Quality of Life&lt;/em&gt;. From the beginning, it was clear this seminar would be one of those talks that stays with me long after I left the room. As someone with a background in both wet lab neuroscience research and computational modeling, seminars like this reinforce how essential computational tools and AI have become for transforming complex neural data into insights that can meaningfully impact patient care.&lt;/p&gt;
&lt;h2&gt;Studying Consciousness Beyond Behavior&lt;/h2&gt;
&lt;p&gt;At its core, the seminar focused on consciousness. What it is, how we detect it, and how it can be disrupted in neurological disease. Consciousness feels intuitive when we experience it ourselves, yet scientifically it remains one of the biggest open questions in neuroscience, with major implications for medicine, ethics, and patient care. This is a topic I am personally very interested in, especially because in the past I learned about coma states and how patients can still retain forms of consciousness even when they appear behaviorally unresponsive.&lt;/p&gt;
&lt;p&gt;Dr. Khalaf’s work combines computational modeling, AI, electrophysiology, and neuroimaging to study how conscious perception emerges from coordinated activity across brain networks. One key idea explored was decoding the content of consciousness, not just determining whether someone is conscious at all. Rather than asking a simple yes or no question about awareness, this approach focuses on identifying what a person is actually perceiving. For example, neural activity patterns in specific brain regions can be used to distinguish whether someone is seeing a face versus an object, offering a more nuanced view of conscious experience.&lt;/p&gt;
&lt;p&gt;&lt;img src=&quot;https://boston-wib.org//blog_images/2026-02-02_Talk_UMass_Figures_Dana_Slide5.png&quot; alt=&quot;A slide from the presentation highlights prefrontal and posterior regions of the brain as possible locations where researchers can measure information about the content of consciousness.&quot;&gt;
&lt;em&gt;Decoding conscious content using distributed brain networks across prefrontal and posterior regions (&lt;a href=&quot;https://www.sciencedirect.com/science/article/pii/S1053811925002277&quot;&gt;Khalaf et al. 2025&lt;/a&gt;).&lt;/em&gt;&lt;/p&gt;
&lt;h2&gt;Seeing Consciousness Hidden in Brain Signals&lt;/h2&gt;
&lt;p&gt;One moment that really stuck with me was when Dr. Khalaf discussed patients diagnosed with unresponsive wakefulness syndrome. When asked to imagine movement, patients appear behaviorally unresponsive, but neuroimaging can reveal patterns of brain activity similar to those seen in minimally conscious individuals . This highlights how behavioral tests alone may not fully capture internal awareness.&lt;/p&gt;
&lt;p&gt;From a technical perspective, the work isolates neural signals associated with conscious perception, meaning when a stimulus is actively experienced and reportable by the individual (for example, intentionally imagining a movement). This is distinct from general stimulus processing, which refers to automatic neural responses that can occur even when a stimulus is not consciously perceived (for example, blinking without thinking about it). To study this difference, researchers examined stimulus detection networks using intracranial EEG (electroencephalogram) recordings with thousands of implanted electrodes. In this context, target stimuli were those that participants were instructed to attend to or respond to, while non-target stimuli were presented but did not require conscious attention or action. Because both types of stimuli activate the brain at some level, careful signal isolation is needed to identify neural activity that truly reflects conscious awareness rather than passive sensory processing.&lt;/p&gt;
&lt;p&gt;The functional MRI (fMRI) analyses stood out to me because they showed how brain imaging can be used to study not just where activity occurs, but how conscious experience is coordinated across networks. By tracking changes in blood oxygen level dependent (BOLD) signals, researchers found that as participants shifted from situations where they rested and focused on a static point (fixation blocks) to circumstances where they actively processed sensory information (task blocks), they could observe how brain activity changed with conscious engagement. Using mathematical pattern recognition methods to identify groups of brain regions that consistently activated together, the study highlighted the midbrain and central thalamus as key hubs that amplify and coordinate signals across vision, hearing, and touch. While these regions have long been linked to consciousness, identifying them through data driven network analysis helped validate the approach and reinforced the idea that consciousness emerges from distributed brain systems rather than isolated regions.&lt;/p&gt;
&lt;p&gt;&lt;img src=&quot;https://boston-wib.org//blog_images/2026-02-02_Talk_UMass_Figures_Dana_Slide8.png&quot; alt=&quot;Brain scan images highlight the midbrain and central thalamus, showing that these regions are active across vision, hearing, taste, and touch when conscious perception occurs.&quot;&gt;
&lt;em&gt;Midbrain and central thalamus involvement in conscious perception across sensory modalities (&lt;a href=&quot;https://www.sciencedirect.com/science/article/pii/S1053811925002277&quot;&gt;Khalaf et al. 2025&lt;/a&gt;).&lt;/em&gt;&lt;/p&gt;
&lt;h2&gt;Using Machine Learning to Predict Loss of Consciousness Before a Seizure&lt;/h2&gt;
&lt;p&gt;The part of the talk that stood out to me the most focused on epilepsy and the use of machine learning to understand how seizures affect consciousness.
Dr. Khalaf discussed absence seizures, a type of seizure in which patients may briefly lose awareness or responsiveness, often without obvious physical convulsions. Because these seizures can be subtle and difficult to detect behaviorally, understanding their neural signatures is especially important.
Using EEG data, Dr. Khalaf presented machine learning models trained to predict whether an absence seizure would impair consciousness by comparing brain activity before a seizure begins and during the seizure itself. The models revealed clear differences in neural activity patterns between these two brain states, showing that measurable changes in network behavior emerge even before consciousness is disrupted. One key insight was that healthier brain networks tend to produce more complex neural signals, while impaired consciousness is associated with reduced signal complexity.&lt;/p&gt;
&lt;p&gt;Technically, this approach involved extracting informative features from EEG signals that highlight differences between the brain states before and during a seizure. By applying Common Spatial Pattern analysis, the models emphasized spatial patterns of neural activity that best distinguished seizures that impaired consciousness from those that did not. Since consciousness during seizures is typically assessed through behavior alone, identifying reliable neural markers could eventually help clinicians anticipate loss of awareness and support earlier or more targeted interventions for patients.
I am particularly interested in the machine learning side of this work because AI has the potential to support clinical decision making while also improving cost effectiveness. If we can rely on robust ML models to predict impaired consciousness from EEG data, it could reduce the need for extensive behavioral testing and long monitoring sessions, making epilepsy evaluation more accessible and scalable in real clinical settings.&lt;/p&gt;
&lt;p&gt;&lt;img src=&quot;https://boston-wib.org//blog_images/2026-02-02_Talk_UMass_Figures_Dana_Slide11.png&quot; alt=&quot;A slide shows how brain wave data is mathematically separated into patterns that help distinguish between seizures that impair consciousness and those that do not.&quot;&gt;
&lt;em&gt;Extracting EEG features using Common Spatial Pattern analysis to distinguish impaired and spared consciousness during seizures (&lt;a href=&quot;https://onlinelibrary.wiley.com/doi/full/10.1002/acn3.51647&quot;&gt;Springer, Khalaf, et al. 2022&lt;/a&gt;).&lt;/em&gt;&lt;/p&gt;
&lt;h2&gt;Looking Ahead&lt;/h2&gt;
&lt;p&gt;This semester, I want to be intentional about attending seminars and writing about what I learn. Research evolves quickly, and staying curious and engaged with work happening now feels like an important part of being a student in science and engineering.&lt;/p&gt;
&lt;p&gt;Today was a strong start.&lt;/p&gt;
</content:encoded><category>biomedical-engineering</category><category>neuroscience</category><category>ai-in-healthcare</category><category>machine-learning</category><category>brain-networks</category><category>epilepsy-research</category><category>clinical-neuroscience</category><category>science-communication</category></item><item><title>A Coffee with CompBio: New Year Resolutions</title><link>https://boston-wib.org/blog/coffeewithcompbio/s2-e1</link><guid isPermaLink="true">https://boston-wib.org/blog/coffeewithcompbio/s2-e1</guid><pubDate>Wed, 28 Jan 2026 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/coffeeWithCompBio-logo2.png&quot; alt=&quot;Coffee with CompBio Podcast Logo: Four painted women under the podcast title&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;12 New Year Resolutions For Computational Biologists&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&amp;lt;audio
  controls
  preload=&amp;quot;metadata&amp;quot;
  style=&amp;quot;display: block; margin: 0 auto; width: 80%; height: 50px;&amp;quot;
  class=&amp;quot;block mx-auto w-4/5 max-w-3xl h-14 border-2 border-gray-300 dark:border-slate-700 rounded-xl bg-gray-50 dark:bg-transparent&amp;quot;&lt;/p&gt;
&lt;blockquote&gt;
&lt;/blockquote&gt;
&lt;p&gt;  &amp;lt;source
    src=&amp;quot;&lt;a href=&quot;https://ia800703.us.archive.org/3/items/acoffeewithcompbio/episode_01_202601_12-New-Year-Resolutions-For-Computational-Biologists.m4a&quot;&gt;https://ia800703.us.archive.org/3/items/acoffeewithcompbio/episode_01_202601_12-New-Year-Resolutions-For-Computational-Biologists.m4a&lt;/a&gt;&amp;quot;
    type=&amp;quot;audio/mp4&amp;quot;&lt;/p&gt;
&lt;blockquote&gt;
&lt;/blockquote&gt;
&lt;pre&gt;&lt;code&gt;Your browser does not support the audio element.
&lt;/code&gt;&lt;/pre&gt;
  &lt;/source&gt;
&lt;/audio&gt;

&lt;p&gt;Grab your coffee and join us for another episode of &lt;strong&gt;A Coffee with Comp Bio&lt;/strong&gt;!&lt;/p&gt;
&lt;p&gt;New year, new episode, new comp-bio goals and new hosts. But first, we wish you a very Happy New Year! Most new year&amp;#39;s resolutions fail due to lack of clarity, so to make it easier, we begin our first episode of the season with a list that hopefully inspires you. From using AI tools to make your life easier to documenting your own code better, we are bringing resolutions every computational biologist needs this new year. Tune into the latest episode of “A Coffee with CompBio” where Sharvari Narendra and Saba Nafees present 12 awesome resolutions for the new year.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Listen On&lt;/strong&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaApple size={32} className=&quot;text-black dark:text-white&quot; /&gt;{&amp;#39; &amp;#39;}&lt;/p&gt;
  &lt;a href=&quot;https://podcasts.apple.com/us/podcast/12-new-year-resolutions-for-computational-biologists/id1817024741?i=1000747076113&quot;&gt;
    Apple
  &lt;/a&gt;
&lt;/span&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaSpotify size={32} color=&quot;#1DB954&quot; /&gt;{&amp;#39; &amp;#39;}
  &lt;a href=&quot;https://open.spotify.com/episode/0igVqZBmyartwyAp5469tk&quot;&gt;Spotify&lt;/a&gt;&lt;/p&gt;
&lt;/span&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaBell size={32} className=&quot;text-yellow-600&quot; /&gt; &lt;a href=&quot;https://podcast.boston-wib.org/feed.xml&quot;&gt;RSS Feed&lt;/a&gt;&lt;/p&gt;
&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;More on this Episode&lt;/strong&gt;:&lt;/p&gt;
&lt;p&gt;Send us your comments, questions, and suggestions using &lt;a href=&quot;https://forms.gle/ncwo6HZeN4uA9gPg7&quot;&gt;this form&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Thanks &lt;a href=&quot;https://www.linkedin.com/in/amulya-shastry/&quot;&gt;Amulya Shastry&lt;/a&gt; for editing and management support and &lt;a href=&quot;https://www.linkedin.com/in/dinaissakova/&quot;&gt;Dina Issakova&lt;/a&gt; for the cover art and social media support.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Support the Podcast&lt;/strong&gt;:&lt;/p&gt;
&lt;p&gt;We are looking for sponsors! Please get in touch if you or your business would like to help support this podcast.&lt;/p&gt;
&lt;p&gt;Follow &lt;a href=&quot;https://www.linkedin.com/in/sharvarinarendra/&quot;&gt;Sharvari&lt;/a&gt; and &lt;a href=&quot;https://www.linkedin.com/in/saba-nafees/&quot;&gt;Saba&lt;/a&gt; on LinkedIn!&lt;/p&gt;
&lt;p&gt;If you enjoyed the episode, &lt;a href=&quot;https://podcast.ausha.co/a-coffee-with-compbio&quot;&gt;please subscribe and leave a review!&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Hosted by Ausha. See &lt;a href=&quot;https://ausha.co/privacy-policy&quot;&gt;ausha.co/privacy-policy&lt;/a&gt; for more information&lt;/p&gt;
</content:encoded><category>science-communication</category><category>computational-biology</category><category>self-directed-learning</category><category>bioinformatics</category><category>ai-in-bioinformatics</category></item><item><title>Tuesday Tactics: Build vs Buy Red Flag</title><link>https://boston-wib.org/blog/quicktake/tuesday-tactics-build-vs-buy-red-flag</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/tuesday-tactics-build-vs-buy-red-flag</guid><pubDate>Tue, 06 Jan 2026 13:00:00 GMT</pubDate><content:encoded/><category>software-engineering</category><category>build-vs-buy</category><category>tech-strategy</category><category>biotech</category><category>tuesday-tactics</category></item><item><title>Tuesday Tactics: The Excel Export Feature Request</title><link>https://boston-wib.org/blog/quicktake/tuesday-tactics-excel-export-request</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/tuesday-tactics-excel-export-request</guid><pubDate>Tue, 30 Dec 2025 13:00:00 GMT</pubDate><content:encoded/><category>data-engineering</category><category>product-design</category><category>user-experience</category><category>bioinformatics</category><category>tuesday-tactics</category></item><item><title>Tuesday Tactics: Data Steward ≠ Data Janitor</title><link>https://boston-wib.org/blog/quicktake/tuesday-tactics-data-steward-not-janitor</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/tuesday-tactics-data-steward-not-janitor</guid><pubDate>Tue, 23 Dec 2025 13:00:00 GMT</pubDate><content:encoded/><category>data-engineering</category><category>data-stewardship</category><category>bioinformatics</category><category>talent-management</category><category>tuesday-tactics</category></item><item><title>The Tool Consolidation Paradox</title><link>https://boston-wib.org/blog/quicktake/tool-consolidation-paradox</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/tool-consolidation-paradox</guid><pubDate>Fri, 19 Dec 2025 13:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-12-19-tool-consolidation-paradox.png&quot; alt=&quot;Consolidation paradox: One platform promises less complexity but creates new problems. More tools does not always mean more problems. Optimize for workflows, not org charts.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Why Fewer Tools Often Means More Complexity&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>data-science</category><category>biotech</category><category>data-engineering</category><category>tech-leadership</category><category>data-strategy</category></item><item><title>The Twelve Days of Scale-Up</title><link>https://boston-wib.org/blog/quicktake/twelve-days-of-scale-up</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/twelve-days-of-scale-up</guid><pubDate>Thu, 18 Dec 2025 13:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-12-18-twelve-days-of-scale-up.png&quot; alt=&quot;Festive holiday tree decorated with bioinformatics pipeline error messages, representing the chaos of debugging during the holiday season&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;A festive tune of crashed jobs caroling in the key of why.&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>biotech</category><category>bioinformatics</category><category>computational-biology</category><category>holiday-humor</category><category>data-science</category></item><item><title>Tuesday Tactics: The 10-Minute Test</title><link>https://boston-wib.org/blog/quicktake/tuesday-tactics-10-minute-test</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/tuesday-tactics-10-minute-test</guid><pubDate>Tue, 16 Dec 2025 13:00:00 GMT</pubDate><content:encoded/><category>data-engineering</category><category>documentation</category><category>bioinformatics</category><category>onboarding</category><category>tuesday-tactics</category></item><item><title>Software Doesn&apos;t Age Like Wine</title><link>https://boston-wib.org/blog/quicktake/software-maintenance-biotech</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/software-maintenance-biotech</guid><pubDate>Mon, 15 Dec 2025 13:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-12-15-software-maintenance-biotech.png&quot; alt=&quot;Lifecycle comparison between software and a microscope both bought in 2020. The software becomes outdated faster and needs maintenance more often.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Why Your Data Tools Need Maintenance (And What That Really Means)&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>data-engineering</category><category>bioinformatics</category><category>technical-debt</category><category>biotech</category><category>data-science</category></item><item><title>Interviewing Is Networking in Disguise</title><link>https://boston-wib.org/blog/quicktake/interviewing-is-networking</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/interviewing-is-networking</guid><pubDate>Wed, 10 Dec 2025 13:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-12-10-interviewing-is-networking.png&quot; alt=&quot;Interviews can either lead to a job offer or a professional connections and both are considered wins.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;The real win in interviewing is the network you grow along the way.&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>career-development</category><category>career-advice</category><category>networking</category><category>job-search</category><category>professional-growth</category></item><item><title>Tuesday Tactics: Sunset Before You Scale</title><link>https://boston-wib.org/blog/quicktake/tuesday-tactics-sunset-before-you-scale</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/tuesday-tactics-sunset-before-you-scale</guid><pubDate>Tue, 09 Dec 2025 13:00:00 GMT</pubDate><content:encoded/><category>data-engineering</category><category>data-governance</category><category>technical-dept</category><category>biotech</category><category>tuesday-tactics</category></item><item><title>The Incremental Migration Pattern</title><link>https://boston-wib.org/blog/quicktake/incremental-migration-pattern</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/incremental-migration-pattern</guid><pubDate>Mon, 08 Dec 2025 13:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-12-08-incremental-migration-pattern.png&quot; alt=&quot;Diagram illustrating the incremental migration pattern with a routing layer directing traffic between old and new systems, showing how to gradually migrate components without disrupting production&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;How to Rebuild the Plane While Flying It&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>software-engineering</category><category>data-engineering</category><category>refactoring</category><category>technical-debt</category></item><item><title>Starting a Women in Bioinformatics Chapter: A Practical Guide</title><link>https://boston-wib.org/blog/tutorial/starting-a-women-in-bioinformatics-chapter</link><guid isPermaLink="true">https://boston-wib.org/blog/tutorial/starting-a-women-in-bioinformatics-chapter</guid><pubDate>Fri, 05 Dec 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/~/assets/images/WIB_Logo.jpg&quot; alt=&quot;Boston Women in Bioinformatics Logo&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;&quot;Been There, Done That&quot; advice from established chapters&lt;/em&gt;&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Note: This guide is based on our experience as a local volunteer group. We are now a non-profit organization and are happy to advise other groups, but we cannot share our logos or web domain due to legal considerations. Our long-term vision is to grow together: if you start a chapter and it becomes stable, we would love your leadership team to connect with ours so we can build a nationwide network.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h2&gt;Table of Contents&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href=&quot;table-of-contents&quot;&gt;Table of Contents&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;getting-started:-the-foundation&quot;&gt;Getting Started: The Foundation&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;digital-infrastructure:-setting-up-your-online-presence&quot;&gt;Digital Infrastructure: Setting Up Your Online Presence&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;committee-structure:-learning-from-boston-wib&quot;&gt;Committee Structure: Learning from Boston WiB&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;event-planning:-what-we-wish-we&apos;d-known&quot;&gt;Event Planning: What We Wish We&amp;#39;d Known&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;funding-and-sponsorship:-making-it-sustainable&quot;&gt;Funding and Sponsorship: Making It Sustainable&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;community-building:-the-soft-skills&quot;&gt;Community Building: The Soft Skills&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;common-pitfalls-and-how-to-avoid-them&quot;&gt;Common Pitfalls and How to Avoid Them&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;timeline:-first-year-milestones&quot;&gt;Timeline: First Year Milestones&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;resources-and-templates&quot;&gt;Resources and Templates&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&quot;final-words-of-encouragement&quot;&gt;Final Words of Encouragement&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;h2&gt;Getting Started: The Foundation&lt;/h2&gt;
&lt;h3&gt;Core Team Assembly&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Start small but think strategically&lt;/strong&gt;. Begin with 3-5 committed individuals who can each take ownership of key areas. Look for people with complementary skills: someone with event planning experience, a communications-savvy person, someone with industry connections, and ideally someone with non-profit or volunteer organization experience.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Establish clear roles early&lt;/strong&gt;. Even before formal committees, designate who handles what to avoid overlap and ensure nothing falls through the cracks.&lt;/p&gt;
&lt;h3&gt;Legal and Organizational Structure&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;Research local requirements for establishing a volunteer organization&lt;/li&gt;
&lt;li&gt;Consider whether you want to incorporate as a non-profit (this can wait until you&amp;#39;re more established)&lt;/li&gt;
&lt;li&gt;Create basic bylaws or operating agreements early to prevent conflicts later&lt;/li&gt;
&lt;li&gt;Establish a simple decision-making process&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;span style=&quot;color: red;&quot;&gt;&lt;i&gt;BWiB BTDT advice: BWiB operated without official non-profit status for 10 years… so you don’t need to rush this.&lt;/i&gt;&lt;/span&gt;&lt;/p&gt;
&lt;h2&gt;Digital Infrastructure: Setting Up Your Online Presence&lt;/h2&gt;
&lt;h3&gt;Event Management Platforms&lt;/h3&gt;
&lt;h4&gt;Luma vs. Meetup: Our Experience&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Meetup: Great for getting started, has built-in discovery features, costs ~$15-20/month&lt;ul&gt;
&lt;li&gt;Pros: Established user base, good for finding your initial community&lt;/li&gt;
&lt;li&gt;Cons: Limited customization, ongoing costs, platform dependency, cannot access member email addresses&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;Luma: Better for established groups, more professional appearance, free tier available&lt;ul&gt;
&lt;li&gt;Pros: More polished interface, better integration options, free for most events that we host so far, can access member email addresses&lt;/li&gt;
&lt;li&gt;Cons: Less discovery, need to drive your own traffic&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;span style=&quot;color: red;&quot;&gt;&lt;i&gt;BWiB BTDT advice: BWiB started with Meetup but we have recently transitioned to Luma.&lt;/i&gt;&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;Communication Channels&lt;/h3&gt;
&lt;h4&gt;LinkedIn Group&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Create a LinkedIn group early for professional networking&lt;/li&gt;
&lt;li&gt;Post job opportunities, industry news, and event announcements&lt;/li&gt;
&lt;li&gt;Encourage members to share their professional achievements&lt;/li&gt;
&lt;li&gt;Use LinkedIn Events to promote your gatherings&lt;/li&gt;
&lt;li&gt;&lt;span style=&quot;color: red;&quot;&gt;&lt;i&gt;BWiB BTDT advice: we get a lot of traffic from our LI group postings &lt;/i&gt;&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;h4&gt;Slack Channel&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Essential for real-time communication and community building&lt;/li&gt;
&lt;li&gt;Create channels for: #general, #jobs, #events, #resources, #introductions&lt;/li&gt;
&lt;li&gt;Consider topic-specific channels as you grow (#r-users, #python, #career-advice)&lt;/li&gt;
&lt;li&gt;Establish community guidelines and moderation policies from day one&lt;/li&gt;
&lt;li&gt;&lt;span style=&quot;color: red;&quot;&gt;&lt;i&gt;BWiB BTDT advice: we have one and it’s mostly being used by the executive team, less so by the members&lt;/i&gt;&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;h4&gt;Email List&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Don&amp;#39;t rely solely on social platforms - build an email list&lt;/li&gt;
&lt;li&gt;Send monthly newsletters with event updates, job postings, and community highlights&lt;/li&gt;
&lt;li&gt;&lt;span style=&quot;color: red;&quot;&gt;&lt;i&gt;BWiB BTDT advice: we are still working on the best way to do this &lt;/i&gt;&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Website Considerations&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;Start simple: a basic website with your mission, upcoming events, and contact info&lt;/li&gt;
&lt;li&gt;Free options: GitHub Pages, Netlify, or basic WordPress&lt;/li&gt;
&lt;li&gt;Include: About page, Events calendar, Resources section, Committee information&lt;/li&gt;
&lt;li&gt;Make it mobile-friendly from day one&lt;/li&gt;
&lt;li&gt;&lt;span style=&quot;color: red;&quot;&gt;&lt;i&gt;BWiB BTDT advice: it took some effort to get the first version off the ground but we like it a lot now that it’s there! &lt;/i&gt;&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;h2&gt;Committee Structure: Learning from Boston WiB&lt;/h2&gt;
&lt;p&gt;Based on Boston WiB&amp;#39;s successful model, consider establishing these committees as you grow:&lt;/p&gt;
&lt;h3&gt;Essential Committees (Start Here)&lt;/h3&gt;
&lt;ol&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Web/Digital and Communication Committee&lt;/strong&gt; - Technical infrastructure&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Handle communications workflows&lt;/li&gt;
&lt;li&gt;Maintain website and online resources&lt;/li&gt;
&lt;li&gt;Manage digital tools and platforms&lt;/li&gt;
&lt;li&gt;Curate community resources&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Events Committee&lt;/strong&gt; – Programming and logistics for events&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;The Events Committee is responsible for:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Designing a balanced annual program (technical talks, workshops, networking, career panels, journal clubs, etc.)&lt;/li&gt;
&lt;li&gt;Collecting ideas from the community and shaping them into concrete event formats&lt;/li&gt;
&lt;li&gt;Handling event logistics: dates, venues (or virtual platforms), AV needs, accessibility considerations, and registration pages (e.g., Luma/Meetup)&lt;/li&gt;
&lt;li&gt;Coordinating with speakers and panelists (outreach, confirmations, talk titles/abstracts, bios)&lt;/li&gt;
&lt;li&gt;Ensuring each event has a clear run-of-show, roles for volunteers, and a backup plan for virtual participation where possible&lt;/li&gt;
&lt;li&gt;Partnering with other committees (Web/Digital, Sponsorship, Communications) to promote events, capture photos/resources, and follow up with attendees&lt;/li&gt;
&lt;li&gt;Tracking basic metrics (attendance, feedback, repeat attendees) to improve future programming and support sponsorship conversations&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;span style=&quot;color: red;&quot;&gt;&lt;i&gt;BWiB BTDT advice: we didn’t have official committees for the first ~9 years of the group, but once we did establish committees, we got more things done in a more organized fashion. &lt;/i&gt;&lt;/span&gt;&lt;/p&gt;
&lt;h2&gt;Event Planning: What We Wish We&amp;#39;d Known&lt;/h2&gt;
&lt;h3&gt;Venue Selection&lt;/h3&gt;
&lt;h4&gt;Free Options to Explore&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;University libraries and conference rooms&lt;/li&gt;
&lt;li&gt;Hospital/medical center meeting spaces&lt;/li&gt;
&lt;li&gt;Tech company offices (many have community programs)&lt;/li&gt;
&lt;li&gt;Co-working spaces (often free for non-profits)&lt;/li&gt;
&lt;li&gt;Public libraries with meeting rooms&lt;/li&gt;
&lt;li&gt;Biotech incubators and accelerators&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Pro Tips&lt;/strong&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Always have a backup plan for virtual meetings&lt;/li&gt;
&lt;li&gt;Test all AV equipment before events&lt;/li&gt;
&lt;li&gt;Choose accessible locations with public transportation&lt;/li&gt;
&lt;li&gt;Consider rotating locations to serve different geographic areas&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Meeting Formats That Work&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Technical workshops&lt;/strong&gt;: Hands-on learning (R/Python tutorials, specific tools)&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Career panels&lt;/strong&gt;: Industry professionals sharing experiences&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Networking mixers&lt;/strong&gt;: Casual relationship building&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Lunch meetups&lt;/strong&gt;: Getting folks together and chat over lunch&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Journal clubs&lt;/strong&gt;: Discussing recent papers&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Industry visits&lt;/strong&gt;: Tours of biotech companies or research facilities&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Skill-sharing sessions&lt;/strong&gt;: Members teaching each other&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Speaker Recruitment&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;Tap your local biotech and academic communities&lt;/li&gt;
&lt;li&gt;Invite recent conference speakers (they often reuse presentations)&lt;/li&gt;
&lt;li&gt;Consider virtual speakers to expand your options&lt;/li&gt;
&lt;li&gt;Create a speaker database and wishlist&lt;/li&gt;
&lt;li&gt;Offer to reciprocate speaking opportunities&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Timing&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;During the day&lt;ul&gt;
&lt;li&gt;Good for virtual events&lt;/li&gt;
&lt;li&gt;Good for working parents&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;Evening&lt;ul&gt;
&lt;li&gt;Best for in-person events that involve networking&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h2&gt;Funding and Sponsorship: Making It Sustainable&lt;/h2&gt;
&lt;h3&gt;Free Resources to Maximize&lt;/h3&gt;
&lt;p&gt;Most successful chapters operate primarily on volunteer time and free resources for the first few years.&lt;/p&gt;
&lt;p&gt;&lt;span style=&quot;color: red;&quot;&gt;&lt;i&gt;BWiB BTDT advice: 99% of our events are free to the public&lt;/i&gt;&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;Sponsorship Strategy&lt;/h3&gt;
&lt;h4&gt;When to Start Seeking Sponsors:&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Once you have 50+ regular attendees&lt;/li&gt;
&lt;li&gt;When you have consistent programming&lt;/li&gt;
&lt;li&gt;After establishing credibility in the community&lt;/li&gt;
&lt;/ul&gt;
&lt;h4&gt;Potential Sponsors:&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Local biotech companies&lt;/li&gt;
&lt;li&gt;Pharmaceutical companies&lt;/li&gt;
&lt;li&gt;Academic institutions&lt;/li&gt;
&lt;li&gt;Bioinformatics software companies&lt;/li&gt;
&lt;li&gt;Consulting firms&lt;/li&gt;
&lt;li&gt;Professional service providers (legal, HR, etc.)&lt;/li&gt;
&lt;/ul&gt;
&lt;h4&gt;Sponsorship Packages&lt;/h4&gt;
&lt;ul&gt;
&lt;li&gt;Bronze ($100-500): Logo on website, mention in newsletters&lt;/li&gt;
&lt;li&gt;Silver ($500-1500): Event speaking slot, booth at networking events&lt;/li&gt;
&lt;li&gt;Gold ($1500+): Title sponsor of major events, annual report recognition&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Grant Opportunities&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;Many professional organizations offer small grants for diversity initiatives&lt;/li&gt;
&lt;li&gt;Local community foundations&lt;/li&gt;
&lt;li&gt;Corporate diversity and inclusion grants&lt;/li&gt;
&lt;li&gt;University community engagement funds&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;span style=&quot;color: red;&quot;&gt;&lt;i&gt;BWiB BTDT advice: engaging with sponsors in a meaningful way takes quite a bit of time so we have one person who is working only on that.&lt;/i&gt;&lt;/span&gt;&lt;/p&gt;
&lt;h2&gt;Community Building: The Soft Skills&lt;/h2&gt;
&lt;h3&gt;Creating Inclusive Spaces&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;Establish and enforce a code of conduct (see ours below)&lt;/li&gt;
&lt;li&gt;Use inclusive language in all communications&lt;/li&gt;
&lt;li&gt;Provide multiple ways for people to engage (in-person, virtual, async)&lt;/li&gt;
&lt;li&gt;Actively welcome newcomers and explain &amp;quot;inside&amp;quot; references&lt;/li&gt;
&lt;li&gt;Consider childcare or timing for working parents&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Sustaining Volunteer Energy&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;Rotate leadership responsibilities to prevent burnout&lt;/li&gt;
&lt;li&gt;Celebrate volunteers publicly and often&lt;/li&gt;
&lt;li&gt;Set realistic expectations and timelines&lt;/li&gt;
&lt;li&gt;Create clear handoff procedures for roles&lt;/li&gt;
&lt;li&gt;Host volunteer appreciation events&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Measuring Success&lt;/h3&gt;
&lt;p&gt;Track metrics that matter:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Attendance at events (both unique and repeat attendees)&lt;/li&gt;
&lt;li&gt;Engagement on digital platforms&lt;/li&gt;
&lt;li&gt;Career advancements of members&lt;/li&gt;
&lt;li&gt;Feedback scores from events&lt;/li&gt;
&lt;li&gt;Diversity of speakers and attendees&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;span style=&quot;color: red;&quot;&gt;&lt;i&gt;BWiB BTDT advice: this is also very important for engaging with future sponsors.&lt;/i&gt;&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;BWiB Code of Conduct&lt;/h3&gt;
&lt;p&gt;This lunch event is dedicated to providing a harassment-free experience for everyone, regardless of gender, gender identity and expression, age, sexual orientation, disability, physical appearance, body size, race, ethnicity, religion (or lack thereof), or technology choices. We do not tolerate harassment in any form. Anyone violating these rules may be sanctioned or banned from attending at the discretion of the conference organizers. Please contact the organizers through meet-up messaging/Slack, if you feel someone has broken the code of conduct.&lt;/p&gt;
&lt;p&gt;Original source and credit:&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;http://2012.jsconf.us/#/about&quot;&gt;http://2012.jsconf.us/#/about&lt;/a&gt; &amp;amp; &lt;a href=&quot;http://geekfeminism.wikia.com/wiki/Conference_anti-harassment/Policy&quot;&gt;The Ada Initiative&lt;/a&gt;&lt;/p&gt;
&lt;h2&gt;Common Pitfalls and How to Avoid Them&lt;/h2&gt;
&lt;h3&gt;Overcommitting Early&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The Problem&lt;/strong&gt;: Trying to do too much too fast leads to burnout and poor quality events.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The Solution&lt;/strong&gt;: Start with monthly or bi-monthly events and grow from there.&lt;/p&gt;
&lt;h3&gt;Founder Dependence&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The Problem&lt;/strong&gt;: One person becomes indispensable, creating fragility.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The Solution&lt;/strong&gt;: Distribute responsibilities and document processes from day one.&lt;/p&gt;
&lt;h3&gt;Mission Drift&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The Problem&lt;/strong&gt;: Losing focus on your core purpose as you grow.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The Solution&lt;/strong&gt;: Regularly revisit your mission statement and evaluate activities against it.&lt;/p&gt;
&lt;h3&gt;Geographic Challenges&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The Problem&lt;/strong&gt;: Serving a spread-out community effectively.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The Solution&lt;/strong&gt;: Embrace hybrid events and consider multiple smaller meetups.&lt;/p&gt;
&lt;h2&gt;Timeline: First Year Milestones&lt;/h2&gt;
&lt;p&gt;The timeline below is a suggestion based on our experience. If you want to grow at a different pace, you can stay in each phase as long as needed.&lt;/p&gt;
&lt;h3&gt;Months 1-3: Foundation&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;Assemble core team&lt;/li&gt;
&lt;li&gt;Define mission and basic structure&lt;/li&gt;
&lt;li&gt;Set up digital infrastructure (Meetup/Luma, LinkedIn, Slack)&lt;/li&gt;
&lt;li&gt;Plan first event&lt;/li&gt;
&lt;li&gt;Brainstorm the next 3 or 4 events&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Months 4-6: Growth&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;Host 2-3 successful events&lt;/li&gt;
&lt;li&gt;Establish committee structure&lt;/li&gt;
&lt;li&gt;Build email list to 50+ people&lt;/li&gt;
&lt;li&gt;Create basic website&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Months 7-9: Expansion&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;Launch mentorship or special program&lt;/li&gt;
&lt;li&gt;Seek first sponsorship opportunities&lt;/li&gt;
&lt;li&gt;Establish partnerships with local organizations&lt;/li&gt;
&lt;li&gt;Host first major event (conference, symposium)&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Months 10-12: Sustainability&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;Develop leadership succession plan&lt;/li&gt;
&lt;li&gt;Evaluate and refine committee structure&lt;/li&gt;
&lt;li&gt;Plan annual programming calendar&lt;ul&gt;
&lt;li&gt;Document processes and create handbooks&lt;/li&gt;
&lt;li&gt;Contact the Boston WiB leadership team to work more closely together&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h2&gt;Resources and Templates&lt;/h2&gt;
&lt;h3&gt;Essential Tools (Free Tier)&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Event Management&lt;/strong&gt;: Luma, Meetup, Eventbrite&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Communication&lt;/strong&gt;: Slack (free up to 10,000 messages), Discord&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Email Marketing&lt;/strong&gt;: Mailchimp, ConvertKit&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Project Management&lt;/strong&gt;: Trello, Notion, Google Workspace&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Design&lt;/strong&gt;: Canva for social media graphics and flyers&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Legal and Administrative&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;Sample bylaws and operating agreements&lt;/li&gt;
&lt;li&gt;Code of conduct templates&lt;/li&gt;
&lt;li&gt;Volunteer agreement forms&lt;/li&gt;
&lt;li&gt;Event planning checklists&lt;/li&gt;
&lt;li&gt;Financial tracking spreadsheets&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;span style=&quot;color: red;&quot;&gt;&lt;i&gt;BWiB BTDT advice: we started with a code of conduct… everything else came much later.&lt;i&gt;&lt;/span&gt;&lt;/p&gt;
&lt;h2&gt;Final Words of Encouragement&lt;/h2&gt;
&lt;p&gt;Starting a Women in Bioinformatics chapter is incredibly rewarding but takes patience and persistence. Your first event might have 5 people - that&amp;#39;s perfectly normal! Focus on providing value to your community, stay consistent with your programming, and be responsive to member needs.&lt;/p&gt;
&lt;p&gt;Remember: every successful chapter started exactly where you are now. The bioinformatics community is generally supportive and collaborative, so don&amp;#39;t hesitate to reach out to established chapters for advice, speaker recommendations, or partnership opportunities.&lt;/p&gt;
&lt;p&gt;The field needs more diverse voices and inclusive spaces. By starting a local chapter, you&amp;#39;re not just building a community - you&amp;#39;re actively changing the landscape of bioinformatics for the better.&lt;/p&gt;
&lt;hr&gt;
&lt;p&gt;&lt;em&gt;This guide is a living document. Please share your experiences and lessons learned to help future chapters succeed!&lt;/em&gt;&lt;/p&gt;
</content:encoded><category>computational-biology</category><category>bioinformatics</category><category>women-in-science</category><category>group-organization</category></item><item><title>The Five Signals That Scream &apos;Hire a Data Person Now&apos;</title><link>https://boston-wib.org/blog/quicktake/five-signals-hire-data-person-now</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/five-signals-hire-data-person-now</guid><pubDate>Thu, 04 Dec 2025 13:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-12-04-five-signals-hire-data-person-now.png&quot; alt=&quot;The five signals checklist: excel limit errors, missing data, analysis bottlenecks, untrustworthy numbers, and duplicated work discoveries&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;A gentle guide for realizing when it&apos;s time to call in the data experts and which one you actually need.&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>biotech</category><category>hiring</category><category>data-science</category><category>data-engineering</category></item><item><title>Tuesday Tactics: The &quot;No Jira Ticket&quot; Rule</title><link>https://boston-wib.org/blog/quicktake/tuesday-tactics-no-jira-ticket-rule</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/tuesday-tactics-no-jira-ticket-rule</guid><pubDate>Tue, 02 Dec 2025 13:00:00 GMT</pubDate><content:encoded/><category>data-engineering</category><category>self-service</category><category>bioinformatics</category><category>data-democratization</category><category>tuesday-tactics</category></item><item><title>Nobody Owns the Data Warehouse (And That&apos;s Why It&apos;s Broken)</title><link>https://boston-wib.org/blog/quicktake/nobody-owns-the-data-warehouse</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/nobody-owns-the-data-warehouse</guid><pubDate>Mon, 01 Dec 2025 13:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-12-01-nobody-owns-the-data-warehouse.png&quot; alt=&quot;Ownership vacuum diagram for data warehouse ownership shows a data warehouse with no clear owner, leading to chaos and inefficiency.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;A data warehouse only works when someone truly stewards it, or it quickly becomes a bottleneck.&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>data-science</category><category>biotech</category><category>data-engineering</category><category>data-governance</category></item><item><title>Thanksgiving in Biotech: A Survival Guide</title><link>https://boston-wib.org/blog/quicktake/thanksgiving-in-biotech-a-survival-guide</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/thanksgiving-in-biotech-a-survival-guide</guid><pubDate>Thu, 27 Nov 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-11-27-thanksgiving-in-biotech.png&quot; alt=&quot;A bioinformatics pipeline with a side of pie&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;This season, the polar plunge in cross‑functional communication isn’t just survivable, it’s a chance to thrive.&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>thanksgiving</category><category>biotech</category><category>data-science</category><category>bioinformatics</category></item><item><title>Tuesday Tactics: Your First Data Hire Signal</title><link>https://boston-wib.org/blog/quicktake/tuesday-tactics-your-first-data-hire-signal</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/tuesday-tactics-your-first-data-hire-signal</guid><pubDate>Tue, 25 Nov 2025 00:00:00 GMT</pubDate><content:encoded/><category>data-engineering</category><category>biotech</category><category>hiring</category><category>scaleup</category><category>tuesday-tactics</category></item><item><title>The Explicit Out-of-Scope Section: My Secret Weapon for Project Trust</title><link>https://boston-wib.org/blog/quicktake/explicit-out-of-scope-section</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/explicit-out-of-scope-section</guid><pubDate>Mon, 24 Nov 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-11-24-explicit-out-of-scope-section.png&quot; alt=&quot;Project requirements document with an ‘Out of Scope’ section. Checklist shows PowerPoint export, real‑time data updates, and mobile responsive design marked as not in scope, with a note that these items are expected in Phase 2.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Transparency isn&apos;t just about what&apos;s included, it&apos;s about naming what&apos;s out of scope.&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>software-engineering</category><category>project-management</category><category>data-science</category><category>biotech</category><category>leadership</category></item><item><title>What Your Favorite Biobank Says About You</title><link>https://boston-wib.org/blog/quicktake/what-your-favorite-biobank-says-about-you</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/what-your-favorite-biobank-says-about-you</guid><pubDate>Fri, 21 Nov 2025 12:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/minator-yang-oJjfBo_0R_8-unsplash.jpg&quot; alt=&quot;What Your Favorite Biobank Says About You&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;We asked 500,000 researchers which biobank they stan, and the results were… scientifically significant.&lt;/em&gt;&lt;/p&gt;

&lt;Quiz title={quizData.title} questions={quizData.questions} results={quizData.results} /&gt;

&lt;p&gt;&lt;strong&gt;Disclaimer&lt;/strong&gt;: &lt;em&gt;This is satire. All biobanks are incredible resources that have advanced science immeasurably. The author uses multiple biobanks and has deep respect for all of them.&lt;/em&gt;&lt;/p&gt;
</content:encoded><category>biobanks</category><category>genetics-research</category><category>genomics-humour</category><category>science-quiz</category><category>population-genetics</category><category>human-genetics</category><category>uk-biobank</category><category>all-of-us</category><category>gnomad</category><category>finngen</category><category>biobank-japan</category><category>genes-and-health</category></item><item><title>AI vs. AI Agents in Healthcare: Not the Same Thing!</title><link>https://boston-wib.org/blog/quicktake/ai-vs-ai-agents-in-healthcare-not-the-same-thing</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/ai-vs-ai-agents-in-healthcare-not-the-same-thing</guid><pubDate>Fri, 21 Nov 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-11-21-ai-vs-ai-agents.jpeg&quot; alt=&quot;Split graphic comparing &quot;AI in healthcare&quot; on the left with &quot;AI agents in healthcare&quot; on the right.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;The distinction between traditional AI and AI agents is crucial for understanding their impact on healthcare.&lt;/em&gt;&lt;/p&gt;

&lt;details class=&quot;image-description&quot;&gt;
&lt;summary&gt;Text description of graphic&lt;/summary&gt;

&lt;p&gt;&lt;strong&gt;AI vs. AI Agents in Healthcare: Not the Same Thing!&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;p&gt;Left side: AI in healthcare are depicted as assistants providing assistive intelligence. Examples include predicting patient no-shows, flagging patterns from wearables, and answering FAQs via chatbot. Human interpretation and action are still required.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Right side: AI agents are depicted as assistants with &amp;quot;hands to get things done,&amp;quot; providing operational intelligence. Examples include automating data-to-system workflows, insurance verification, and patient file updates. AI agents perform multi-step tasks with minimal human intervention.&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/details&gt;

&lt;p&gt;Following up on my last post about the growing role of AI Agents in Healthcare, I wanted to draw a sharp distinction: AI vs. AI Agents; they are not the same thing!&lt;/p&gt;
&lt;p&gt;Lately, everyone’s been talking about &amp;quot;AI in healthcare&amp;quot; like it’s one big monolithic thing. But honestly, there’s a huge difference between general AI tools and AI agents, and it matters a lot for clinical workflows.&lt;/p&gt;
&lt;h2&gt;AI in healthcare (in general):&lt;/h2&gt;
&lt;p&gt;Think of this as assistive intelligence that is usually used for:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;p&gt;Predicting which patients might miss their appointments&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Flagging abnormal patterns in wearable data&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;A chatbot that answers common insurance or billing questions&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;It’s powerful, but it still depends heavily on clinicians to interpret and act!&lt;/strong&gt;&lt;/p&gt;
&lt;h2&gt;AI Agents in healthcare:&lt;/h2&gt;
&lt;p&gt;These are like the overachievers. They don’t just predict, they operate.
Agents take in data, make decisions, and execute multi-step tasks with minimal human intervention.&lt;/p&gt;
&lt;p&gt;For example:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;p&gt;An agent can automatically pull data from multiple sources, populate forms, and update the internal system eliminates the need for manual clicks.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;A tool can track insurance eligibility, verify coverage, and seamlessly update the patient file in real time.&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Basically:&lt;/p&gt;
&lt;p&gt;AI = smart assistant&lt;/p&gt;
&lt;p&gt;AI Agents = smart assistant + hands to get things done&lt;/p&gt;
&lt;h2&gt;Why it matters👇🏽&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Healthcare doesn’t just need predictions; it also needs workflows that actually move.&lt;/li&gt;
&lt;li&gt;AI agents can tackle the admin overload that burns out clinicians, while still keeping humans in the loop for real judgment calls.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;We’re entering the era where hospitals won’t just ask: &amp;quot;Do we have AI?&amp;quot;
They’ll ask: &amp;quot;Do we have agents that can actually &lt;strong&gt;close the loop&lt;/strong&gt;?&amp;quot;&lt;/p&gt;
&lt;p&gt;💠 Curious how this evolves in 2026 and beyond clinically, operationally, and ethically!&lt;/p&gt;
</content:encoded><category>clinical-data</category><category>artificial-intelligence</category><category>ai</category><category>ai-agents</category><category>data-science</category><category>health-informatics</category><category>health-tech</category></item><item><title>I Fixed the Same Bug Three Times (No One Noticed)</title><link>https://boston-wib.org/blog/quicktake/i-fixed-the-same-bug-three-times</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/i-fixed-the-same-bug-three-times</guid><pubDate>Thu, 20 Nov 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-11-20-i-fixed-the-same-bug-three-times.png&quot; alt=&quot;Infographic showing a repeating life cycle: a bug occurs, a fix is applied, the solution is documented, turnover happens, documentation is forgotten, and the cycle repeats.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

</content:encoded><category>biotech</category><category>data-engineering</category><category>bioinformatics</category><category>technical-debt</category><category>software-engineering</category></item><item><title>Tuesday Tactics: The 3-Question Requirements Filter</title><link>https://boston-wib.org/blog/quicktake/tuesday-tactics-3-question-requirements-filter</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/tuesday-tactics-3-question-requirements-filter</guid><pubDate>Tue, 18 Nov 2025 00:00:00 GMT</pubDate><content:encoded/><category>data-science</category><category>data-engineering</category><category>bioinformatics</category><category>product-management</category><category>biotech</category><category>tuesday-tactics</category></item><item><title>Work Life Decoded: Understanding Your Manager&apos;s World</title><link>https://boston-wib.org/blog/work-life-decoded/understanding-your-managers-world</link><guid isPermaLink="true">https://boston-wib.org/blog/work-life-decoded/understanding-your-managers-world</guid><pubDate>Tue, 18 Nov 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/Work-Life-Decoded.png&quot; alt=&quot;Work Life Decoded Video Series Thumbnail&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;How to Building a Strategic Partnership with Your Boss&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Your manager isn’t your boss. They’re your business partner.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Most people see their manager as someone who assigns tasks and approves time off. But here&amp;#39;s what changes everything: your manager is fighting for budget in meetings you&amp;#39;ll never attend, juggling competing priorities from other departments, and trying to figure out who gets that one promotion slot when three people want it.&lt;/p&gt;
&lt;p&gt;Once you understand their world, you can position yourself strategically.&lt;/p&gt;
&lt;p&gt;Instead of: &amp;quot;I finished the project&amp;quot; Try: &amp;quot;I finished the project two days early, which means we can move the client presentation up if needed, and I documented the process for the team&amp;quot;&lt;/p&gt;
&lt;p&gt;You&amp;#39;re giving them options and solutions, not just updates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The strategic question most people never ask:&lt;/strong&gt; &amp;quot;What does success look like for YOU this quarter?&amp;quot;&lt;/p&gt;
&lt;p&gt;Not the team – for them personally. Maybe they&amp;#39;re trying to improve retention, hit a revenue target, or launch something new. Once you know their goals, you can align your work to support them.&lt;/p&gt;
&lt;p&gt;This isn&amp;#39;t about sucking up. It&amp;#39;s about understanding that your manager has the authority to open doors for you – but they can&amp;#39;t read your mind, and they&amp;#39;re dealing with constraints you might not see.&lt;/p&gt;
&lt;p&gt;Watch the full video where Lorena and I break down:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Understanding the hidden pressures your manager faces&lt;/li&gt;
&lt;li&gt;Communication strategies that actually work&lt;/li&gt;
&lt;li&gt;How to build trust and advocate for yourself strategically&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Watch the full episode &lt;a href=&quot;https://www.patreon.com/posts/new-video-your-143734360&quot;&gt;Work Life Decoded: Understanding Your Manager&amp;#39;s World)&lt;/a&gt; on Patreon.&lt;/p&gt;
</content:encoded><category>career-advice</category><category>career-development</category><category>manager-relationships</category><category>workplace-strategy</category><category>professional-growth</category><category>work-life-decoded</category></item><item><title>Write the Test First (Even for Your Science)</title><link>https://boston-wib.org/blog/quicktake/write-the-test-first-even-for-your-science</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/write-the-test-first-even-for-your-science</guid><pubDate>Mon, 17 Nov 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-11-17-write-the-test-first-even-for-your-science.png&quot; alt=&quot;Side‑by‑side flowcharts comparing traditional vs. test‑driven development (TDD) approaches.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Why data scientists should design error messages that guide users straight to solutions.&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>bioinformatics</category><category>data-engineering</category><category>biotech</category><category>computational-biology</category><category>data-science</category></item><item><title>Data Pipelines That Scientists Can Debug (Without Calling You at 9 PM)</title><link>https://boston-wib.org/blog/quicktake/data-pipelines-that-scientists-can-debug-without-calling-you-at-9-pm</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/data-pipelines-that-scientists-can-debug-without-calling-you-at-9-pm</guid><pubDate>Thu, 13 Nov 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-11-13-data-pipelines-that-scientists-can-debug-without-calling-you-at-9-pm.png&quot; alt=&quot;Clear error messages empowers scientists to solve challenges fast and independently; cryptic messages stall scientific process by often requiring data scientist intervention&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Why data scientists should design error messages that guide users straight to solutions.&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>bioinformatics</category><category>data-engineering</category><category>biotech</category><category>computational-biology</category><category>data-science</category></item><item><title>AI Agents Are Quietly Redefining Healthcare</title><link>https://boston-wib.org/blog/quicktake/ai-agents-redefining-healthcare</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/ai-agents-redefining-healthcare</guid><pubDate>Wed, 12 Nov 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/julien-tromeur-FYOwBvRb2Mk-unsplash.jpg&quot; alt=&quot;A futuristic female robot floats mid‑air, holding a metallic brain like a basketball, poised to shoot.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;AI agents turn healthcare data into real‑time insights, evolving with patients and providers.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Healthcare is full of data 👉🏻 labs, imaging, EHRs, vitals, even wearables, and yet most systems only react to it.&lt;/p&gt;
&lt;p&gt;🤖AI agents are changing that!
They’re designed to monitor, reason, and respond. They keep learning continuously as new data streams in.&lt;/p&gt;
&lt;p&gt;🔸 Imagine systems that track patient vitals in real time, adjust medication alerts based on patterns, or surface the right information to clinicians before they even ask.&lt;/p&gt;
&lt;p&gt;That’s not a futuristic dream anymore! It’s happening in early prototypes across hospitals and research networks.&lt;/p&gt;
&lt;p&gt;The real shift isn’t just about smarter tech now, it’s about moving from static tools to adaptive systems that evolve alongside patients and providers.&lt;/p&gt;
&lt;p&gt;💬 I’d love to hear your thoughts. Where do you think AI agents will make the biggest impact first: patient monitoring, drug discovery, or clinical decision support?&lt;/p&gt;
</content:encoded><category>ai-in-healthcare</category><category>data-science</category><category>healthcare-innovation</category><category>machine-learning</category><category>precision-medicine</category><category>digital-health</category></item><item><title>A Coffee with CompBio: Collaboration Survival Guide for CompBio</title><link>https://boston-wib.org/blog/coffeewithcompbio/s1-e10</link><guid isPermaLink="true">https://boston-wib.org/blog/coffeewithcompbio/s1-e10</guid><pubDate>Tue, 11 Nov 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/coffeeWithCompBio_logo.jpeg&quot; alt=&quot;Coffee with CompBio Podcast Logo: Stylized orange and blue DNA double helix&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Unpacking the messy truths behind scientific teamwork.&lt;/em&gt;&lt;/p&gt;

&lt;iframe
  name=&quot;Ausha Podcast Player&quot;
  frameborder=&quot;0&quot;
  loading=&quot;lazy&quot;
  id=&quot;ausha-58eB&quot;
  height=&quot;220&quot;
  style=&quot;border: none; width:100%; height:220px&quot;
  src=&quot;https://player.ausha.co/?podcastId=BMnZZefmJRPo&amp;playlist=false&amp;color=%23751cbf&amp;v=3&amp;playerId=ausha-58eB&quot;
&gt;&lt;/iframe&gt;
&lt;script src=&quot;https://player.ausha.co/ausha-player.js&quot;&gt;&lt;/script&gt;

&lt;p&gt;What really happens when a wet lab scientist and a computational biologist sit down to plan an experiment? Spoiler: it&amp;#39;s not always smooth sailing. In this episode of &amp;#39;A Coffee with Compbio,&amp;#39; Lorena Pantano and Alex Bartlett chat with Amulya about the real talk nobody tells you about scientific collaborations.&lt;/p&gt;
&lt;p&gt;They break down the three make-or-break moments of any project: that first meeting where you&amp;#39;re figuring out if single-cell sequencing on mouse eyes is actually the move (hint: maybe start simpler), the data processing stage where quality issues rear their ugly head, and those uncomfortable conversations when results don&amp;#39;t pan out.&lt;/p&gt;
&lt;p&gt;What you&amp;#39;ll learn:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;How to redirect overambitious project plans without shutting people down&lt;/li&gt;
&lt;li&gt;Smart ways to communicate technology limitations early&lt;/li&gt;
&lt;li&gt;What to say when pilot data quality is... not great&lt;/li&gt;
&lt;li&gt;Why being adaptable beats being rigid every single time&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;If you want to level up your collaboration game and avoid common pitfalls, grab your coffee and tune in.&lt;/p&gt;
&lt;p&gt;Thanks &lt;a href=&quot;https://www.linkedin.com/in/amulya-shastry/&quot;&gt;Amulya Shastr&lt;/a&gt; for editing and management support.&lt;/p&gt;
&lt;p&gt;Send us your comments, questions, and suggestions using &lt;a href=&quot;https://forms.gle/ncwo6HZeN4uA9gPg7&quot;&gt;this form&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Listen to this podcast on other platforms:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaApple size={32} className=&quot;text-black dark:text-white&quot; /&gt;{&amp;#39; &amp;#39;}&lt;/p&gt;
  &lt;a href=&quot;https://podcasts.apple.com/us/podcast/a-coffee-with-saranya-canchi-fail-learn/id1817024741?i=1000731855297&amp;uo=4&quot;&gt;
    Apple
  &lt;/a&gt;
&lt;/span&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaSpotify size={32} color=&quot;#1DB954&quot; /&gt;{&amp;#39; &amp;#39;}
  &lt;a href=&quot;https://open.spotify.com/episode/2Sx6bFIOAt3x32o5IvySAu&quot;&gt;Spotify&lt;/a&gt;&lt;/p&gt;
&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;We are looking for sponsors! Please get in touch if you or your business would like to help support this podcast.&lt;/p&gt;
&lt;p&gt;Follow &lt;a href=&quot;https://www.linkedin.com/in/lpantano&quot;&gt;Lorena&lt;/a&gt; and &lt;a href=&quot;https://www.linkedin.com/in/alexandra-bartlett-926b32109&quot;&gt;Alex&lt;/a&gt; on LinkedIn!&lt;/p&gt;
&lt;p&gt;If you enjoyed the episode, &lt;a href=&quot;https://podcast.ausha.co/a-coffee-with-compbio&quot;&gt;please subscribe and leave a review!&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Hosted by Ausha. See &lt;a href=&quot;https://ausha.co/privacy-policy&quot;&gt;ausha.co/privacy-policy&lt;/a&gt; for more information&lt;/p&gt;
</content:encoded><category>science-communication</category><category>neuroinformatics</category><category>self-directed-learning</category><category>bioinformatics</category><category>problem-solving-skills</category></item><item><title>Work Life Decoded: How to Handle Workplace Negativity (Without Becoming the Office Therapist)</title><link>https://boston-wib.org/blog/work-life-decoded/how-to-handle-workplace-negativity</link><guid isPermaLink="true">https://boston-wib.org/blog/work-life-decoded/how-to-handle-workplace-negativity</guid><pubDate>Tue, 11 Nov 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/Work-Life-Decoded.png&quot; alt=&quot;Work Life Decoded Video Series Thumbnail&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Practical strategies for handling workplace negativity.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;“That person is SO incompetent.” “This project is a disaster.” “Management has no idea what they’re doing.”&lt;/p&gt;
&lt;p&gt;We’ve all worked with chronic complainers. And if you’re in any kind of leadership role—formal or informal—you’ve probably felt the pressure to fix everyone’s frustrations.&lt;/p&gt;
&lt;p&gt;Here’s what I wish someone had told me 10 years ago: You’re not the office therapist.&lt;/p&gt;
&lt;p&gt;In the latest episode of &amp;quot;Work Life Decoded&amp;quot;, Lorena and Lina break down practical strategies for handling workplace negativity without getting dragged into the drama or becoming everyone’s emotional dumping ground.&lt;/p&gt;
&lt;p&gt;You’ll learn:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;The 3-step framework for redirecting chronic complainers toward action&lt;/li&gt;
&lt;li&gt;How to separate legitimate work issues from gossip and personal drama&lt;/li&gt;
&lt;li&gt;The exact language to transform negative conversations in real-time&lt;/li&gt;
&lt;li&gt;Boundary-setting strategies that protect your own mental space&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;This isn’t about toxic positivity or pretending problems don’t exist. It’s about knowing when negativity is productive and when it’s destructive—and having the tools to navigate both.&lt;/p&gt;
&lt;p&gt;Because negativity is contagious. But so is the solution-focused energy you bring to your team.&lt;/p&gt;
&lt;p&gt;Watch the full episode &lt;a href=&quot;https://www.patreon.com/posts/143081096&quot;&gt;Work Life Decoded: How to Handle Workplace Negativity (Without Becoming the Office Therapist)&lt;/a&gt; on Patreon.&lt;/p&gt;
</content:encoded><category>career-advice</category><category>professional-development</category><category>leadership-skills</category><category>workplace-culture</category><category>work-life-decoded</category></item><item><title>The Strategic Value of Simple Solutions</title><link>https://boston-wib.org/blog/quicktake/value-of-simple-solutions</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/value-of-simple-solutions</guid><pubDate>Mon, 10 Nov 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-11-10-the-strategic-value-of-simple-solutions.png&quot; alt=&quot;The graph shows how simple solutions work best at small scales, but as your system grows, there’s a tipping point where leveling up to a more complex setup becomes worth the effort.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;The best solution is the one that solves today&apos;s problem without creating tomorrow&apos;s.&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>software-engineering</category><category>data-engineering</category><category>technical-leadership</category><category>biotech</category><category>data-infrastructure</category></item><item><title>Data Stewards vs. Data Scientists</title><link>https://boston-wib.org/blog/quicktake/data-stewards-vs-data-scientists</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/data-stewards-vs-data-scientists</guid><pubDate>Mon, 03 Nov 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-11-03-data-stewards-vs-data-scientists.png&quot; alt=&quot;You need both data stewards and data scientists.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Why Biotech Needs Both (But Hires Only One)&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>data-science</category><category>biotech</category><category>data-infrastructure</category><category>biotech-leadership</category></item><item><title>Why Data Layer Guardrails are Key to Scalable Self-Service</title><link>https://boston-wib.org/blog/quicktake/data-layer-guardrails-are-key-to-scalable-self-service</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/data-layer-guardrails-are-key-to-scalable-self-service</guid><pubDate>Fri, 31 Oct 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-10-30-data-layer-guardrails-are-the-key-to-scalable-self-service.png&quot; alt=&quot;A shift from separated security for each application to centralized access management&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;What is the best kind of security for your data?&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>data-engineering</category><category>bioinformatics</category><category>data-quality</category><category>biotech</category><category>self-service</category></item><item><title>Work Life Decoded: Series Introduction</title><link>https://boston-wib.org/blog/work-life-decoded/welcome-to-the-series</link><guid isPermaLink="true">https://boston-wib.org/blog/work-life-decoded/welcome-to-the-series</guid><pubDate>Tue, 28 Oct 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/Work-Life-Decoded.png&quot; alt=&quot;Work Life Decoded Video Series Thumbnail&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;A new video series that cuts through the myths with real talk on science and careers.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Women in Bioinformatics chairs Lorena and Lina introduce their new video series tackling the career challenges that keep bioinformatics professionals up at night. With over 40 years of combined experience at Harvard, eGenesis, Ginkgo Bioworks, and other leading organizations, they&amp;#39;re sharing the honest conversations about workplace dynamics, negotiation, and career navigation that they wish they&amp;#39;d had access to earlier in their careers.&lt;/p&gt;
&lt;p&gt;Should you stay at your current job or make a move? How do you handle a colleague taking credit for your work? When should you speak up, and when should you let it go? Each video in the series addresses a specific challenge with practical, actionable advice from leaders who&amp;#39;ve been there.&lt;/p&gt;
&lt;p&gt;This isn&amp;#39;t about motivational platitudes—it&amp;#39;s real talk about career strategy, advocating for yourself, supporting other minorities in the workplace, and managing the daily realities of working in science.&lt;/p&gt;
&lt;p&gt;If you&amp;#39;re ready for honest insights on building the career you want, join the Patreon video series, &amp;quot;Work Life Decoded.&amp;quot; The first episode &lt;a href=&quot;http://patreon.com/posts/welcome-to-work-142167916&quot;&gt;Welcome to Work Life Decoded&lt;/a&gt; is available now!&lt;/p&gt;
</content:encoded><category>early-career</category><category>career-advice</category><category>professional-development</category><category>workplace-culture</category><category>work-life-decoded</category></item><item><title>Work Life Decoded: Your Weekly Win Log – The Career Tool You Didn&apos;t Know You Needed</title><link>https://boston-wib.org/blog/work-life-decoded/your-weekly-win-log</link><guid isPermaLink="true">https://boston-wib.org/blog/work-life-decoded/your-weekly-win-log</guid><pubDate>Tue, 28 Oct 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/Work-Life-Decoded.png&quot; alt=&quot;Work Life Decoded Video Series Thumbnail&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Why you should keep a weekly log of your wins (and how it can transform your career)&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;In their latest video, Women in Bioinformatics chairs Lorena and Lina tackle a deceptively simple practice that can transform your career: keeping a weekly log of your wins and quantifying your achievements.&lt;/p&gt;
&lt;p&gt;This isn&amp;#39;t just about surviving performance review season—though it absolutely makes that easier. A weekly win log is your defense against imposter syndrome, your foundation for a compelling resume, and your ammunition when it&amp;#39;s time to negotiate for what you deserve. The commitment? Just five minutes every Friday.&lt;/p&gt;
&lt;p&gt;The video walks through why this habit matters, what to track, and how to quantify your impact with concrete before/after examples. No more scrambling six months later trying to remember what you accomplished. No more underselling yourself because you forgot the numbers that prove your value.&lt;/p&gt;
&lt;p&gt;Lorena and Lina created a one-page Quick Reference Guide with practical setup tips and real examples—available exclusively to Patreon supporters along with the full video.
Ready to build this career-changing habit? Watch &lt;a href=&quot;https://www.patreon.com/posts/your-weekly-win-142523683&quot;&gt;Your Weekly Win Log – The Career Tool You Didn&amp;#39;t Know You Needed&lt;/a&gt; on Patreon and start logging this Friday. Set that 5-minute calendar reminder right now—your future self will thank you.&lt;/p&gt;
</content:encoded><category>career-advice</category><category>achievement-tracking</category><category>career-strategy</category><category>professional-development</category><category>productivity-tools</category><category>work-life-decoded</category></item><item><title>The Success Criteria Question: Why I Don&apos;t Start with Requirements</title><link>https://boston-wib.org/blog/quicktake/success-criteria-requirements</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/success-criteria-requirements</guid><pubDate>Mon, 27 Oct 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-10-27-the-success-criteria-question-why-i-dont-start-with-requirements.png&quot; alt=&quot;Bridge diagram showing transformation from vague request &quot;I wish we had...&quot; to specific solution &quot;Quality trend monitoring dashboard&quot; through clarifying questions.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;The difference between requirements and success criteria&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>bioinformatics</category><category>data-science</category><category>biotech</category><category>product-thinking</category><category>data-infrastructure</category></item><item><title>Why Every Series A Biotech Hits the Same Data Wall</title><link>https://boston-wib.org/blog/quicktake/why-every-series-a-biotech-hits-the-same-data-wall</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/why-every-series-a-biotech-hits-the-same-data-wall</guid><pubDate>Thu, 23 Oct 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-10-23-why-every-series-a-biotech-hits-the-same-data-wall.png&quot; alt=&quot;Comic titled &quot;The Data Bottleneck Funnel&quot; with two panels. First panel &quot;Before Series A&quot;: small team, one person easily handling work. Second panel: larger team, same person overwhelmed managing everything.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Avoid the common data pitfalls that slow down growing biotech companies.&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>biotech</category><category>data-science</category><category>series-a</category><category>data-infrastructure</category><category>startups</category></item><item><title>Your Proof-of-Concept Is Not A Platform</title><link>https://boston-wib.org/blog/quicktake/your-poc-is-not-a-platform</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/your-poc-is-not-a-platform</guid><pubDate>Mon, 20 Oct 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-10-20-your-poc-is-not-a-platform.png&quot; alt=&quot;Proof of Concept evolution timeline: simple cube (Month 1) progressively gains features becoming increasingly complex until transforming into a monster &quot;platform&quot; by Month 12&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;The best Proof-of-Concepts teach you what to build next and then get retired.&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>data-engineering</category><category>technical-debt</category><category>software-architecture</category><category>software-engineering</category><category>data-infrastructure</category></item><item><title>The Universal Pattern Behind Scalable Data Systems</title><link>https://boston-wib.org/blog/quicktake/he-universal-pattern-behind-scalable-data-systems</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/he-universal-pattern-behind-scalable-data-systems</guid><pubDate>Thu, 16 Oct 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-10-16-the-universal-pattern-behind-scalable-data.png&quot; alt=&quot;&quot;The Universal Pattern&quot; diagram showing three parallel workflows (Genomics, Clinical Trials, and Generic Industry) each following the same structure: diverse data sources flow through a data pipeline to produce reports/dashboards for domain experts.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

</content:encoded><category>data-engineering</category><category>bioinformatics</category><category>career-transition</category></item><item><title>A Coffee with CompBio: Fail, learn, repeat, the bioinformatics way!</title><link>https://boston-wib.org/blog/coffeewithcompbio/s1-e9</link><guid isPermaLink="true">https://boston-wib.org/blog/coffeewithcompbio/s1-e9</guid><pubDate>Tue, 14 Oct 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/coffeeWithCompBio_logo.jpeg&quot; alt=&quot;Coffee with CompBio Podcast Logo: Stylized orange and blue DNA double helix&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;A coffee with Saranya Canchi&lt;/em&gt;&lt;/p&gt;

&lt;iframe
  name=&quot;Ausha Podcast Player&quot;
  frameborder=&quot;0&quot;
  loading=&quot;lazy&quot;
  id=&quot;ausha-NiRp&quot;
  height=&quot;220&quot;
  style=&quot;border: none; width:100%; height:220px&quot;
  src=&quot;https://player.ausha.co/?podcastId=6w55XHmZ1JAq&amp;playlist=false&amp;color=%23751cbf&amp;v=3&amp;playerId=ausha-NiRp&quot;
&gt;&lt;/iframe&gt;
&lt;script src=&quot;https://player.ausha.co/ausha-player.js&quot;&gt;&lt;/script&gt;

&lt;p&gt;Grab your coffee and join us for another episode of &lt;strong&gt;A Coffee with Comp Bio&lt;/strong&gt;!&lt;/p&gt;
&lt;p&gt;In this episode of A Coffee with Comp Bio, hosts Alex Bartlett and Lorena Pantano sit down with Saranya Canchi, a computational biologist specializing in neuroscience. Together, they explore how to thrive as a self-directed learner in bioinformatics—tackling early challenges, learning through projects, and building problem-solving resilience. Saranya shares her journey as a self-taught bioinformatician, highlighting the importance of mastering the field’s unique language and embracing failure as part of growth. Whether you’re just starting out or looking to strengthen your learning approach, this conversation offers practical insights and inspiration for your bioinformatics journey.&lt;/p&gt;
&lt;p&gt;Saranya&amp;#39;s webpage: &lt;a href=&quot;https://s-canchi.github.io/&quot;&gt;https://s-canchi.github.io/&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Send us your comments, questions, and suggestions using &lt;a href=&quot;https://forms.gle/ncwo6HZeN4uA9gPg7&quot;&gt;this form&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Thanks &lt;a href=&quot;https://www.linkedin.com/in/amulya-shastry/&quot;&gt;Amulya Shastr&lt;/a&gt; for editing and management support.&lt;/p&gt;
&lt;p&gt;Listen to this podcast on other platforms:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaApple size={32} className=&quot;text-black dark:text-white&quot; /&gt;{&amp;#39; &amp;#39;}&lt;/p&gt;
  &lt;a href=&quot;https://podcasts.apple.com/us/podcast/a-coffee-with-saranya-canchi-fail-learn/id1817024741?i=1000731855297&amp;uo=4&quot;&gt;
    Apple
  &lt;/a&gt;
&lt;/span&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaSpotify size={32} color=&quot;#1DB954&quot; /&gt;{&amp;#39; &amp;#39;}
  &lt;a href=&quot;https://open.spotify.com/episode/2Sx6bFIOAt3x32o5IvySAu&quot;&gt;Spotify&lt;/a&gt;&lt;/p&gt;
&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;We are looking for sponsors! Please get in touch if you or your business would like to help support this podcast.&lt;/p&gt;
&lt;p&gt;Follow &lt;a href=&quot;https://www.linkedin.com/in/lpantano&quot;&gt;Lorena&lt;/a&gt; and &lt;a href=&quot;https://www.linkedin.com/in/alexandra-bartlett-926b32109&quot;&gt;Alex&lt;/a&gt; on LinkedIn!&lt;/p&gt;
&lt;p&gt;If you enjoyed the episode, &lt;a href=&quot;https://podcast.ausha.co/a-coffee-with-compbio&quot;&gt;please subscribe and leave a review!&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Hosted by Ausha. See &lt;a href=&quot;https://ausha.co/privacy-policy&quot;&gt;ausha.co/privacy-policy&lt;/a&gt; for more information&lt;/p&gt;
</content:encoded><category>science-communication</category><category>neuroinformatics</category><category>self-directed-learning</category><category>bioinformatics</category><category>problem-solving-skills</category></item><item><title>The Sphere of Inluence in Project Management</title><link>https://boston-wib.org/blog/quicktake/the-sphere-of-influence-in-project-management</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/the-sphere-of-influence-in-project-management</guid><pubDate>Thu, 09 Oct 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-10-08-sphere-of-influence-in-project-management.png&quot; alt=&quot;Concentric circles diagram showing spheres of control: innermost &quot;Direct Control&quot; (team and decisions), middle &quot;Influence&quot; (stakeholders, dependencies, resources), outermost &quot;Concern&quot; (market and executive decisions)&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;How to focus your energy to actually make a difference&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>project-management</category><category>sphere-of-influence</category><category>leadership</category></item><item><title>Precision Medicine, Through One Lens</title><link>https://boston-wib.org/blog/quicktake/precusion-medicine-through-one-lens</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/precusion-medicine-through-one-lens</guid><pubDate>Wed, 08 Oct 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://images.unsplash.com/photo-1647650549984-3d3bca4b15f3?q=80&amp;w=880&amp;auto=format&amp;fit=crop&amp;ixlib=rb-4.1.0&amp;ixid=M3wxMjA3fDB8MHxwaG90by1wYWdlfHx8fGVufDB8fHx8fA%3D%3D&quot; alt=&quot;Image of a camera lens&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Bridging Biology with Clinical Insight&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Precision medicine is more than just a buzzword; it’s a shift in how we understand, treat, and even predict disease. Instead of using a “one-size-fits-all” approach, it uses data from a patient’s genome to their clinical history to design treatments that are tailored to them.&lt;/p&gt;
&lt;p&gt;✨ While precision medicine is a broad field that also involves environmental, lifestyle, and behavioral factors, biological and clinical data play a key role in shaping how we design targeted treatments and predict patient outcomes.&lt;/p&gt;
&lt;p&gt;#PrecisionMedicine is about asking deeper questions:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Why do two patients with the same diagnosis respond differently to the same therapy?&lt;/li&gt;
&lt;li&gt;Which genetic signatures predict how someone will react to a drug?&lt;/li&gt;
&lt;li&gt;How can we use molecular data to catch diseases before they appear in symptoms?&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;And the answer lies in #DataScience. By integrating molecular datasets (like gene expression or mutations) with clinical data (like outcomes or lab results), precision medicine connects the “why” from biology with the “what” from patient care.&lt;/p&gt;
&lt;p&gt;This isn’t the future; it’s already reshaping how we diagnose, treat, and prevent disease.&lt;/p&gt;
&lt;p&gt;💡 Want to see how clinical insights and biological data come together in practice?&lt;/p&gt;
&lt;p&gt;👉🏻 I dive deeper into this idea of this integration with a real-world example &lt;a href=&quot;https://medium.com/@riyadua99/precision-medicine-where-biology-meets-clinical-insights-c14b707f1195&quot;&gt;here&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;💭 Beyond biological and clinical data, which datasets do you believe could further advance the precision medicine revolution?&lt;/p&gt;
</content:encoded><category>bioinformatics</category><category>clinical-informatics</category><category>data-science</category><category>genomics</category><category>healthcare-data</category><category>precision-medicine</category><category>computational-biology</category><category>patient-care</category></item><item><title>Multi-Source Data Integration</title><link>https://boston-wib.org/blog/quicktake/multi-source-data-integration</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/multi-source-data-integration</guid><pubDate>Tue, 07 Oct 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-10-06-multi-source-data-integration.png&quot; alt=&quot;Medallion Architecture diagram showing data refinement progression: Bronze layer (raw, messy), Silver layer (clean, standardized), Gold layer (trusted, query-ready)&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;From chaos to clarity: How the medallion architecture transforms messy, multi-source data into trustworthy insights.&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>data-engineering</category><category>tech-leadership</category><category>remote-first</category><category>engineering-management</category></item><item><title>Managing Data Engineering Consultants Across 4 Time Zones: What Actually Worked</title><link>https://boston-wib.org/blog/quicktake/managing-data-engineering-consultants-across-four-timezones</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/managing-data-engineering-consultants-across-four-timezones</guid><pubDate>Thu, 02 Oct 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-10-02-managing-data-engineering-consultants-across-4-time-zones.png&quot; alt=&quot;A timeline graphic showing different time zones around the world with icons representing team members working asynchronously.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Strategies to management without real-time communication&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>data-engineering</category><category>tech-leadership</category><category>remote-first</category><category>engineering-management</category></item><item><title>The API Strategy Gap in Research</title><link>https://boston-wib.org/blog/quicktake/the-api-strategy-gap-in-research</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/the-api-strategy-gap-in-research</guid><pubDate>Mon, 29 Sep 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-09-29-api-strategy-gap-in-research.png&quot; alt=&quot;Before/after diagram showing QC App and Data Warehouse evolving from isolated silos to connected systems via a bridge&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Most biotech companies build applications. Few build APIs.&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>software-engineering</category><category>data-engineering</category><category>api-design</category><category>biotech</category><category>data-architecture</category><category>data-integration</category></item><item><title>Change management in small biotech</title><link>https://boston-wib.org/blog/quicktake/change-management-in-small-biotech</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/change-management-in-small-biotech</guid><pubDate>Thu, 25 Sep 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-09-25-change-management-in-small-biotech.png&quot; alt=&quot;A diagram titled &quot;The &apos;Speed Paradox&apos; Timeline&quot; showing a timeline with two lines: &quot;Without Change Management&quot; (frequent crises, high stress) and &quot;With Change Management&quot; (initial slow down, then steady progress).&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;The &quot;slow down to speed up&quot; paradox&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>biotech</category><category>change-management</category><category>data-engineering</category><category>bioinformatics</category><category>scientific-computing</category></item><item><title>A Coffee with CompBio: (Dry) Lab Notebooks</title><link>https://boston-wib.org/blog/coffeewithcompbio/s1-e8</link><guid isPermaLink="true">https://boston-wib.org/blog/coffeewithcompbio/s1-e8</guid><pubDate>Tue, 23 Sep 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/coffeeWithCompBio_logo.jpeg&quot; alt=&quot;Coffee with CompBio Podcast Logo: Stylized orange and blue DNA double helix&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;The Importance of Recordkeeping in CompBio with insights from Amulya Shastry and Lina Faller.&lt;/em&gt;&lt;/p&gt;

&lt;iframe
  name=&quot;Ausha Podcast Player&quot;
  frameborder=&quot;0&quot;
  loading=&quot;lazy&quot;
  id=&quot;ausha-kKTp&quot;
  height=&quot;220&quot;
  style=&quot;border: none; width:100%; height:220px&quot;
  src=&quot;https://player.ausha.co/?podcastId=VLQQwSdNm6Dm&amp;playlist=false&amp;color=%23751cbf&amp;v=3&amp;playerId=ausha-kKTp&quot;
&gt;&lt;/iframe&gt;
&lt;script src=&quot;https://player.ausha.co/ausha-player.js&quot;&gt;&lt;/script&gt;

&lt;p&gt;Grab your coffee and join us for another episode of &lt;strong&gt;A Coffee with Comp Bio&lt;/strong&gt;!&lt;/p&gt;
&lt;p&gt;This time, Alexandra Bartlett and I kick things off with Amulya Shastry, a PhD student at Boston University and co-chair of Boston Women in Bioinformatics, who introduces us to &lt;strong&gt;llmr -- a new Tidyverse-friendly&lt;/strong&gt; tool for connecting with LLMs like ChatGPT, Gemini, and more.&lt;/p&gt;
&lt;p&gt;Then we sit down with Lina L. Faller, Ph.D., a veteran in bioinformatics with nearly two decades of experience bridging software engineering, research, and pharma. Lina shares &lt;strong&gt;why she started blogging about sustainable data systems, leadership in tech, and the very human side of computational biology&lt;/strong&gt;. We dive into one of her favorite topics: &lt;strong&gt;why computational biologists should keep lab notebooks&lt;/strong&gt; (yes, even if your &amp;quot;lab&amp;quot; is just a laptop). From reproducibility to institutional memory to the art of &amp;quot;&lt;strong&gt;forensic bioinformatics&lt;/strong&gt;,&amp;quot; Lina brings stories and advice that will be useful to anyone working with data.&lt;/p&gt;
&lt;p&gt;If you’ve ever forgotten what you coded six months ago (we’ve all been there), or wondered how AI might fit into documentation and knowledge-sharing, this episode is for you.&lt;/p&gt;
&lt;p&gt;Send us your comments, questions, and suggestions using &lt;a href=&quot;https://forms.gle/ncwo6HZeN4uA9gPg7&quot;&gt;this form&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://ellmer.tidyverse.org/articles/ellmer.html&quot;&gt;https://ellmer.tidyverse.org/articles/ellmer.html&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://linafaller.com/&quot;&gt;https://linafaller.com/&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Thanks &lt;a href=&quot;https://www.linkedin.com/in/amulya-shastry/&quot;&gt;Amulya Shastr&lt;/a&gt; for editing and management support.&lt;/p&gt;
&lt;p&gt;Listen to this podcast on other platforms:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaApple size={32} className=&quot;text-black dark:text-white&quot; /&gt;{&amp;#39; &amp;#39;}&lt;/p&gt;
  &lt;a href=&quot;https://podcasts.apple.com/us/podcast/dry-lab-notebooks-the-importance-of-recordkeeping/id1817024741?i=1000728035337&quot;&gt;
    Apple
  &lt;/a&gt;
&lt;/span&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaSpotify size={32} color=&quot;#1DB954&quot; /&gt;{&amp;#39; &amp;#39;}
  &lt;a href=&quot;https://open.spotify.com/episode/4QBmN9kfRkEqmKUimqY9nx?si=231bb176b6be456d&quot;&gt;Spotify&lt;/a&gt;&lt;/p&gt;
&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;We are looking for sponsors! Please get in touch if you or your business would like to help support this podcast.&lt;/p&gt;
&lt;p&gt;Follow &lt;a href=&quot;https://www.linkedin.com/in/lpantano&quot;&gt;Lorena&lt;/a&gt; and &lt;a href=&quot;https://www.linkedin.com/in/alexandra-bartlett-926b32109&quot;&gt;Alex&lt;/a&gt; on LinkedIn!&lt;/p&gt;
&lt;p&gt;If you enjoyed the episode, &lt;a href=&quot;https://podcast.ausha.co/a-coffee-with-compbio&quot;&gt;please subscribe and leave a review!&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Hosted by Ausha. See &lt;a href=&quot;https://ausha.co/privacy-policy&quot;&gt;ausha.co/privacy-policy&lt;/a&gt; for more information&lt;/p&gt;
</content:encoded><category>science-communication</category><category>tidyverse</category><category>sustainable-data-systems</category><category>llms</category><category>ai-in-bioinformatics</category></item><item><title>Force Multiplication Through Simple Solutions</title><link>https://boston-wib.org/blog/quicktake/force-multiplication-through-simple-solutions</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/force-multiplication-through-simple-solutions</guid><pubDate>Mon, 22 Sep 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-09-22-force-multiplication-through-simple-solutions.png&quot; alt=&quot;A victorian-era illustration of a woman on a stage referencing a simple line plot sitting on an easel. In the background, out of the spotlight, there are people working complex machines.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;The best tech solutions aren&apos;t rocket science. They&apos;re force multipliers.&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>data-infrastructure</category><category>bioinformatics</category><category>tech-leadership</category><category>scientific-software</category></item><item><title>Every Quick Fix in Research Code is a Future Investment Decision</title><link>https://boston-wib.org/blog/quicktake/every-quick-fix-in-research-code-is-a-future-investment-decision</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/every-quick-fix-in-research-code-is-a-future-investment-decision</guid><pubDate>Thu, 18 Sep 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-09-18-every-quick-fix-in-research-code-is-a-future-investment-decision.png&quot; alt=&quot;Two panels labeled &quot;Which Team Would You Rather Be On?&quot; Left panel shows a simple machine with a relaxed group of people standing beside it. Right panel shows a complex machine tangled in a mess of wires with a stressed group of people trying to fix it.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;The challenge isn&apos;t eliminating quick fixes—it&apos;s being intentional about which ones you keep and how you document the journey.&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>technical-debt</category><category>software-engineering</category><category>biotech</category><category>knowledge-management</category><category>bioinformatics</category></item><item><title>The Self-Service Data Paradox</title><link>https://boston-wib.org/blog/quicktake/the-self-service-data-paradox</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/the-self-service-data-paradox</guid><pubDate>Mon, 15 Sep 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-09-15-the-self-service-data-paradox.png&quot; alt=&quot;&quot;The Data Democratization Cascade&quot; diagram showing progression from Centralized Data (few questions, long wait time) to Self-Service Tools (more access, more questions) to Collective Intelligence (unexpected discoveries, organizational learning)&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Why do good tools create more questions than they answer?&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>data-democratization</category><category>bioinformatics-leadership</category><category>data-strategy</category><category>self-service-analytics</category></item><item><title>Managing Up Isn&apos;t About Office Politics</title><link>https://boston-wib.org/blog/quicktake/managing-up-isnt-about-office-politics</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/managing-up-isnt-about-office-politics</guid><pubDate>Thu, 11 Sep 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-09-11-managing-up-isnt-about-office-politics.png&quot; alt=&quot;A silhouette of two people about to shake hands, with one person standing on a higher platform and the other on a lower platform. The platforms are connected by a bridge.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Understanding what success looks like from your manager’s perspective is key to advancing your career.&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>technical-leadership</category><category>managing-up</category><category>career-development</category><category>leadership</category></item><item><title>Press Play on Bioinformatics: Snakemake in Action</title><link>https://boston-wib.org/blog/quicktake/press-play-on-bioinformatics-snakemake-in-action</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/press-play-on-bioinformatics-snakemake-in-action</guid><pubDate>Tue, 09 Sep 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://images.unsplash.com/photo-1666730140132-e7d83575c599?q=80&amp;w=580&amp;auto=format&amp;fit=crop&amp;ixlib=rb-4.1.0&amp;ixid=M3wxMjA3fDB8MHxwaG90by1wYWdlfHx8fGVufDB8fHx8fA%3D%3D&quot; alt=&quot;A coiled snake staring directly at the camera&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Ever wish running a bioinformatics pipeline was as easy as pressing “play”? With Snakemake, it can be.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;🐍 Ever wish running a bioinformatics pipeline was as easy as pressing “play”?&lt;/p&gt;
&lt;p&gt;That’s basically what &lt;a href=&quot;https://snakemake.readthedocs.io/en/stable/&quot;&gt;Snakemake&lt;/a&gt; does!&lt;/p&gt;
&lt;p&gt;Think of it like a recipe book for data analysis:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;You list your ingredients (raw data)&lt;/li&gt;
&lt;li&gt;Write down each step (rules and scripts for preprocessing, analysis, plots)&lt;/li&gt;
&lt;li&gt;Hit “go” and it automatically cooks the entire meal for you!&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The magic?&lt;/p&gt;
&lt;p&gt;✅ No more re-running everything when just one step changes&lt;/p&gt;
&lt;p&gt;✅ Works on your laptop or scales up to an HPC cluster&lt;/p&gt;
&lt;p&gt;✅ Makes your analysis reproducible, so six months later (or on someone else’s machine) you get the same results&lt;/p&gt;
&lt;p&gt;To put this into practice, I recently built a Snakemake workflow for cervical cancer gene expression analysis.&lt;/p&gt;
&lt;p&gt;It:&lt;/p&gt;
&lt;p&gt;🔹 Fetches data directly from GEO&lt;/p&gt;
&lt;p&gt;🔹 Runs preprocessing + differential expression analysis&lt;/p&gt;
&lt;p&gt;🔹 Generates a volcano plot for quick visualization&lt;/p&gt;
&lt;p&gt;You basically write a Snakefile describing each step (preprocessing, analysis, visualization), and then run just one command:&lt;/p&gt;
&lt;pre&gt;&lt;code&gt;snakemake --cores 4
&lt;/code&gt;&lt;/pre&gt;
&lt;p&gt;That’s it! Snakemake figures out the order of tasks, runs only what’s needed, and makes sure results are reproducible.&lt;/p&gt;
&lt;p&gt;✨ The BEST part? With the config file updated for your dataset, ANYONE can reproduce the full analysis with just that one command!&lt;/p&gt;
&lt;p&gt;🟢 I’ve shared the pipeline on GitHub here 👉 &lt;a href=&quot;https://lnkd.in/eS6G7W75&quot;&gt;https://lnkd.in/eS6G7W75&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;If you’re curious about Snakemake or just want to peek at a reproducible cancer genomics workflow, check it out!&lt;/p&gt;
</content:encoded><category>bioinformatics</category><category>snakemake</category><category>reproducibility</category><category>data-science</category><category>r-programming</category><category>python</category><category>version-control</category><category>computational-biology</category><category>learn-by-doing</category><category>women-in-stem</category></item><item><title>Team Building Isn&apos;t About Trust Falls</title><link>https://boston-wib.org/blog/quicktake/team-building-isnt-about-trust-falls</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/team-building-isnt-about-trust-falls</guid><pubDate>Mon, 08 Sep 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-09-08-team-building-isnt-about-trust-falls.png&quot; alt=&quot;Small group of people looking at a laptop with thought bubbles showing different motivations: &quot;Learn Python,&quot; &quot;Make an Impact,&quot; &quot;Solve Complex Problems,&quot; and &quot;Data Visualization Mastery&quot;&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;The best technical leaders figure out what each team member wants from the project—then find ways to deliver it.&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>technical-leadership</category><category>team-building</category><category>career-development</category><category>leadership</category><category>mentorship</category></item><item><title>The Art of Saying No Without Losing Friends</title><link>https://boston-wib.org/blog/quicktake/the-art-of-saying-no-without-losing-friends</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/the-art-of-saying-no-without-losing-friends</guid><pubDate>Thu, 04 Sep 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-09-04-the-art-of-saying-no-without-losing-friends.png&quot; alt=&quot;Cartoon of a person on a path labeled &quot;Original Scope&quot; with alternate paths branching off with different features. The end of the path has a signpost with &quot;Project Success&quot; written on it.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Scope negotiation isn&apos;t about being rigid—it&apos;s about being intentional.&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>technical-leadership</category><category>scope-management</category><category>project-management</category><category>leadership</category><category>career-development</category></item><item><title>A Coffee with CompBio: The Spatial Transcriptomics Toolkit</title><link>https://boston-wib.org/blog/coffeewithcompbio/s1-e7</link><guid isPermaLink="true">https://boston-wib.org/blog/coffeewithcompbio/s1-e7</guid><pubDate>Tue, 02 Sep 2025 08:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/coffeeWithCompBio_logo.jpeg&quot; alt=&quot;Coffee with CompBio Podcast Logo: Stylized orange and blue DNA double helix&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Memory, Clustering, and Deconvolution&lt;/em&gt;&lt;/p&gt;

&lt;iframe
  name=&quot;Ausha Podcast Player&quot;
  frameborder=&quot;0&quot;
  loading=&quot;lazy&quot;
  id=&quot;ausha-U2XY&quot;
  height=&quot;220&quot;
  style=&quot;border: none; width:100%; height:220px&quot;
  src=&quot;https://player.ausha.co/?podcastId=pZqqYHGLPVkY&amp;playlist=false&amp;color=%23751cbf&amp;v=3&amp;playerId=ausha-U2XY&quot;
&gt;&lt;/iframe&gt;
&lt;script src=&quot;https://player.ausha.co/ausha-player.js&quot;&gt;&lt;/script&gt;

&lt;p&gt;Alex Barlett and Lorena Pantano tackle the computational challenges of spatial transcriptomics. Learn how &lt;strong&gt;𝗕𝗣𝗖𝗲𝗹𝗹𝘀&lt;/strong&gt; can help you work with millions of cells without needing terabytes of RAM, discover how &lt;strong&gt;𝗕𝗮𝗻𝗸𝘀𝘆&amp;#39;𝘀&lt;/strong&gt; neighborhood-aware clustering reveals tissue architecture, and explore &lt;strong&gt;𝗥𝗖𝗧𝗗&amp;#39;𝘀&lt;/strong&gt; approach to cell type deconvolution in spatially-resolved data. Plus, Lorena reviews &lt;strong&gt;𝗣𝗼𝘀𝗶𝘁𝗿𝗼𝗻&lt;/strong&gt;, the new R-friendly IDE that&amp;#39;s catching attention in the bioinformatics community.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://lnkd.in/eqFfkzKq&quot;&gt;https://lnkd.in/eqFfkzKq&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://lnkd.in/ekrS4H5p&quot;&gt;https://lnkd.in/ekrS4H5p&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://lnkd.in/e7r33PKf&quot;&gt;https://lnkd.in/e7r33PKf&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://lnkd.in/eiWqVjjK&quot;&gt;https://lnkd.in/eiWqVjjK&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://lnkd.in/e7B35A7V&quot;&gt;https://lnkd.in/e7B35A7V&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://lnkd.in/eZdQ-qKV&quot;&gt;https://lnkd.in/eZdQ-qKV&lt;/a&gt; - Sean Davis&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://positron.posit.co/&quot;&gt;https://positron.posit.co/&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Please get in touch if you or your business would like to help support this podcast. Thanks &lt;a href=&quot;https://www.linkedin.com/in/amulya-shastry/&quot;&gt;Amulya Shastr&lt;/a&gt; for editing and management support.&lt;/p&gt;
&lt;p&gt;Listen to this podcast on other platforms:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaApple size={32} className=&quot;text-black dark:text-white&quot; /&gt;{&amp;#39; &amp;#39;}&lt;/p&gt;
  &lt;a href=&quot;https://podcasts.apple.com/us/podcast/the-spatial-transcriptomics-toolkit-memory-clustering/id1817024741?i=1000724623758&quot;&gt;
    Apple
  &lt;/a&gt;
&lt;/span&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaSpotify size={32} color=&quot;#1DB954&quot; /&gt;{&amp;#39; &amp;#39;}
  &lt;a href=&quot;https://open.spotify.com/episode/5aTQqIp6Ig0SE27Y5Dzcne?si=ef57ff0893c34321&quot;&gt;Spotify&lt;/a&gt;&lt;/p&gt;
&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Send us your comments, questions, and suggestions using &lt;a href=&quot;https://lnkd.in/eJ-hChm3&quot;&gt;this form&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;We are looking for sponsors! Please get in touch if you or your business would like to help support this podcast.&lt;/p&gt;
&lt;p&gt;Follow &lt;a href=&quot;https://www.linkedin.com/in/lpantano&quot;&gt;Lorena&lt;/a&gt; and &lt;a href=&quot;https://www.linkedin.com/in/alexandra-bartlett-926b32109&quot;&gt;Alex&lt;/a&gt; on LinkedIn!&lt;/p&gt;
&lt;p&gt;If you enjoyed the episode, &lt;a href=&quot;https://lnkd.in/eXQ-HpUV&quot;&gt;please subscribe and leave a review!&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Hosted by Ausha. See &lt;a href=&quot;https://ausha.co/privacy-policy&quot;&gt;ausha.co/privacy-policy&lt;/a&gt; for more information&lt;/p&gt;
</content:encoded><category>career-transition</category><category>bioinformatics</category><category>product-management</category><category>career-development</category><category>from-lab-to-tech</category><category>spatial-transcriptomics</category></item><item><title>How to Turn Stakeholders from Obstacles into Advocates</title><link>https://boston-wib.org/blog/quicktake/how-to-turn-stakeholders-from-obstacles-into-advocates</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/how-to-turn-stakeholders-from-obstacles-into-advocates</guid><pubDate>Tue, 02 Sep 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-09-02-how-to-turn-stakeholders-from-obstacles-into-advocates.png&quot; alt=&quot;Island network doodle illustrating stakeholder management. Center island &quot;Project Success&quot; connects to four surrounding islands (Problem Solver, Individualist, Collaborator, Results Oriented) via bridges or paths. The four outer islands only connect to the center, not to each other.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Technical skills get you the job. Stakeholder management skills make you effective in the job.&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>technical-leadership</category><category>stakeholder-management</category><category>project-management</category><category>career-development</category><category>leadership</category></item><item><title>The Genomics Diversity Crisis</title><link>https://boston-wib.org/blog/deepdive/deiGenomics</link><guid isPermaLink="true">https://boston-wib.org/blog/deepdive/deiGenomics</guid><pubDate>Mon, 01 Sep 2025 14:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://images.unsplash.com/photo-1635944599655-500389bddd1e?q=80&amp;w=1170&amp;auto=format&amp;fit=crop&amp;ixlib=rb-4.1.0&amp;ixid=M3wxMjA3fDB8MHxwaG90by1wYWdlfHx8fGVufDB8fHx8fA%3D%3D&quot; alt=&quot;Image of a diverse group of protesters holding signs advocating for HIV awareness and rights.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;When 86% of genomic data comes from European ancestry, treatments built on this data will inevitably fail marginalized communities.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;As diversity, equity, and inclusion (DEI) initiatives are being dismantled across US institutions, Eric Green, former Director of the National Human Genome Research Institute, opened the Festival of Genomics in Boston to make a case that DEI extends far beyond creating a better workplace culture. In genomics research, DEI is scientifically essential. DEI refers to efforts that ensure diverse representation (diversity) through fair treatment and opportunity (equity) and meaningful participation (inclusion), all grounded in respect for different communities and perspectives. Current genomic datasets overwhelmingly represent people of European ancestry, yet the insights derived from this narrow slice of humanity are being applied to diagnose, treat, and understand disease across all populations. To truly reflect human diversity, science depends on data from all communities. However, that data cannot be collected from populations who distrust the scientific establishment. Genuine commitments to equity and inclusion are the foundation needed to rebuild those critical relationships.&lt;/p&gt;
&lt;h3&gt;The Technical Problem: Sampling Bias&lt;/h3&gt;
&lt;p&gt;Sampling bias is a fundamental challenge at the heart of genomics research. This occurs when data used to build a model fails to adequately represent the study or target population due to the underrepresentation of certain groups. A classic example of sampling bias would be trying to understand human height by collecting data only from NBA players. The resulting model would drastically overestimate how tall humans are. Since sampling bias compromises generalizability, these models often produce misleading outputs. In genomics, sampling bias can lead researchers to overestimate or underestimate impacts of genetic variants or treatments.&lt;/p&gt;
&lt;p&gt;In the field of genomics, the risk of sampling bias takes on particular urgency. In 2021, an estimated 86% of sequenced genomic data came from individuals of European ancestry [^1]. This staggering imbalance means that models trained on this data are systematically misleading and, therefore, unreliable across diverse populations. This is a profound problem because models that fail to generalize to marginalized communities will inevitably exacerbate existing health disparities.&lt;/p&gt;
&lt;p&gt;As bioinformatics enters a new era of artificial intelligence (AI) driven discovery, the composition of our training data has never mattered more. How can we build a future of personalized medicine on a foundation that only represents less than 20% of the world&amp;#39;s population?&lt;/p&gt;
&lt;h3&gt;The Human Problem: A Legacy of Distrust&lt;/h3&gt;
&lt;p&gt;To improve our biological models, we must prioritize diverse dataset collection. However, while the solution seems straightforward, the reality is far more complex. The scientific community&amp;#39;s painful history of human exploitation and data misuse continues to stifle many communities&amp;#39; willingness to participate in research studies.&lt;/p&gt;
&lt;p&gt;These scars run deep. From 1932 to 1972, with support from state and local governments, the US Public Health Service conducted what became known as the Tuskegee Syphilis Study, misleading impoverished Black male sharecroppers to participate in a treatment program for their &amp;quot;bad blood&amp;quot; [^2], [^3]. In reality, the program was a study of the progression of untreated syphilis. Withheld from diagnoses and available treatment, hundreds of participants lost their lives from the disease in the name of scientific advancement.&lt;/p&gt;
&lt;p&gt;Even well-intentioned research can have ethical missteps that deepen distrust. In the 1990s, as the rate of diabetes climbed in the Havasupai tribe, an indigenous community near the Grand Canyon, around 650 members donated blood samples to Arizona State University to study genetic links to diabetes [^4]. Approximately a hundred participants signed a consent form allowing their samples to be used to &amp;quot;study the causes of behavior/medical disorders.&amp;quot; However, with English as a second language for many participants and most having never completed high school, the full implications of this broad consent were at high risk of not being understood. Therefore, when researchers used the tribe&amp;#39;s samples in studies unrelated to diabetes, it was done with disregard to the civil rights of the Havasupai tribe to self-determination. Rightfully, the Havasupai tribe sued the university and has refused to participate in any further studies.&lt;/p&gt;
&lt;p&gt;More recently, the rise of consumer genetic testing, such as those offered by 23andMe, has made it easy to share genetic data. However, these platforms have also raised concerns about data transparency for marginalized communities [^5]. These companies tell consumers that consumers own their personal data. However, for communities who are already wary of government surveillance and over-policing, the knowledge that their genetic information could potentially be sold to pharmaceutical companies to develop medicine that they will not have access to or accessed by law enforcement in ways that could lead to wrongful convictions for them or a family member adds another layer of hesitation to research participation.&lt;/p&gt;
&lt;p&gt;The examples listed only touch the surface of these deeply rooted issues that plague distrust of the scientific community, and I encourage readers to learn more outside the context of this article.&lt;/p&gt;
&lt;h3&gt;A Path Forward: DEI as a Framework for Rebuilding Trust&lt;/h3&gt;
&lt;p&gt;History shows that trust in science, even after profound breaches, can be rebuilt. The atrocities conducted by medical professionals in the Holocaust ruptured the relationship between science and Jewish communities [^6]. Despite this history, today the Ashkenazi Jewish population is among the most studied groups in human genomics. This reconciliation was made possible through decades of ethical reform and intentional efforts to include and support Jewish scientists.&lt;/p&gt;
&lt;p&gt;The Nuremberg Code, a framework established in response to unethical Nazi medical experiments, set up crucial protections for research participants through informed consent and participant autonomy [^7]. However, while these protections support equitable practices, they did not insure participants have meaningful representation and decision-making power in the research itself. For the Jewish community, this gap was filled by inclusive practices like the Rockefeller Foundation&amp;#39;s Refugee Scholar Program, which resettled Jewish scientists and supported their continued involvement in research despite widespread persecution [^8]. This combination of ethical frameworks and institutional inclusion likely contributed significantly to rebuilding trust between Jewish communities and scientific institutions.&lt;/p&gt;
&lt;p&gt;Until recently, we were seeing similar patterns emerging through initiatives of inclusion. Researchers from communities affected by HIV/AIDS has been associated with improved rates of community engagement in research and better treatment outcomes [^9]. Similarly, Indigenous health research programs indicate that studies of their communities benefit from Indigenous leadership by increasing community engagement and fostering resources to support non-Indigenous research team members to develop cultural competency [^10]. These studies ensured that voices of the communities they were doing research on were empowered to shape the research process, from study design to data interpretation.&lt;/p&gt;
&lt;p&gt;It must be emphasized that practicing DEI is not a quick fix. DEI practices must be sustained, proactive commitments to ensure diversity through informed consent and inclusion. As genomics continues to evolve, so must our standards for how research is conducted, whose data is included, and how the data is used. By building trust, we build better science and, in turn, better health outcomes for everyone.&lt;/p&gt;
&lt;p&gt;As we stand at the threshold of AI-driven genomics, we have a choice to make. We can continue building models on a narrow foundation that serves only a fraction of humanity, or we can invest in the trust-building work necessary to create truly inclusive research. The technical quality of our science depends on our decision.&lt;/p&gt;
&lt;h3&gt;References:&lt;/h3&gt;
&lt;p&gt;[^1]: Fatumo, S., Chikowore, T., Choudhury, A., Ayub, M., Martin, A. R., &amp;amp; Kuchenbäcker, K. (2022). Diversity in genomic studies: a roadmap to address the imbalance. Nature medicine, 28(2), 243.&lt;/p&gt;
&lt;p&gt;[^2]: Jones, J. H. (1993). Bad blood: the Tuskegee syphilis experiment. New and expanded ed. New York.&lt;/p&gt;
&lt;p&gt;[^3]: Gray, F. (1998). The Tuskegee syphilis study: An insider’s account. Montgomery, AL: Black Belt.&lt;/p&gt;
&lt;p&gt;[^4]: Sterling, R. L. (2011). Genetic research among the Havasupai: a cautionary tale. AMA Journal of Ethics, 13(2), 113-117.&lt;/p&gt;
&lt;p&gt;[^5]: Raz, A. E., Niemiec, E., Howard, H. C., Sterckx, S., Cockbain, J., &amp;amp; Prainsack, B. (2020). Transparency, consent and trust in the use of customers&amp;#39; data by an online genetic testing company: an exploratory survey among 23andMe users. New Genetics and Society, 39(4), 459-482.&lt;/p&gt;
&lt;p&gt;[^6]: Lagnado, L. M., &amp;amp; Dekel, S. C. (1992). Children of the flames: Dr. Josef Mengele and the untold story of the twins of Auschwitz. Penguin.&lt;/p&gt;
&lt;p&gt;[^7]: Trials of War Criminals before the Nuremberg Military Tribunals under Control Council Law No. 10. (n.d.). Permissible medical experiments. (Vol. 2, pp. 181-182). Washington, D.C.: U.S. Government Printing Office.&lt;/p&gt;
&lt;p&gt;[^8]: Iacobelli, T. (2021). The Rockefeller Foundation’s Refugee Scholar Program.&lt;/p&gt;
&lt;p&gt;[^9]: Karris, M. Y., Dube, K., &amp;amp; Moore, A. A. (2020). What lessons it might teach us? Community engagement in HIV research. Current Opinion in HIV and AIDS, 15(2), 142-149.&lt;/p&gt;
&lt;p&gt;[^10]: Woods, C., Settee, C., Beaucage, M., Robinson-Settee, H., Desjarlais, A., Adams, E., ... &amp;amp; Nahanee, D. (2023). Ensuring Indigenous co-leadership in health research: a Can-SOLVE CKD case example. International Journal for Equity in Health, 22(1), 234.&lt;/p&gt;
</content:encoded><category>diversity-equity-inclusion</category><category>dei-in-science</category><category>inclusive-research</category><category>genetics-research</category><category>equity-in-science</category><category>diversity-in-stem</category><category>genomics-equity</category></item><item><title>The Engineering Report That Never Gets Written</title><link>https://boston-wib.org/blog/quicktake/engineering-report-that-never-gets-written</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/engineering-report-that-never-gets-written</guid><pubDate>Mon, 25 Aug 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-08-25-the-engineering-report-that-never-gets-written.png&quot; alt=&quot;Left panel titled &quot;Knowledge in Heads&quot; showing multiple people with thought bubbles and right panel titled &quot;Knowledge in Reports&quot; showing how one person&apos;s knowledge is documented and shared with others.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Software engineers routinely write project wrap-up reports. Bioinformaticians? Almost never.&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>bioinformatics</category><category>knowledge-management</category><category>project-management</category><category>engineering-culture</category><category>technical-writing</category></item><item><title>The Post-Mortem No One Wants to Do (But Everyone Should)</title><link>https://boston-wib.org/blog/quicktake/the-post-mortem-no-one-wants-to-do-but-everyone-should</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/the-post-mortem-no-one-wants-to-do-but-everyone-should</guid><pubDate>Thu, 21 Aug 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-08-21-the-post-mortem-no-one-wants-to-do.png&quot; alt=&quot;Cycle diagram with five steps: 1) Failture Occurs, 2) Post-Mortem Investigation, 3) Root Cause Learning, 4) System Involvement, 5) Stronger System. Each step is arranged in a circular flow to illustrate the continuous improvement cycle.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Thirty minutes you spend on a post-mortem can save you thirty hours of future firefighting.&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>post-mortem</category><category>process-improvement</category><category>systems-thinking</category><category>technical-leadership</category><category>continuous-improvement</category></item><item><title>The One Question That Changed How I Build Tools</title><link>https://boston-wib.org/blog/quicktake/the-one-question-that-changed-how-i-build-tools</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/the-one-question-that-changed-how-i-build-tools</guid><pubDate>Mon, 18 Aug 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-08-18-the-one-question-that-changed-how-i-build-tools..png&quot; alt=&quot;flow chart showing different three different success criteria for the same request of an RNA-seq dashboard&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Successful tool development begins with understanding where to draw the finish line.&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>data-science</category><category>requirements</category><category>user-experience</category><category>bioinformatics</category><category>product-development</category></item><item><title>A Coffee with CompBio: First hand experience on transitioning to a Product Manager role</title><link>https://boston-wib.org/blog/coffeewithcompbio/s1-e6</link><guid isPermaLink="true">https://boston-wib.org/blog/coffeewithcompbio/s1-e6</guid><pubDate>Wed, 13 Aug 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/coffeeWithCompBio_logo.jpeg&quot; alt=&quot;Coffee with CompBio Podcast Logo: Stylized orange and blue DNA double helix&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Katie Huges shares her professional journey from bioinformatics to product management with help from Lorena and Alex.&lt;/em&gt;&lt;/p&gt;

&lt;iframe
  name=&quot;Ausha Podcast Player&quot;
  frameborder=&quot;0&quot;
  loading=&quot;lazy&quot;
  id=&quot;ausha-O3Nv&quot;
  height=&quot;220&quot;
  style=&quot;border: none; width:100%; height:220px&quot;
  src=&quot;https://player.ausha.co/?podcastId=PqNNvCJ4ZnWd&amp;playlist=false&amp;color=%23751cbf&amp;v=3&amp;playerId=ausha-O3Nv&quot;
&gt;&lt;/iframe&gt;
&lt;script src=&quot;https://player.ausha.co/ausha-player.js&quot;&gt;&lt;/script&gt;

&lt;p&gt;Alex Barlett and Lorena Pantano welcome Katie Hughes, their first guest, to discuss her career transition from bioinformatics to product management. Katie shares her journey from studying genetics, working in wet labs, and discovering a passion for bioinformatics, to eventually earning a master&amp;#39;s degree in the field. She details her experience at various biotech companies, including Harvard Medical School, Moderna, Sonata Therapeutics, and Generate Biomedicines. Katie emphasizes the importance of curiosity, adaptability, and soft skills in making career transitions. She explains what a product manager does, differentiates it from similar roles, and outlines the skills and experiences that helped her succeed. The discussion also covers the day-to-day responsibilities of a product manager, the collaborative nature of the role, and advice for those interested in making a similar career shift.&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://www.svpg.com/books/inspired-how-to-create-tech-products-customers-love-2nd-edition/&quot;&gt;Marty Cagan&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;https://youtube.com/@howiaipodcast?si=CYby_n5KrKUKqo2u&quot;&gt;How I AI podcast&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Listen to this podcast on other platforms:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaApple size={32} className=&quot;text-black dark:text-white&quot; /&gt;{&amp;#39; &amp;#39;}&lt;/p&gt;
  &lt;a href=&quot;https://podcasts.apple.com/us/podcast/a-coffee-with-katie-hughes-first-hand-experience/id1817024741?i=1000721700181&quot;&gt;
    Apple
  &lt;/a&gt;
&lt;/span&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaSpotify size={32} color=&quot;#1DB954&quot; /&gt;{&amp;#39; &amp;#39;}
  &lt;a href=&quot;https://open.spotify.com/episode/4m4y1iCk1AaVkBOWjnsJ3l?si=d47a019944024d2e&quot;&gt;Spotify&lt;/a&gt;&lt;/p&gt;
&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Send us your comments, questions, and suggestions using &lt;a href=&quot;https://lnkd.in/eJ-hChm3&quot;&gt;this form&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;We are looking for sponsors! Please get in touch if you or your business would like to help support this podcast.&lt;/p&gt;
&lt;p&gt;Follow &lt;a href=&quot;https://www.linkedin.com/in/lpantano&quot;&gt;Lorena&lt;/a&gt; and &lt;a href=&quot;https://www.linkedin.com/in/alexandra-bartlett-926b32109&quot;&gt;Alex&lt;/a&gt; on LinkedIn!&lt;/p&gt;
&lt;p&gt;If you enjoyed the episode, &lt;a href=&quot;https://lnkd.in/eXQ-HpUV&quot;&gt;please subscribe and leave a review!&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Hosted by Ausha. See &lt;a href=&quot;https://ausha.co/privacy-policy&quot;&gt;ausha.co/privacy-policy&lt;/a&gt; for more information&lt;/p&gt;
</content:encoded><category>career-transition</category><category>bioinformatics</category><category>product-management</category><category>career-development</category><category>from-lab-to-tech</category></item><item><title>The Hidden Bottleneck in AI-Driven Drug Discovery</title><link>https://boston-wib.org/blog/quicktake/hidden-bottleneck-ai-drug-discovery</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/hidden-bottleneck-ai-drug-discovery</guid><pubDate>Mon, 11 Aug 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-08-11-the-hidden-bottleneck-in-ai-driven-drug-design.png&quot; alt=&quot;Illustration of a bottle-neck funnel showing AI algorithms at one end and drug discovery outcomes at the other, with a narrow bottleneck in between labeled &quot;data pipeline,&quot; indicating that the flow of data is constricted at this point.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;The real challenge in AI-drive drug discovery is the broken data infrastructure.&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>AI</category><category>drug-discovery</category><category>data-infrastructure</category><category>biotech</category><category>data-science</category><category>lims</category></item><item><title>The Software Engineering Principle No One Teaches in Bioinformatics</title><link>https://boston-wib.org/blog/quicktake/software-engineering-principle-no-one-teaches-in-bioinformatics</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/software-engineering-principle-no-one-teaches-in-bioinformatics</guid><pubDate>Thu, 07 Aug 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-08-07-the-sw-eng-principle-no-one-teaches-in-bioinfo.png&quot; alt=&quot;Separation of Concerns diagram showing a monolithic script on the left that performs at least six functions to analyze RNA-seq data versus modular components on the right that separates these functions into distinct parts. The box below the diagram lists the key benefits of modular code: flexibility, testing, scalability, reusability, collaboration, and maintainability.&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Each part of your code should have one job and do it well.&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>bioinformatics</category><category>data-science</category><category>code-quality</category><category>software-engineering</category><category>computational-biology</category></item><item><title>Why Every Biotech Needs a Data Steward</title><link>https://boston-wib.org/blog/quicktake/why-every-biotech-needs-a-data-steward</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/why-every-biotech-needs-a-data-steward</guid><pubDate>Mon, 04 Aug 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-08-04-why-every-biotech-needs-a-data-steward.png&quot; alt=&quot;Diagram showing how one full-time data steward saves multiple scientists hours weekly by preventing common data issues: identifying clean datasets, locating lost data, and avoiding analysis rebuilds&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;What is a data steward? Why is important?&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>biotech</category><category>data-governance</category><category>data-stewardship</category><category>data-quality</category><category>technical-leadership</category></item><item><title>The Power of Listening Across Teams</title><link>https://boston-wib.org/blog/quicktake/the-power-of-listening-across-teams</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/the-power-of-listening-across-teams</guid><pubDate>Fri, 01 Aug 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-07-31-the-power-of-listening-across-teams.png&quot; alt=&quot;Iceberg diagram titled &quot;Listen for what&apos;s beneath the surface&quot; showing surface complaint &quot;this analysis takes forever&quot; above water, with multiple underlying causes hidden below, illustrating that the same request can have different root causes&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Listening skills in bioinformatics is critical&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>cross-functional</category><category>empathy</category><category>bioinformatics</category><category>teamwork</category><category>technical-leadership</category></item><item><title>A Coffee with CompBio: R You Doing It Right?</title><link>https://boston-wib.org/blog/coffeewithcompbio/s1-e5</link><guid isPermaLink="true">https://boston-wib.org/blog/coffeewithcompbio/s1-e5</guid><pubDate>Tue, 29 Jul 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/coffeeWithCompBio_logo.jpeg&quot; alt=&quot;Coffee with CompBio Podcast Logo: Stylized orange and blue DNA double helix&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Alex and Lorena dig into the tricks and tips that&apos;ll actually make your R code work better.&lt;/em&gt;&lt;/p&gt;

&lt;iframe
  name=&quot;Ausha Podcast Player&quot;
  frameborder=&quot;0&quot;
  loading=&quot;lazy&quot;
  id=&quot;ausha-lSBx&quot;
  height=&quot;220&quot;
  style=&quot;border: none; width:100%; height:220px&quot;
  src=&quot;https://player.ausha.co/?podcastId=Ljzz7heQJNRj&amp;playlist=false&amp;color=%23751cbf&amp;v=3&amp;playerId=ausha-lSBx&quot;
&gt;&lt;/iframe&gt;
&lt;script src=&quot;https://player.ausha.co/ausha-player.js&quot;&gt;&lt;/script&gt;

&lt;p&gt;Lorena Pantano and Alexandra Bartlett dig into the tricks and tips that&amp;#39;ll actually make your R code work better. We&amp;#39;re talking about ditching those old habits we all picked up and switching to code that works better in 2025. We cover over 10 solid habits that&amp;#39;ll seriously boost your R game - everything from how you&amp;#39;re reading and storing files, making plots that are publish-ready, theming, data manipulation, and setting up environments so your code works when you come back to it later. If you want to up your R skills, this one&amp;#39;s got practical stuff you can start using right away.&lt;/p&gt;
&lt;p&gt;Listen to this podcast on other platforms:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaApple size={32} className=&quot;text-black dark:text-white&quot; /&gt;{&amp;#39; &amp;#39;}&lt;/p&gt;
  &lt;a href=&quot;https://podcasts.apple.com/us/podcast/r-you-doing-it-right-modern-best-practices-in-r/id1817024741?i=1000719682944&quot;&gt;
    Apple
  &lt;/a&gt;
&lt;/span&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaSpotify size={32} color=&quot;#1DB954&quot; /&gt;{&amp;#39; &amp;#39;}
  &lt;a href=&quot;https://open.spotify.com/episode/02XubBH33WzWzEfxUPR49G?si=7622cebba1e842ed&quot;&gt;Spotify&lt;/a&gt;&lt;/p&gt;
&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Send us your comments, questions, and suggestions using &lt;a href=&quot;https://lnkd.in/eJ-hChm3&quot;&gt;this form&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;We are looking for sponsors! Please get in touch if you or your business would like to help support this podcast.&lt;/p&gt;
&lt;p&gt;Follow &lt;a href=&quot;https://www.linkedin.com/in/lpantano&quot;&gt;Lorena&lt;/a&gt; and &lt;a href=&quot;https://www.linkedin.com/in/alexandra-bartlett-926b32109&quot;&gt;Alex&lt;/a&gt; on LinkedIn!&lt;/p&gt;
&lt;p&gt;If you enjoyed the episode, &lt;a href=&quot;https://lnkd.in/eXQ-HpUV&quot;&gt;please subscribe and leave a review!&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Hosted by Ausha. See &lt;a href=&quot;https://ausha.co/privacy-policy&quot;&gt;ausha.co/privacy-policy&lt;/a&gt; for more information&lt;/p&gt;
</content:encoded><category>coffee-with-compbio</category><category>computational-biology</category><category>science-communication</category><category>podcast</category><category>bioinformatics</category><category>women-in-science</category><category>r-programming</category></item><item><title>Reactive vs Proactive Bioinformatics</title><link>https://boston-wib.org/blog/quicktake/reactive-vs-proactive-bioinformatics</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/reactive-vs-proactive-bioinformatics</guid><pubDate>Mon, 28 Jul 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-07-28-reactive-vs-proactive-bioinformatics.png&quot; alt=&quot;Diagram comparing two bioinformatics career modes: Firefighter (reactive, quick responses to urgent questions) versus Architect (proactive, building scalable systems)&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Bioinformatics enlightment occurs when one can regonize it has two main modes: reactive and proactive&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>bioinformatics</category><category>career-development</category><category>data-science</category><category>biotech-careers</category><category>software-engineering</category></item><item><title>Design Docs for Bioinformatics</title><link>https://boston-wib.org/blog/quicktake/design-docs-for-bioinformatics</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/design-docs-for-bioinformatics</guid><pubDate>Thu, 24 Jul 2025 08:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-07-24-design-docs-for-bioinformatics.png&quot; alt=&quot;&quot;Scope creep&quot; diagram with thumbs down emoji showing nested circles of expanding analysis: &quot;original analysis&quot; grows to &quot;add controls&quot; then &quot;what about this pathway?&quot; with arrows questioning &quot;new reference?&quot;, &quot;add more samples?&quot;, and &quot;one more comparison?&quot;&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Should we be designing bioinformatics projects like a software engineer?&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>bioinformatics</category><category>project-management</category><category>data-science</category><category>software-engineering</category><category>research-workflow</category></item><item><title>Build vs Buy in Biotech</title><link>https://boston-wib.org/blog/quicktake/buildVsBuy</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/buildVsBuy</guid><pubDate>Mon, 21 Jul 2025 08:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-07-21-real-cost-of-buying.png&quot; alt=&quot;Horizontal bar chart showing cost breakdown of tools, integration, and maintenance&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;The hidden costs of building vs buying in biotech&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>biotech</category><category>tech-strategy</category><category>build-vs-buy</category><category>data-infrastructure</category><category>tech-leadership</category></item><item><title>Code Review Culture in Research Labs</title><link>https://boston-wib.org/blog/quicktake/code-review-culture-in-research-labs</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/code-review-culture-in-research-labs</guid><pubDate>Fri, 18 Jul 2025 08:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/2025-07-18-volodymyr-dobrovolskyy-KrYbarbAx5s-unsplash.jpg&quot; alt=&quot;Orange cat looking intently at a computer screen displaying code&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Not enough code review and you risk irreproducible science, too much and you kill discovery momentum.&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>bioinformatics</category><category>code-review</category><category>research-software</category><category>team-culture</category><category>software-development</category></item><item><title>The Bioinformatics Triangle: Memory, Elegance, and Speed</title><link>https://boston-wib.org/blog/quicktake/bioInfoTriangle</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/bioInfoTriangle</guid><pubDate>Wed, 16 Jul 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://images.unsplash.com/photo-1597589827317-4c6d6e0a90bd?ixlib=rb-4.1.0&amp;ixid=M3wxMjA3fDB8MHxwaG90by1wYWdlfHx8fGVufDB8fHx8fA%3D%3D&amp;auto=format&amp;fit=crop&amp;q=80&amp;w=880&quot; alt=&quot;The Bioinformatics Triangle: Memory, Elegance, and Speed&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;How do you balance code aesthetics with performance in your bioinformatics workflows?&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>bioinformatics</category><category>python</category><category>computational-biology</category><category>coding</category><category>data-science</category></item><item><title>The Power of Strategic Data Infrastructure</title><link>https://boston-wib.org/blog/quicktake/dataInfrastructure</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/dataInfrastructure</guid><pubDate>Tue, 15 Jul 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://images.unsplash.com/photo-1484557052118-f32bd25b45b5?q=80&amp;w=1169&amp;auto=format&amp;fit=crop&amp;ixlib=rb-4.1.0&amp;ixid=M3wxMjA3fDB8MHxwaG90by1wYWdlfHx8fGVufDB8fHx8fA%3D%3D&quot; alt=&quot;Black and white photo of a data center server rack with ethernet cables&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Great science happens when technical infrastructure meets scientific curiosity&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>biotech</category><category>data-warehouse</category><category>cross-functional</category><category>data-strategy</category><category>scientific-computing</category></item><item><title>Why Computational Biologists Need Lab Notebooks</title><link>https://boston-wib.org/blog/quicktake/computationalLabNotebooks</link><guid isPermaLink="true">https://boston-wib.org/blog/quicktake/computationalLabNotebooks</guid><pubDate>Fri, 11 Jul 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://images.unsplash.com/photo-1566355892398-3d144894ef0a?q=80&amp;w=886&amp;auto=format&amp;fit=crop&amp;ixlib=rb-4.1.0&amp;ixid=M3wxMjA3fDB8MHxwaG90by1wYWdlfHx8fGVufDB8fHx8fA%3D%3D&quot; alt=&quot;Writing workspace with laptop keyboard, black notebook with pen, and personal accessories&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Somehow, this fundamental practice gets lost when we move to computational biology&lt;/em&gt;&lt;/p&gt;

</content:encoded><category>bioinformatics</category><category>data-science</category><category>computational-biology</category><category>reproducibility</category><category>best-practices</category></item><item><title>A Coffee with CompBio: Fast, Private, and Publish-Ready Spatial Transcriptomics App (Without Losing Your Mind)</title><link>https://boston-wib.org/blog/coffeewithcompbio/s1-e4</link><guid isPermaLink="true">https://boston-wib.org/blog/coffeewithcompbio/s1-e4</guid><pubDate>Thu, 10 Jul 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/coffeeWithCompBio_logo.jpeg&quot; alt=&quot;Coffee with CompBio Podcast Logo: Stylized orange and blue DNA double helix&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Alex and Lorena journey through the real-life challenges of building interactive single cell spatial data visualizations for large projects.&lt;/em&gt;&lt;/p&gt;

&lt;iframe
  name=&quot;Ausha Podcast Player&quot;
  frameborder=&quot;0&quot;
  loading=&quot;lazy&quot;
  id=&quot;ausha-lG8A&quot;
  height=&quot;220&quot;
  style=&quot;border: none; width:100%; height:220px&quot;
  src=&quot;https://player.ausha.co/?podcastId=reGGWuRj47pL&amp;playlist=false&amp;color=%23751cbf&amp;v=3&amp;playerId=ausha-lG8A&quot;
&gt;&lt;/iframe&gt;
&lt;script src=&quot;https://player.ausha.co/ausha-player.js&quot;&gt;&lt;/script&gt;

&lt;p&gt;In this episode, we journey through the real-life challenges of building interactive single cell spatial data visualizations for large projects. Lorena shares her recent adventure turning mountains of data into a web app using tools like Python, R, and the (tricky-to-pronounce) single-cell viewer &lt;a href=&quot;https://vitessce.io/&quot;&gt;&lt;em&gt;Vitessce&lt;/em&gt;&lt;/a&gt;. She discusses the hurdles of image cropping, memory limits, Python-R crossovers, and why “just putting it online” isn’t as easy as it sounds—especially when it comes to privacy, deployment, and avoiding surprise cloud bills. If you’ve ever had a collaborator say, &amp;quot;Can you just build me an app I can play with?&amp;quot;, this episode is for you.&lt;/p&gt;
&lt;p&gt;In the &amp;quot;&lt;strong&gt;Quick Sip&lt;/strong&gt;&amp;quot; segment, Alex and Lorena share tips on automating code linting with GitHub Actions. Finally, in our &amp;quot;&lt;strong&gt;Brewing Up Answers&lt;/strong&gt;&amp;quot; segment, we chat about managing people in academia vs. industry, and why it’s a very different ballgame on each side of the fence.&lt;/p&gt;
&lt;p&gt;Listen to this podcast on other platforms:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaApple size={32} className=&quot;text-black dark:text-white&quot; /&gt;{&amp;#39; &amp;#39;}&lt;/p&gt;
  &lt;a href=&quot;https://podcasts.apple.com/us/podcast/fast-private-and-publish-ready-spatial/id1817024741?i=1000716612827&quot;&gt;
    Apple
  &lt;/a&gt;
&lt;/span&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaSpotify size={32} color=&quot;#1DB954&quot; /&gt;{&amp;#39; &amp;#39;}
  &lt;a href=&quot;https://open.spotify.com/episode/3b0KdhZJr4mhnPGixu1Fkm?si=2HeKdAU5T5qWbPgO0Fur4A&quot;&gt;Spotify&lt;/a&gt;&lt;/p&gt;
&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Send us your comments, questions, and suggestions using &lt;a href=&quot;https://lnkd.in/eJ-hChm3&quot;&gt;this form&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;We are looking for sponsors! Please get in touch if you or your business would like to help support this podcast.&lt;/p&gt;
&lt;p&gt;Follow &lt;a href=&quot;https://www.linkedin.com/in/lpantano&quot;&gt;Lorena&lt;/a&gt; and &lt;a href=&quot;https://www.linkedin.com/in/alexandra-bartlett-926b32109&quot;&gt;Alex&lt;/a&gt; on LinkedIn!&lt;/p&gt;
&lt;p&gt;If you enjoyed the episode, &lt;a href=&quot;https://lnkd.in/eXQ-HpUV&quot;&gt;please subscribe and leave a review!&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Hosted by Ausha. See &lt;a href=&quot;https://ausha.co/privacy-policy&quot;&gt;ausha.co/privacy-policy&lt;/a&gt; for more information&lt;/p&gt;
</content:encoded><category>coffee-with-compbio</category><category>computational-biology</category><category>podcast</category><category>bioinformatics</category><category>women-in-science</category><category>quick-sips</category><category>brewing-up-for-answers</category><category>github-actions</category><category>github</category><category>academia-industry</category><category>cloud-computing</category><category>web-app</category><category>vitessce</category><category>python</category><category>r-programming</category><category>spatial-transcriptomics</category></item><item><title>A Coffee with CompBio: R Markdown</title><link>https://boston-wib.org/blog/coffeewithcompbio/s1-e3</link><guid isPermaLink="true">https://boston-wib.org/blog/coffeewithcompbio/s1-e3</guid><pubDate>Thu, 26 Jun 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/coffeeWithCompBio_logo.jpeg&quot; alt=&quot;Coffee with CompBio Podcast Logo: Stylized orange and blue DNA double helix&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Alex and Lorena discuss a large bulk RNA-seq project that yielded lasting changes to their group’s everyday bioinformatics practices via the creation of parameterized R Markdown code templates.&lt;/em&gt;&lt;/p&gt;

&lt;iframe
  name=&quot;Ausha Podcast Player&quot;
  frameborder=&quot;0&quot;
  loading=&quot;lazy&quot;
  id=&quot;ausha-qiIA&quot;
  height=&quot;220&quot;
  style=&quot;border: none; width:100%; height:220px&quot;
  src=&quot;https://player.ausha.co/?podcastId=9G22jH5n0lXZ&amp;playlist=false&amp;color=%23751cbf&amp;v=3&amp;playerId=ausha-qiIA&quot;
&gt;&lt;/iframe&gt;
&lt;script src=&quot;https://player.ausha.co/ausha-player.js&quot;&gt;&lt;/script&gt;

&lt;p&gt;Lorena Pantano and Alexandra Bartlett discuss a large bulk RNA-seq project that yielded lasting changes to their group’s everyday bioinformatics practices via the creation of parameterized R Markdown code templates. In the &amp;quot;&lt;strong&gt;Quick Sip&lt;/strong&gt;&amp;quot; segment, they discuss &lt;a href=&quot;https://rstudio.github.io/reticulate/&quot;&gt;&lt;em&gt;reticulate&lt;/em&gt;&lt;/a&gt; for managing python environments in an R context, and in &amp;quot;&lt;strong&gt;Brewing Up Answers&lt;/strong&gt;&amp;quot;, they reflect on the differences between industry and academia bioinformatics.&lt;/p&gt;
&lt;p&gt;Listen to this podcast on other platforms:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaApple size={32} classname=&quot;text-black dark:text-white&quot; /&gt;{&amp;#39; &amp;#39;}&lt;/p&gt;
  &lt;a href=&quot;https://podcasts.apple.com/us/podcast/r-markdown-because-rna-seq-code-shouldnt-be-wild-type/id1817024741?i=1000714675040&quot;&gt;
    Apple
  &lt;/a&gt;
&lt;/span&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaSpotify size={32} color=&quot;#1DB954&quot; /&gt;{&amp;#39; &amp;#39;}
  &lt;a href=&quot;https://open.spotify.com/episode/3y3Xp1XUsVbrbNSGiQLAPo?si=ec0cbd69073246ab&quot;&gt;Spotify&lt;/a&gt;&lt;/p&gt;
&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Send us your comments, questions, and suggestions using &lt;a href=&quot;https://lnkd.in/eJ-hChm3&quot;&gt;this form&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;We are looking for sponsors! Please get in touch if you or your business would like to help support this podcast.&lt;/p&gt;
&lt;p&gt;Follow &lt;a href=&quot;https://www.linkedin.com/in/lpantano&quot;&gt;Lorena&lt;/a&gt; and &lt;a href=&quot;https://www.linkedin.com/in/alexandra-bartlett-926b32109&quot;&gt;Alex&lt;/a&gt; on LinkedIn!&lt;/p&gt;
&lt;p&gt;If you enjoyed the episode, &lt;a href=&quot;https://lnkd.in/eXQ-HpUV&quot;&gt;please subscribe and leave a review!&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Hosted by Ausha. See &lt;a href=&quot;https://ausha.co/privacy-policy&quot;&gt;ausha.co/privacy-policy&lt;/a&gt; for more information&lt;/p&gt;
</content:encoded><category>coffee-with-compbio</category><category>computational-biology</category><category>podcast</category><category>bioinformatics</category><category>women-in-science</category><category>quick-sips</category><category>brewing-up-for-answers</category><category>reticulate</category><category>r-markdown</category><category>rna-sequencing</category><category>python</category><category>r-programming</category></item><item><title>A Coffee with CompBio: The Thousand Dollar Alignment</title><link>https://boston-wib.org/blog/coffeewithcompbio/s1-e2</link><guid isPermaLink="true">https://boston-wib.org/blog/coffeewithcompbio/s1-e2</guid><pubDate>Tue, 10 Jun 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/coffeeWithCompBio_logo.jpeg&quot; alt=&quot;Coffee with CompBio Podcast Logo: Stylized orange and blue DNA double helix&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;From puzzlingly low mapping rates to unexpected cloud costs caused by unoptimized compute jobs, Lorena and Alex highlight how essential clear communication and bioinformatics-aware experimental design are to any successful project.&lt;/em&gt;&lt;/p&gt;

&lt;iframe
  name=&quot;Ausha Podcast Player&quot;
  frameborder=&quot;0&quot;
  loading=&quot;lazy&quot;
  id=&quot;ausha-RqFX&quot;
  height=&quot;220&quot;
  style=&quot;border: none; width:100%; height:220px&quot;
  src=&quot;https://player.ausha.co/?podcastId=xAGG0uwpe3d8&amp;playlist=false&amp;color=%23751cbf&amp;v=3&amp;playerId=ausha-RqFX&quot;
&gt;&lt;/iframe&gt;
&lt;script src=&quot;https://player.ausha.co/ausha-player.js&quot;&gt;&lt;/script&gt;

&lt;p&gt;Lorena Pantano and Alexandra Bartlett share the twists and turns of realizing their methylation data wasn’t what it seemed. From puzzlingly low mapping rates to unexpected cloud costs caused by unoptimized compute jobs—thankfully caught just in time thanks to cost alarms—they highlight how essential clear communication and bioinformatics-aware experimental design are to any successful project.&lt;/p&gt;
&lt;p&gt;In our new segments, &lt;strong&gt;Quick Sips&lt;/strong&gt; and &lt;strong&gt;Brewing Up for Answers&lt;/strong&gt;, we talk about &lt;a href=&quot;https://pixi.sh/latest/&quot;&gt;PIXI&lt;/a&gt; for managing software environments (Thanks to Edmund Miller) and dig into the ever-present challenge of staying organized across complex projects.&lt;/p&gt;
&lt;p&gt;Listen to this podcast on other platforms:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaApple size={32} classname=&quot;text-black dark:text-white&quot; /&gt; &lt;a href=&quot;https://apple.co/4dXieNl&quot;&gt;Apple&lt;/a&gt;&lt;/p&gt;
&lt;/span&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaSpotify size={32} color=&quot;#1DB954&quot; /&gt; &lt;a href=&quot;https://spoti.fi/45Np6ur&quot;&gt;Spotify&lt;/a&gt;&lt;/p&gt;
&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Send us your comments, questions, and suggestions using &lt;a href=&quot;https://lnkd.in/eJ-hChm3&quot;&gt;this form&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;We are looking for sponsors! Please get in touch if you or your business would like to help support this podcast.&lt;/p&gt;
&lt;p&gt;Follow &lt;a href=&quot;https://www.linkedin.com/in/lpantano&quot;&gt;Lorena&lt;/a&gt; and &lt;a href=&quot;https://www.linkedin.com/in/alexandra-bartlett-926b32109&quot;&gt;Alex&lt;/a&gt; on LinkedIn!&lt;/p&gt;
&lt;p&gt;If you enjoyed the episode, &lt;a href=&quot;https://lnkd.in/eXQ-HpUV&quot;&gt;please subscribe and leave a review!&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Hosted by Ausha. See &lt;a href=&quot;https://ausha.co/privacy-policy&quot;&gt;ausha.co/privacy-policy&lt;/a&gt; for more information&lt;/p&gt;
</content:encoded><category>coffee-with-compbio</category><category>computational-biology</category><category>podcast</category><category>science-communication</category><category>bioinformatics</category><category>women-in-science</category><category>pixi</category><category>reproducible-research</category><category>cloud-computing</category><category>quick-sips</category><category>brewing-up-for-answers</category></item><item><title>A Coffee with CompBio: Nine Samples and Zero Cells</title><link>https://boston-wib.org/blog/coffeewithcompbio/s1-e1</link><guid isPermaLink="true">https://boston-wib.org/blog/coffeewithcompbio/s1-e1</guid><pubDate>Tue, 27 May 2025 00:00:01 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/coffeeWithCompBio_logo.jpeg&quot; alt=&quot;Coffee with CompBio Podcast Logo: Stylized orange and blue DNA double helix&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Alex and Lorena dive into the messy reality of processing single-cell RNA-seq data.&lt;/em&gt;&lt;/p&gt;

&lt;iframe
  name=&quot;Ausha Podcast Player&quot;
  frameborder=&quot;0&quot;
  loading=&quot;lazy&quot;
  id=&quot;ausha-AXeu&quot;
  height=&quot;220&quot;
  style=&quot;border: none; width:100%; height:220px&quot;
  src=&quot;https://player.ausha.co/?podcastId=9G22jHrpQ7n4&amp;playlist=false&amp;color=%23751cbf&amp;v=3&amp;playerId=ausha-AXeu&quot;
&gt;&lt;/iframe&gt;
&lt;script src=&quot;https://player.ausha.co/ausha-player.js&quot;&gt;&lt;/script&gt;

&lt;p&gt;In our first episode, Alex and Lorena dive into the messy reality of processing single-cell RNA-seq data. What started as a simple QC project turned into a week-long journey across compute environments, mysterious pipeline errors, and zero-cell outputs. Along the way, we troubleshoot issues with Cell Ranger, uncover strange sequencing artifacts, and reflect on lessons in data handling, pipeline reproducibility, and client communication.&lt;/p&gt;
&lt;p&gt;Listen to this podcast on other platforms:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaApple size={32} classname=&quot;text-black dark:text-white&quot; /&gt;{&amp;#39; &amp;#39;}&lt;/p&gt;
  &lt;a href=&quot;https://podcasts.apple.com/us/podcast/nine-samples-and-zero-cells-a-week-in-the-life/id1817024741?i=1000710174476&quot;&gt;
    Apple
  &lt;/a&gt;
&lt;/span&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaSpotify size={32} color=&quot;#1DB954&quot; /&gt;{&amp;#39; &amp;#39;}
  &lt;a href=&quot;https://open.spotify.com/episode/4XF2yQ70NXlgFHxlNOKbnb?si=aa9737c9f1c2411e&quot;&gt;Spotify&lt;/a&gt;&lt;/p&gt;
&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Send us your comments, questions, and suggestions using &lt;a href=&quot;https://lnkd.in/eJ-hChm3&quot;&gt;this form&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;We are looking for sponsors! Please get in touch if you or your business would like to help support this podcast.&lt;/p&gt;
&lt;p&gt;Follow &lt;a href=&quot;https://www.linkedin.com/in/lpantano&quot;&gt;Lorena&lt;/a&gt; and &lt;a href=&quot;https://www.linkedin.com/in/alexandra-bartlett-926b32109&quot;&gt;Alex&lt;/a&gt; on LinkedIn!&lt;/p&gt;
&lt;p&gt;If you enjoyed the episode, &lt;a href=&quot;https://lnkd.in/eXQ-HpUV&quot;&gt;please subscribe and leave a review!&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Hosted by Ausha. See &lt;a href=&quot;https://ausha.co/privacy-policy&quot;&gt;ausha.co/privacy-policy&lt;/a&gt; for more information&lt;/p&gt;
</content:encoded><category>coffee-with-compbio</category><category>computational-biology</category><category>podcast</category><category>science-communication</category><category>bioinformatics</category><category>women-in-science</category><category>single-cell</category><category>rna-sequencing</category><category>troubleshooting</category><category>cell-ranger</category></item><item><title>A Coffee with CompBio: Intro</title><link>https://boston-wib.org/blog/coffeewithcompbio/s1-e0</link><guid isPermaLink="true">https://boston-wib.org/blog/coffeewithcompbio/s1-e0</guid><pubDate>Tue, 27 May 2025 00:00:00 GMT</pubDate><content:encoded>&lt;img src=&quot;https://boston-wib.org//blog_images/coffeeWithCompBio_logo.jpeg&quot; alt=&quot;Coffee with CompBio Podcast Logo: Stylized orange and blue DNA double helix&quot; style=&quot;max-width: 100%; height: auto;&quot; /&gt;

&lt;p&gt;&lt;em&gt;Whether you&apos;re a researcher, student, or just bio-curious, join us for casual, insightful conversations that bridge science and real life.&lt;/em&gt;&lt;/p&gt;

&lt;iframe
  name=&quot;Ausha Podcast Player&quot;
  frameborder=&quot;0&quot;
  loading=&quot;lazy&quot;
  id=&quot;ausha-kP84&quot;
  height=&quot;220&quot;
  style=&quot;border: none; width:100%; height:220px&quot;
  src=&quot;https://player.ausha.co/?podcastId=qxaaQs4Xjwkv&amp;v=3&amp;playerId=ausha-kP84&quot;
&gt;&lt;/iframe&gt;
&lt;script src=&quot;https://player.ausha.co/ausha-player.js&quot;&gt;&lt;/script&gt;

&lt;p&gt;In the introductory episode, Lorena and Alex introduce themselves and share how they got started in computational biology. They talk about their career paths, what drew them to bioinformatics, and some of the challenges and surprises they’ve encountered along the way. They also give a preview of the kinds of topics and practical issues they’ll be covering on the podcast, from workflow basics to troubleshooting analysis hiccups.&lt;/p&gt;
&lt;p&gt;Listen to this podcast on other platforms:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaApple size={32} classname=&quot;text-black dark:text-white&quot; /&gt;{&amp;#39; &amp;#39;}
  &lt;a href=&quot;https://podcasts.apple.com/us/podcast/about-us/id1817024741?i=1000710167550&quot;&gt;Apple&lt;/a&gt;&lt;/p&gt;
&lt;/span&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&amp;lt;span style={{ display: &amp;#39;inline-flex&amp;#39;, alignItems: &amp;#39;center&amp;#39;, gap: &amp;#39;0.5em&amp;#39; }}&amp;gt;
  &lt;FaSpotify size={32} color=&quot;#1DB954&quot; /&gt;{&amp;#39; &amp;#39;}
  &lt;a href=&quot;https://open.spotify.com/episode/2zrTh0TGfuoW5flj2bO0hR?si=cXBcLwj3SB2h7ADbQ7MYSQ&quot;&gt;Spotify&lt;/a&gt;&lt;/p&gt;
&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Hosted by Ausha. See &lt;a href=&quot;https://ausha.co/privacy-policy&quot;&gt;ausha.co/privacy-policy&lt;/a&gt; for more information&lt;/p&gt;
</content:encoded><category>coffee-with-compbio</category><category>computational-biology</category><category>science-communication</category><category>podcast</category><category>science-communication</category><category>bioinformatics</category><category>women-in-science</category></item></channel></rss>