CABS · Chinese American Biopharmaceutical Society · San Francisco

Open data science
for the future of biopharma.

DS4CABS is the data-science initiative of the CABS — Chinese American Biopharmaceutical Society. We build and share open-source tools, AI agents, and learning resources for target discovery, clinical trials, regulatory affairs, and pharma analytics, powered by a global community of scientists, engineers, and interns.

100+repositories
82026 intern projects
20+DS4 functional tracks
72+followers
CABS 2026 · Organizers

Program leadership

The volunteers running operations, sponsorship, communications, and mentorship coordination for 2026.

CABS 2026 · Mentors

Our 2026 mentors

Industry and academic experts guiding the cohort 1:1 — weekly check-ins, technical support, and career advice.

CABS 2026 · Interns

The 2026 intern cohort

Students from leading universities shipping open-source work across discovery, trials, markets, and policy.

Now Recruiting · 2026

Join the 2026 Data Science Summer Program

A ~10-week immersive internship applying AI & data science to drug discovery and development. Interns work on real-world projects with structured weekly mentorship and present a final research poster featured on the CABS website. We welcome students, mentors, and program volunteers from across the Pacific Rim biopharma community.

Student Track

Mentee

Bachelor's, Master's, PhD students, and Postdocs worldwide — Bay Area preferred. Stanford, UCSF, UC Berkeley, MIT, and Pacific Rim universities especially encouraged.

  • Real-world AI / data-science project in drug discovery
  • Weekly 1:1 mentorship from industry & academia
  • Workshops, networking, and member-company exposure
  • Final research poster featured on the CABS site
Apply as Mentee →
Mentor Track

Mentor

Experienced professionals from academia, biotech, and pharma with expertise in AI, bioinformatics, or data science. Matched 1:1 with a student for the summer.

  • Matched with one mentee for the program
  • Weekly 30-minute check-ins
  • Guide research direction, technical support, career advice
  • Help scale CABS to 20+ interns in Summer 2026
Register as Mentor →
Leadership Track

Leadership & Support Team

Volunteers shaping program operations — promotion, sponsorship, mentorship coordination, communications, and administration. Help build a meaningful program for the next generation of data-science leaders.

  • Administration & Operations
  • Mentors / Co-Mentors
  • Sponsorship & Outreach
  • Communications & Promotion
  • General Support Team
Express interest →

Program timeline

Summer 2026

  1. Jan 1, 2026Applications open
  2. May 15, 2026Application deadline
  3. Jun 15, 2026Internship begins
  4. Aug 15, 2026Internship ends · final posters

Winter 2026

  1. Sep 1, 2026Applications open
  2. Nov 5, 2026Application deadline
  3. Dec 15, 2026Internship begins
  4. Feb 23, 2027Internship ends · final posters

Questions? internship@cabsweb.org · Sponsorship: fundraising@cabsweb.org · Program lead: shicheng.guo@cabsweb.org

All projects

Filter and search across the entire DS4CABS catalog.

Learn & build with us

Workshops and curricula for biologists, clinicians, and data scientists.

About DS4CABS

DS4CABS is the data-science initiative of the Chinese American Biopharmaceutical Society — a non-profit professional community headquartered in the San Francisco Bay Area.

Our open-source work spans the full biopharma value chain: from target discovery and knockout analysis, to clinical trial intelligence and regulatory analytics, to market access and medical affairs. We publish code, datasets, and educational materials so that the next generation of biopharma scientists can move faster — together.

As a 501(c)(3) non-profit, we rely on community support to keep this work free — please consider making a tax-deductible donation or reaching out at fundraising@cabsweb.org to help fund the next generation of data scientists.

Support · 501(c)(3) Non-Profit

Donate to support the next generation

The Chinese American Biopharmaceutical Society is a registered 501(c)(3) non-profit organization. Its volunteer-run Data Science Committee and Internship Program mentor students and early-career scientists — the younger generation of biopharma — entirely free of charge.

We receive no tuition and charge no fees, yet training interns on real-world projects carries real costs: AI/LLM API tokens, professional software licenses, and cloud compute. Your tax-deductible donation goes directly to covering these tools so every student can learn with the same industry-grade stack used in professional biopharma teams.

AI & API credits

AI / LLM API tokens

The single biggest cost — credits that power every intern's agent, RAG, and analysis work.

  • Anthropic Claude, OpenAI & Google Gemini API tokens
  • Embeddings & vector database usage (Pinecone, Weaviate)
  • AI coding assistants — GitHub Copilot, Cursor
  • Hugging Face Pro for models & datasets
Collaboration & PM

Team & project software

The everyday tools that let a distributed cohort plan, communicate, and document like a real team.

  • Slack — team communication & mentor channels
  • Linear & Jira — issue tracking & sprint planning
  • Confluence & Notion — docs & knowledge base
  • Zoom — mentor 1:1s, workshops & demos
Cloud, compute & dev

Compute & developer tools

The infrastructure interns need to train models, run pipelines, and ship open-source code.

  • GPU & cloud credits — AWS, GCP, Azure
  • Google Colab Pro & Weights & Biases
  • GitHub Team — CI/CD, Actions minutes, private repos
  • Figma, domains & data storage / hosting

Every gift is tax-deductible. Questions about giving or sponsorship? fundraising@cabsweb.org