Tagged: Open Source Drug Discovery Toggle Comment Threads | Keyboard Shortcuts

  • mattoddchem 1:20 pm on February 23, 2015 Permalink | Reply
    Tags: data exclusivity, economics, Open Source Drug Discovery, , patents, TRIPS   

    The Economics of Open Source Pharma – What about data exclusivity? 

    This post is about something called data exclusivity. I’m asking whether data exclusivity might be a way to ensure the existence of a financial incentive for open source drug discovery and development.

    I am actually asking, since I’m not clear on the language of the relevant law.

    I was talking with some people at the SGC in Oxford recently about the possibility of some (any) kind of assurance to investors in open source drug discovery and development (e.g. the state) that they might make their money back when the medicine hits the market. I had been mulling over this idea of a retrospective patent (RP). I was thinking afresh about data exclusivity, which, unlike the RP, is already enshrined in law and which might achieve something similar. Instinctively I’m against exclusivity, but there’s something interesting here.

    Data exclusivity is a period following a clinical trial where the funder of the trial (i.e. the people bringing the drug to market) have several years grace to market the medicine: to sell the medicine, you have to have generated the clinical trial data. This exclusivity is nothing to do with a patent – so for example generic versions of a medicine can be produced following the expiry of the period of exclusivity, unless a patent prevents that. Here’s a useful PDF background document. Other miscellaneous background can be seen here, here, a PDF from WIPO, and an argument that data exclusivity is a better incentive for invention than patents.

    So: is the 5-10 year exclusivity on the use of clinical trial data a means of protection for an open project? Imagine we run an open source drug discovery/development project, and we run a clinical trial as part of that. We disclose all the data as soon as possible. Would we have some period of exclusivity to act on the data? Would that permit a fully open project to set the drug’s price at a level that covers costs? i.e. can one think of the exclusivity as a positive enabler of an open project, rather than as a necessary evil?

    This hinges on whether the data can be public and still not acted upon by anyone else. In other words if you wanted to register your own version of the drug, you’d need to pay for your own trial. You’d have to pretend the original data aren’t there.

    You could of course go ahead and pay for your own trial anyway – after all you’d have some confidence that the trial would work out OK.  You could then release a competing medicine. But I can’t imagine any shareholders thinking it would be a good idea to go up against an open consortium, committed to low prices and with a multi-year headstart. (Unless your country’s laws mandate you do this – which is the case in India – in which case the open data would undoubtedly make the trial a lot simpler and cheaper.)

    So my conception here of “exclusivity” equates with a “manufacturing license” rather than “secrecy”. The ability to recoup costs might mean there is no need for patent protection and the associated secrecy.

    What does the law say? The sticky part is whether the “exclusivity” means that nobody else can see the data or whether nobody else can use the data. The TRIPS agreement on this seems unclear. Article 39.3 of TRIPS says data should be “disclosed” if the disclosure is accompanied by steps to ensure that the data are protected against “unfair commercial use.” It seems to me that this caveat means that the data could be disclosed openly, and that you’d need regulatory vigilance to make sure nobody uses the data to undercut the exclusivity granted to the open project. That shouldn’t honestly be that hard.


    1) Is this stupid, or has someone made this point before?
    2) Is anyone an expert on the letter of the law here? Can the data be seen but not used?

  • mattoddchem 10:31 pm on October 3, 2014 Permalink | Reply
    Tags: , Crowdsourcing, Education, , MMV, , Open Source Drug Discovery, Open Source Malaria, ,   

    Crowdsourcing Drug Discovery 

    Open Source Malaria has completed an experiment in crowdsourcing for open drug discovery.

    Identifying and developing medicines is a labour-intensive process, particularly in the discovery and optimization phases, and most particularly in the physical preparation of samples of new molecules for testing: a phase that consumes large amounts of time and money and is often a roadblock. One of the obvious things to do is to crowdsource the synthetic chemistry using students.

    In open source projects there is the tantalizing additional possibility that student teams can assemble to work on those problems because what’s needed can be fully described in the open. The openness means that the teams could learn from each other, share data and receive peer review and mentorship from interested experts based elsewhere, such as in the pharmaceutical industry.

    There have obviously been examples of crowdsourcing in various scientific arenas such as genomics (e.g., 1, 2) and there have been closed groups of students operating in the area of drug synthesis. There are examples of crowdsourcing initiatives in attempts to identify biologically active natural products (1, 2), Joerg Bentzien ran a Kaggle competition in in silico small molecule modeling, Urmi Bajpai is working with students in her lab on some biochemical projects (see also this earlier story) and there are other preparative activities of many kinds dating back to things as diverse as the AIDS quilts. I wasn’t aware of any students participating in crowdsourced synthetic chemistry as part of a project that was open source, though Patrick Thomson provided a spectacular example of how openness can lead to high quality scientific contributions from individuals. Mass recruitment of synthetic expertise is going to be one cornerstone of any scaled-up vision for Open Source Pharma, and I was very keen to see if we could complete this exercise in OSM as a precedent, and maybe learn how to make sure it works most effectively.

    We just completed this precedent. Around 50 students from Lawrence University in the US midwest worked on the synthesis of six new analogs in the “Near Neighbour” (NN) branch of Series 1 of OSM. The compounds were mailed to a different lab in the US for biological evaluation. Active compounds were discovered – one of them new to the project and quite potent. I find the closure of this loop tremendously exciting.

    Data for the Lawrence University Compounds - check out the red value for OSM-A-3!

    Data for the Lawrence University Compounds – check out the red value for OSM-A-3!

    Here’s how it worked. I was contacted a few days before the end of 2012 by two US-based academics asking whether their undergraduate cohorts could contribute to OSM. This had been an aim of the project way back in the design phase and not something we’d gotten round to. For one of the academics, Stefan Debbert, Murray Robertson and I wrote up a description of a synthetic route we’d been using for the NN compounds. The chemistry is not trivial, but possesses the advantage that the compounds are typically solids that can be easily purified and are stable under ambient conditions. Stefan looked into this and came up with some analog structures he felt his students would be able to access, and without any further input from us he got this working in his lab class in early 2013. Around June/July he wrote to say his students had finished the syntheses, and in November he uploaded all the data to the online lab book. After some quality control checking of the data by others, the compounds were shipped to Kip Guy’s lab at St Jude’s where Julie Clark performed the assessment of potency and the data were put up online last month, thereby closing the loop on this part of the project. This whole process took longer than it should have since we were all feeling our way a little on this one.

    The data have come in just in time for the writing of the first OSM paper, which is at an advanced stage. We are now faced with the interesting challenge of how to credit this student cohort with authorship. What a great problem to have.

    A Student Hard at Work in the Debbert Undergrad Lab

    A Student Hard at Work in the Debbert Undergrad Lab

    The take-home: a cohort of undergraduates successfully made new molecules that are potent in killing the malaria parasite. The data are valuable as part of a larger project of current research that will be published in the peer-reviewed literature.

     The students were able to make an impact because:

    1) A current/future research need was openly described

    2) The rules behind the project clearly stated that anyone could participate

    3) The student team was locally and carefully mentored by a dedicated individual (Stefan) willing to engage in an unusual activity

    4) The details of what was needed were described in full, i.e. previous, related research that made it clear the activity would not be open-ended

    5) The outputs were actively used, i.e. data deposited in the relevant lab notebook for the benefit of the wider project and molecules tested by a laboratory willing to do so.


    This is scaleable. Any team of undergraduates can engage in real research in this way, and perform scientific activities that are valuable to a “live” project and which might result in the undergraduates being able to publish their work. The work could be out-of-hours or, more excitingly (and as in Stefan’s case) part of a formal lab class for which the students receive credit – more on this below. I don’t know about you, but if I’d had the chance to make real molecules for a real research project as an undergrad, I would have found that quite motivating.

    One of the neatest aspects about doing this openly is the quality control. A cynical onlooker might say “Well, these are just a bunch of undergrads – how can we trust the work?” You don’t have to. In an open source project, you have access to the raw data, so you can check the quality for yourself. Many people don’t trust data in synthetic chemistry journals in any case (trust is a problem across the discipline) but raw data in open projects solves this problem in as much as it can be solved.

    So what would I change next time?

    1) Fully Independent Contribution. The collaboration was set up because Stefan contacted me to ask if he could help out. That’s fine, but in an ideal world it would be clear to a lab director what was required without asking – that’s the aim of the Github To Do List that has since been introduced to OSM. But even there it’s sometimes not obvious what’s required and I think we can use Github more effectively. As much as possible in an open project one needs to promote independent contributions and an interlinked mentorship structure (the “Linus doesn’t scale” problem). Having said that, after the initial communications, Stefan ran the whole class independently, which is a testament to his achievement.

    The OSM Consortium has a public to do list

    The OSM Consortium has a public to do list

    2) Direct Data Deposition. The students did not deposit data into the lab notebook directly – Stefan himself aggregated the work and deposited the data for each compound himself. I think this arose because using the lab notebook presented a barrier to participation that was just slightly too large, and there was a concern that students might copy each other. I want to make sure the lab notebook is so easy to use that next time the students are happy to put the data up themselves as they are working – which is the standard practice in OSM. This means making sure that everyone understands that the data they are reading is like any data in any lab book and just because it’s on the internet doesn’t mean it’s correct or final.

    One of Stefan's ELN Entries

    One of Stefan’s ELN Entries

    3) Global Lab Buddies. Ideally we would have two teams in different places communicating with each other on a synthetic route and not via the central project hub. I have in mind something that ought to be of interest to funding agencies like Fogarty, USAID or Wellcome – a lab class in a developed country pursues a synthetic route and a lab class in a developing nation pursues the same or related experiments, with direct collaboration between students and with mentorship in both places. This has the potential to be an inexpensive way of creating a “lab buddy” scheme where students can share what they learn with each other directly (with only light guidance from a PI, so that the collaboration scales well) and there is no need to spend money moving people around or organizing bench fees; effective lab-based crowdsourcing is significant in terms of how we think about organizing “networks” between universities (e.g. the APRU or WUN) – you can do a lot more for a lot less money if you don’t have to buy airplane tickets.

    The experience makes me wonder about best practice:

    1) Mentorship. How many mentors are needed for a class? There needs to be a committed local champion like Stefan, and the class size can be left up to the lab director, who also manages local health/safety requirements. There needs to be a central mentor (me, in this case) to help with any snafus. I think there is real potential here for additional mentorship from pharma professionals who could ensure that the molecules being made are reasonable for a drug discovery/development project. Such mentorship could be provided directly pro bono, or organized through a PDP (e.g. MMV, DNDi) or learned organization (e.g. the RSC). Pharma experts in a given location could contribute to their community in this way – e.g. Merck could mentor students working in schools in New Jersey.

    2) Replication. It makes sense to me for two students to run each experiment, as a form of replication check. This is real research, so there should be controls, particularly if the students are not very experienced. It’s also probably a good idea for the students to make known compounds as part of the synthetic cluster. The need for positive controls of this kind is particularly keen in open projects since biological evaluation may well be performed in different labs where protocols will inevitably differ. A modest degree of replication makes everyone happier with the numbers ultimately obtained.

    3) End-to-End. The work the students do needs to be incorporated into the larger project (at least on a wiki or in a paper) which may require that the molecules are evaluated in some way. If this is needed, then it’s crucial that it happens, so that student effort is not wasted or merely archived. Never ask a crowd to do anything that is not then used. This needs to be factored in to the planning of project resources and means that before the synthesis starts, there needs to be a commitment from somewhere that the molecules will be taken on (in this case, to kill a nasty).

    4) How to Assess – or The Brown Oil Problem. I think the greatest power of crowdsourcing lab work lies in incorporating this into the undergraduate curriculum. Imagine that you’re planning a lab class, and you check online to find 38 different current project needs in open malaria, TB and Ebola projects. Let’s say the Open Source Pharma vision is made real and there is a repository of such projects that are known to be active that very day. This is not very far-fetched. You would, I think, be very keen to use one of these real research needs in your class, to fire up your students about how cool research is. If the syntheses were well-designed you could get started as soon as the starting materials were in. But the complication is: How to Assess the Project? What happens if a student fails to prepare a molecule that ought, in theory, to be preparable? How do we assess a project outcome if the outcome is novel, as part of a research project? I have no good answer to this, but feel that there are steps that can be taken:

    i) Clarity of Design. The designer of the synthetic route to be undertaken must provide as much detail as possible about the ease of the synthesis. This honesty is simple in open projects – in the NN synthesis undertaken by Stefan it was possible for him to see all of the mis-steps and failures that the OSM team had wrestled with previously, and it was therefore possible for us to provide a lot of advice about key steps. Students themselves can even cite previous attempts as mitigating circumstances (“Though the molecule could not be prepared it was noted that Mat Todd similarly failed to generate this key intermediate in experiment 34-6 (13th Oct 2012)…” etc)

    ii) Emphasis on Approach. The marking scheme used by the lab director needs to focus more on the approach used by the student and the quality of the scientific record produced than on the final outcome. This is in any case good academic practice. There remains an issue of plagiarism if students are posting items into the public domain, but one has to wonder about the value of having the students write reports that are vulnerable to this in the modern age.

    iii) Time-resistance. The marking scheme needs to be immune to later discoveries. If it is ultimately found that a particular compound is unstable, a student who has earlier made this compound and found the same phenomenon cannot have their efforts re-marked, just because this makes the assessment too conditional and complex.


    OSM is supported financially and scientifically by the Medicines for Malaria Venture and the Australian Government

    If people want to get involved in this kind of activity, and it’s not clear from the OSM To Do List or the wiki what is needed, then don’t hesitate to get in touch directly. It’d be great to generate some more synthetic schemes suitable for students. If you are planning a research project yourselves, and are writing a proposal, consider including some element of crowdsourcing+open source so we can have lots more students contributing to real research in the public domain.

    • mattoddchem 10:31 pm on June 11, 2015 Permalink | Reply

      Example of the use of undergraduates in research: long standing program at California State: http://scholarworks.csun.edu/handle/10211.3/125029 though clearly a difference here is that the research is open access (i.e. to read) rather than necessarily open source (open ELN, visible as it happens).

  • mattoddchem 8:34 am on October 2, 2014 Permalink | Reply
    Tags: biomedical R&D, open source, Open Source Drug Discovery, , private sector, The High Line   

    The High Line and the Public Good 

    The High Line is perhaps the most beautiful urban object I know. A former railroad track snaking through Manhattan, it has been repurposed as a garden and architectural haven, above the streets and passing within feet of the surrounding buildings, which are revealed, close up, like canyons to a fighter pilot.

    High Line in NYC

    The High Line in New York City

    The line started out as private enterprise. I wonder if the development was originally publicly subsidized. It any event the line failed, since trucking made it industrially obsolete. You can still pick out the original ironwork in amongst the grasses. More recently the line was saved by an initiative based on community action, seeded with public money, followed by private money. The funding that has made the High Line incredible is shown by plaques near the middle of the trail to be this mix of public and private influence. I’m in awe of those who had the original vision to see what the line would become.

    Public Leadership for the High Line

    Public Leadership for the High Line

    Private money for the High Line Development

    Private and Corporate Support for the High Line

    I first visited the High Line four years ago, then again a few weeks back. What a difference those years have made. So much new private industry has emerged along the sides of the tracks in the streets below – cafes, bars, shops where before there was little. Another example, should we need another, that public works can act as a seed for rapid private investment.

    The High Line is also free. It’s a marvel. I read that London is planning a garden bridge. To be great places, cities need inspirational works. We will rarely achieve that from public or private sector alone, but instead a combination without a predictable recipe.

    High Line - New Section

    Start of the Newer Section of the High Line

    I couldn’t help but extrapolate to pharma.

    Medicines sustain people – enabling them to do great things. A society profits enormously if its population is healthy. Effective medicines are a public good, produced by public and private enterprise. Yet we’re not currently producing the medicines we need, and we need to ask why. Perhaps we have become overly reliant on the private sector (which currently funds about 2/3 of all biomedical R&D). Probably no single system will work in producing what we need: every disease is different. The private sector needs the public sector to do what it can’t, and to do that it needs more R&D funds.

    But the public sector can do something genuinely radical that it’s not currently doing and which the private sector cannot – embrace an open source approach where all data and ideas are freely shared, anyone may participate and there are no patents. This is the idea behind Open Source Pharma. Such an open, meritocratic arena achieves something quite astonishing: it promotes competition and collaboration at the same time.

    Scaffolding at the End of the Line

    Scaffolding at the End of the Line

    The effect of going open source in the development of new therapeutics might simply be to de-risk some avenues of enquiry – to take discovery far enough that the private sector can accommodate the risk. The effect might be to enable genuinely new medicines to be discovered, developed and marketed completely from scratch. And the effect might be to provide a shocking and competing business model, if it’s done right. My money is on all three of these.

    The High Line opens up new spaces

    The High Line Opens New Spaces

    But let’s not be lazy and assume that public R&D needs to operate just like private R&D. It would be a serious mistake to spend public money like we’re spending private, pretending that we need to use the same research model. If we’re funding it differently, we can do it differently. So why not play to the strengths of what entrepreneurial public financing can do – and open it up.

  • mattoddchem 12:24 pm on July 15, 2013 Permalink | Reply
    Tags: , , , Open Source Drug Discovery, Open Source Research   

    An Example of Open Source Drug Discovery 

    “Open Source Drug Discovery? How does that work?” I am asked this quite a lot. There are some principles and core practices that are involved, embodied in the Six Laws, but those are quite hifalutin. Let me give a practical example.

    The main point is that everything is open, so anyone can come along, make molecules, do biology, do informatics, contribute data to the lab notebooks, coordinate the research and generally contribute in any number of ways for which there are simple technological solutions. All sorts of inputs are possible, including experimental ones.

    Earlier this year there was a need to make a bunch of molecules as part of the first medicinal chemistry campaign in the open source malaria project. The hit molecule that was the starting point had an ester in it, and initial results had shown that the ester was a problem. There were several ester replacements that needed to be synthesized. Many of these had been made. A few had either not been tackled or were proving difficult to make. Alice Williamson made up a “Wanted” poster to highlight that anyone could have a go at making them (see below, and in these nice images of the process of collaboration that Alice also made).



    Because these difficulties were being tackled in real time, other scientists could read about them and suggest solutions. Patrick Thomson, an Edinburgh PhD student, was reading this and decided to join in. He got going in the lab on the sulfonamide sE, started posting his data to the same lab notebook the Sydney team were using (to make it easier to compare and share data) and started his own blog to help marshall and summarize results. He corresponded with a student in my lab in Sydney to overcome synthetic hurdles. Several of us worked together to ensure quality control of the data – simple in the digital age.


    And Patrick nailed it. He made the compound. He then mailed it off for biological evaluation. Not to Sydney, which would be a waste of good funds for freight charges, but to a local lab at The University of Dundee. With the appropriate controls and sharing of all data, testing of compounds in different labs can be trusted to deliver a consistent picture.


    In the event, Patrick’s molecules (he tested 3) were inactive. This is a shame, but at the same time the ester/sulfonamide replacement is an obvious one to make and the data enhance the paper where these results will go, on which Patrick will be an author.

    Patrick showed great initiative here, and philosophical agility – he got involved with something unfamiliar where he had not met the people he’d be working with. Maybe it’s because he agreed with the principles of openness, or maybe it was because he saw that all project data were being shared so trust wasn’t an issue – he could see the project involved quality science. Either way, Patrick has provided an excellent example of how open source experimental lab-based research can work on a practical level.

    Which makes one think of all sorts of added benefits of working in a genuinely open manner. One is that open source research provides a low-tech infrastructure for researchers from developing (endemic) countries to get involved in collaborative projects with their peers in the developed world. There are no expensive online tools, no subscription fees for software. Researchers carry out their work in local labs, and share all the data online using existing technology solutions. Crucially the mechanism scales because researchers interact directly, rather than having to pass everything by a bottleneck professor. There is no monolithic “aid” program needed, just the means for people to work together effectively and efficiently. The involvement of endemic labs remains to my mind a top priority target for open research – the barrier is identifying the labs with the right level of chemical facilities (if anyone has any ideas…).

    Today the Open Source Malaria landing page is launched – the project used to be called Open Source Drug Discovery for Malaria, but the name was changed partly to highlight more the real essence of the project – open source. Much of the research is about basic synthetic chemistry, or biology and yes it’s aimed at finding new medicines for malaria, but at the same time the power of the approach lies in how it changes the way we do basic science. The new website highlights the most productive tools used thus far – the lab notebooks, the Github To Do list, the Twitter account, the wiki.




    With these simple tools, Patrick was able to make an important, material contribution to an experimental research project without ever leaving his Wimbledon-winning home country. Anyone else can do the same if the inviolable principles of open source are agreed upon – No Need to Ask.


Compose new post
Next post/Next comment
Previous post/Previous comment
Show/Hide comments
Go to top
Go to login
Show/Hide help
shift + esc