
Publication: Open PHACTS computational protocols for in silico target validation of cellular phenotypic screens: knowing the knowns
Phenotypic screening is in a renaissance phase and is expected by many academic and industry leaders to accelerate the discovery of new drugs for new biology. Given that phenotypic screening is per definition target agnostic, the emphasis of in silico and in vitro follow-up work is on the exploration of…
Publications referencing Open PHACTS
Read on for a list of articles that refer to Open PHACTS and our work:
Publication: Using the Semantic Web for Rapid Integration of WikiPathways with Other Biological Online Data Resources
The diversity of online resources storing biological data in different formats provides a challenge for bioinformaticians to integrate and analyse their biological data. The semantic web provides a standard to facilitate knowledge integration using statements built as triples describing a relation between two objects. WikiPathways, an online collaborative pathway resource,…
Publication: Medicinal chemistry in the era of big data
In the era of big data medicinal chemists are exposed to an enormous amount of bioactivity data. Numerous public data sources allow for querying across medium to large data sets mostly compiled from literature. However, the data available are still quite incomplete and of mixed quality. This mini review will…
Publication: The Chemical Validation and Standardization Platform (CVSP): large-scale automated validation of chemical structure datasets
There are presently hundreds of online databases hosting millions of chemical compounds and associated data. As a result of the number of cheminformatics software tools that can be used to produce the data, subtle differences between the various cheminformatics platforms, as well as the naivety of the software users, there…
Publication: Publishing DisGeNET as Nanopublications
The increasing and unprecedented publication rate in the biomedical field is a major bottleneck for discovery in Life Sciences. Although the scientific community is limited by an inability to manually curate facts from published papers, recent approaches enable the automatic, scalable and reliable extraction of assertions from the scientific literature. While…
Publication: Extraction of relations between genes and diseases from text and large-scale data analysis: implications for translational research
Background Current biomedical research needs to leverage and exploit the large amount of information reported in scientific publications. Automated text mining approaches, in particular those aimed at finding relationships between entities, are key for identification of actionable knowledge from free text repositories. We present the BeFree system aimed at identifying…
Publication: Using the BioAssay Ontology for Analyzing High-Throughput Screening Data
High-throughput screening (HTS) is the main starting point for hit identification in drug discovery programs. This has led to a rapid increase of available screening data both within pharmaceutical companies and the public domain. We have used the BioAssay Ontology (BAO) 2.0 for assay annotation within AstraZeneca to enable comparison…
Publication: Drug Discovery FAQs: Workflows for answering cross concept drug discovery questions
Modern data-driven drug discovery requires integrated resources to support decision-making and enable new discoveries. The Open PHACTS Discovery Platform (http://dev.openphacts.org) was built to address this requirement by focusing on drug discovery questions that are of high priority to the pharmaceutical industry. Although complex, most of these frequently asked questions (FAQs)…
Publication: On the formulation of performant SPARQL queries
The combination of the flexibility of RDF and the expressiveness of SPARQL provides a powerful mechanism to model, integrate and query data. However, these properties also mean that it is nontrivial to write performant SPARQL queries. Indeed, it is quite easy to create queries that tax even the most optimised…
Publication: Scientific Lenses to Support Multiple Views over Linked Chemistry Data
When are two entries about a small molecule in different datasets the same? If they have the same drug name, chemical structure, or some other criteria? The choice depends upon the application to which the data will be put. However, existing Linked Data approaches provide a single global view over…
Publication: A Knowledge-Driven Approach to Extract Disease-Related Biomarkers from the Literature
The biomedical literature represents a rich source of biomarker information. However, both the size of literature databases and their lack of standardization hamper the automatic exploitation of the information contained in these resources. Text mining approaches have proven to be useful for the exploitation of information contained in the scientific…
Publication: Transporter taxonomy – a comparison of different transport protein classification schemes
Currently, there are more than 800 well characterized human membrane transport proteins (including channels and transporters) and there are estimates that about 10% (approx. 2000) of all human genes are related to transport. Membrane transport proteins are of interest as potential drug targets, for drug delivery, and as a cause…
Publication: Toxins in transit
The Pharmacoinformatics Research Group seeks to further understanding of transporter proteins and their interactions with drugs, with a particular focus on multidrug resistance in cancer. The development of the eTOX and Open PHACTS databases should encourage greater integration of pharmacoinformatics datasets so that more efficient in silico models can be…
Publication: Applying Linked Data Approaches to Pharmacology: Architectural Decisions and Implementation
The discovery of new medicines requires pharmacologists to interact with a number of information sources ranging from tabular data to scientific papers, and other specialized formats. In this application report, we describe a linked data platform for integrating multiple pharmacology datasets that form the basis for several drug discovery applications….
Publication: Nanopublication Guidelines
This document describes the structure of nanopublications and offers guidelines in their composition, implementation and use. It was produced by members of the Concept Web Alliance (CWA), an open collaborative community that is actively addressing the challenges associated with the production, management, interoperability and analysis of unprecedented volumes of data….
Publication: Computing Identity Co-Reference Across Drug Discovery Datasets
This paper presents the rules used within the Open PHACTS (http://www.openphacts.org) Identity Management Service to compute co-reference chains across multiple datasets. The web of (linked) data has encouraged a proliferation of identifiers for the concepts captured in datasets; with each dataset using their own identifier. A key data integration challenge…
Publication: Nanopublications for exposing experimental data in the life-sciences: a Huntington’s Disease case study
Data from high throughput experiments often produce far more results than can ever appear in the main text or tables of a single research article. In these cases, the majority of new associations is often archived either as supplemental information in an arbitrary format or in publisher-independent databases that can…
Publication: Open PHACTS Explorer: Bringing the web to the semantic web
The Open PHACTS Explorer is a web application that supports drug discovery via the Open PHACTS API without requiring knowledge of SPARQL or the RDF data being searched. It provides a UI layer on top of the Open PHACTS linked data cache and also provides a JavaScript library to facilitate…
Publication: PAV ontology: provenance, authoring and versioning
We present the Provenance, Authoring and Versioning ontology (PAV): a lightweight ontology for capturing “just enough” descriptions essential for tracking the provenance, authoring and versioning of web resources. We argue that such descriptions are essential for digital scientific content. PAV distinguishes between contributors, authors and curators of content and creators…