An adaptive reference point approach to efficiently search large chemical databases

IRIS

The ability to rapidly search large repositories of molecules is a crucial task in chemoinformatics. In this work we propose AOR, an approach based on adaptive reference points to improve state of the art performances in querying large repositories of binary fingerprints basing on the Tanimoto distance. We propose a unifying view between the context of reference points and the previously proposed hashing techniques. We also provide a mathematical model to forecast and generalize the results, that is validated by simulating queries over an excerpt of the ChemDB. Clustering techniques are finally introduced to improve the performances. For typical situations the proposed algorithm is shown to resolve queries up to 4 times faster than compared methods. © Springer International Publishing Switzerland 2014.

An adaptive reference point approach to efficiently search large chemical databases

Napolitano F.;Tagliaferri R.;Baldi P.

2014-01-01

Abstract

The ability to rapidly search large repositories of molecules is a crucial task in chemoinformatics. In this work we propose AOR, an approach based on adaptive reference points to improve state of the art performances in querying large repositories of binary fingerprints basing on the Tanimoto distance. We propose a unifying view between the context of reference points and the previously proposed hashing techniques. We also provide a mathematical model to forecast and generalize the results, that is validated by simulating queries over an excerpt of the ChemDB. Clustering techniques are finally introduced to improve the performances. For typical situations the proposed algorithm is shown to resolve queries up to 4 times faster than compared methods. © Springer International Publishing Switzerland 2014.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2014
			
	Codice ISBN
	
				978-3-319-04128-5
978-3-319-04129-2
			
	Parole chiave
	
				Binary vector search
Chemical database
Molecular fingerprits
			
	Appare nelle tipologie:
	
				4.1 Contributo in Atti di convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12070/53708

Citazioni

ND

2

ND

social impact