Matching Items (2,111)
Filtering by

Clear all filters

149882-Thumbnail Image.png
Description
K-Nearest-Neighbors (KNN) search is a fundamental problem in many application domains such as database and data mining, information retrieval, machine learning, pattern recognition and plagiarism detection. Locality sensitive hash (LSH) is so far the most practical approximate KNN search algorithm for high dimensional data. Algorithms such as Multi-Probe LSH and

K-Nearest-Neighbors (KNN) search is a fundamental problem in many application domains such as database and data mining, information retrieval, machine learning, pattern recognition and plagiarism detection. Locality sensitive hash (LSH) is so far the most practical approximate KNN search algorithm for high dimensional data. Algorithms such as Multi-Probe LSH and LSH-Forest improve upon the basic LSH algorithm by varying hash bucket size dynamically at query time, so these two algorithms can answer different KNN queries adaptively. However, these two algorithms need a data access post-processing step after candidates' collection in order to get the final answer to the KNN query. In this thesis, Multi-Probe LSH with data access post-processing (Multi-Probe LSH with DAPP) algorithm and LSH-Forest with data access post-processing (LSH-Forest with DAPP) algorithm are improved by replacing the costly data access post-processing (DAPP) step with a much faster histogram-based post-processing (HBPP). Two HBPP algorithms: LSH-Forest with HBPP and Multi- Probe LSH with HBPP are presented in this thesis, both of them achieve the three goals for KNN search in large scale high dimensional data set: high search quality, high time efficiency, high space efficiency. None of the previous KNN algorithms can achieve all three goals. More specifically, it is shown that HBPP algorithms can always achieve high search quality (as good as LSH-Forest with DAPP and Multi-Probe LSH with DAPP) with much less time cost (one to several orders of magnitude speedup) and same memory usage. It is also shown that with almost same time cost and memory usage, HBPP algorithms can always achieve better search quality than LSH-Forest with random pick (LSH-Forest with RP) and Multi-Probe LSH with random pick (Multi-Probe LSH with RP). Moreover, to achieve a very high search quality, Multi-Probe with HBPP is always a better choice than LSH-Forest with HBPP, regardless of the distribution, size and dimension number of the data set.
ContributorsYu, Renwei (Author) / Candan, Kasim S (Thesis advisor) / Sapino, Maria L (Committee member) / Chen, Yi (Committee member) / Sundaram, Hari (Committee member) / Arizona State University (Publisher)
Created2011
173937-Thumbnail Image.png
Description

Leonard Hayflick studied the processes by which cells age during the twentieth and twenty-first centuries in the United States. In 1961 at the Wistar Institute in the US, Hayflick researched a phenomenon later called the Hayflick Limit, or the claim that normal human cells can only divide forty to sixty

Leonard Hayflick studied the processes by which cells age during the twentieth and twenty-first centuries in the United States. In 1961 at the Wistar Institute in the US, Hayflick researched a phenomenon later called the Hayflick Limit, or the claim that normal human cells can only divide forty to sixty times before they cannot divide any further. Researchers later found that the cause of the Hayflick Limit is the shortening of telomeres, or portions of DNA at the ends of chromosomes that slowly degrade as cells replicate. Hayflick used his research on normal embryonic cells to develop a vaccine for polio, and from HayflickÕs published directions, scientists developed vaccines for rubella, rabies, adenovirus, measles, chickenpox and shingles.

Created2014-07-20
173939-Thumbnail Image.png
Description

Although best known for his work with the fruit fly, for which he earned a Nobel Prize and the title "The Father of Genetics," Thomas Hunt Morgan's contributions to biology reach far beyond genetics. His research explored questions in embryology, regeneration, evolution, and heredity, using a variety of approaches.

Created2007-09-25
173947-Thumbnail Image.jpg
Created1935