Improvements in sequencing technology now allow easy acquisition of large datasets; however, analyzing these data for phylogenetics can be challenging. We have developed a novel method to rapidly obtain homologous genomic data for phylogenetics directly from next-generation sequencing reads without the use of a reference genome. This software, called SISRS, avoids the time-consuming steps of de novo whole-genome assembly, multiple genome alignment, and annotation.
Results
In simulations, SISRS identified large numbers of loci containing variable sites with phylogenetic signal. For genomic data from apes, SISRS identified thousands of variable sites, from which we produced an accurate phylogeny. Finally, we used SISRS to identify phylogenetic markers, which we then used to estimate the phylogeny of placental mammals. Using datasets with different levels of missing data, we recovered eight phylogenies that resolved the basal relationships among mammals. The three alternative resolutions of the basal relationships are consistent with the major hypotheses for the relationships among mammals, all of which have been supported previously by different molecular datasets.
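To make the idea of a "variable site" and a missing-data threshold concrete, here is a minimal sketch. It is not SISRS's implementation: the function name `variable_sites`, the `max_missing` parameter, and the toy alignment are all hypothetical, and the sketch assumes the homologous bases have already been placed in equal-length, column-aligned strings.

```python
def variable_sites(alignment, max_missing=0):
    """Return indices of columns where the called bases differ among taxa.

    alignment: dict mapping taxon name -> sequence (all the same length).
    Any character other than A, C, G, or T is treated as missing data.
    max_missing: columns with more missing taxa than this are skipped.
    """
    taxa = list(alignment)
    length = len(alignment[taxa[0]])
    sites = []
    for i in range(length):
        column = [alignment[t][i].upper() for t in taxa]
        called = [base for base in column if base in "ACGT"]
        missing = len(column) - len(called)
        if missing > max_missing:
            continue  # too little data at this position
        if len(set(called)) > 1:
            sites.append(i)  # at least two different bases observed
    return sites


# Toy alignment (invented for illustration only).
toy = {
    "human":   "ACGTA",
    "chimp":   "ACGTA",
    "gorilla": "ACTTN",
    "orang":   "AGTTC",
}

strict = variable_sites(toy, max_missing=0)
relaxed = variable_sites(toy, max_missing=1)
```

Raising `max_missing` admits more sites at the cost of more gaps per site, which is one simple way to build datasets with different levels of missing data.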
Conclusions
SISRS has the potential to transform phylogenetic research. This method eliminates the need for expensive marker development in many studies by using whole genome shotgun sequence data directly. SISRS is open source and freely available at https://github.com/rachelss/SISRS/releases.
Background:
Drosophila gene expression pattern images document the spatiotemporal dynamics of gene expression during embryogenesis. A comparative analysis of these images could provide a fundamentally important way of studying the regulatory networks governing development. To facilitate pattern comparison and searching, groups of images in the Berkeley Drosophila Genome Project (BDGP) high-throughput study were manually annotated with a variable number of anatomical terms from a controlled vocabulary. Because the number of available images is rapidly increasing, it is imperative to design computational methods to automate this task.
Results:
We present a computational method to annotate gene expression pattern images automatically. The proposed method uses the bag-of-words scheme to utilize the existing information on pattern annotation and annotates images using a model that exploits correlations among terms. The proposed method can annotate images individually or in groups (e.g., according to the developmental stage). In addition, the proposed method can integrate information from different two-dimensional views of embryos. Results on embryonic patterns from BDGP data demonstrate that our method significantly outperforms other methods.
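As a rough illustration of a bag-of-words representation for a group of images, the sketch below pools local feature descriptors from every image in a group, assigns each to its nearest codebook entry, and returns a normalized histogram. The abstract does not specify the features, the codebook construction, or the annotation model, so the function names (`nearest_word`, `bag_of_words`), the two-dimensional descriptors, and the two-word codebook are assumptions made purely for illustration.

```python
def nearest_word(descriptor, codebook):
    """Index of the codebook entry closest to the descriptor (squared Euclidean)."""
    def sq_dist(k):
        return sum((d - c) ** 2 for d, c in zip(descriptor, codebook[k]))
    return min(range(len(codebook)), key=sq_dist)


def bag_of_words(image_group, codebook):
    """Normalized histogram of visual words for a group of images.

    image_group: list of images, each given as a list of feature descriptors.
    codebook: list of reference descriptors ("visual words").
    """
    counts = [0] * len(codebook)
    for descriptors in image_group:
        for descriptor in descriptors:
            counts[nearest_word(descriptor, codebook)] += 1
    total = sum(counts)
    if total == 0:
        return [0.0] * len(codebook)
    return [c / total for c in counts]


# Hypothetical codebook and descriptors, for illustration only.
codebook = [(0.0, 0.0), (10.0, 10.0)]
group = [
    [(1.0, 1.0), (0.0, 2.0)],   # descriptors from one image
    [(1.0, 0.0), (9.0, 9.0)],   # descriptors from another image (e.g., a different view)
]
histogram = bag_of_words(group, codebook)
```

Because the histogram has a fixed length regardless of how many images are in the group, it can serve as the input to a classifier that predicts annotation terms for single images and for groups alike.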
Conclusion:
The proposed bag-of-words scheme is effective in representing a set of annotations assigned to a group of images, and the model employed to annotate images successfully captures the correlations among different controlled vocabulary terms. The integration of existing annotation information from multiple embryonic views improves annotation performance.