Publications

3D models of fungal chromosomes to enhance visual integration of omics data

datafun

Abstract

The functions of eukaryotic chromosomes and their spatial architecture in the nucleus are reciprocally dependent. Hi-C experiments are routinely used to study chromosome 3D organization by …

Authors: Thibault Poinsignon, Mélina Gallopin, Pierre Grognet, Fabienne Malagnac, Gaëlle Lelandais, Pierre Poulain

Date Published: 1st Dec 2023

Publication Type: Journal Article

DOI: 10.1093/nargab/lqad104

Citation: NAR Genomics and Bioinformatics 5(4),lqad104

Created: 26th Oct 2025 at 17:02, Last updated: 26th Oct 2025 at 17:06

Analysis of Protein-Protein Interactions networks and cross-species transfer learning comparison for seven organisms

yPublish - Bioinfo tools

Abstract

Motivation: Protein-protein interactions (PPIs) have many applications, such as inferring protein functions or supporting the drug discovery process. For the human species, a large amount of validated information and functional annotation is available for the proteins in its interactome. In other species, the known interactome is much smaller than the human one, and many proteins have few or no annotations by specialists. Understanding the interactomes of other species helps to trace evolutionary characteristics, compare important biological processes, and build interactomes for new organisms from more closely related organisms instead of relying only on the human interactome.
Results: In this study, we evaluate the performance of the PredPrIn workflow in predicting the interactomes of seven organisms in terms of scalability and precision, showing that PredPrIn achieves over 70% precision and takes less than three days even on the largest datasets. We performed a transfer learning analysis, predicting each organism's interactome from every other organism, and then showed that the results relate to the organisms' evolutionary relationships, as reflected in the number of orthologous proteins they share. We also present a functional enrichment analysis showing the proportion of shared annotations between predicted positive and false interactions, and we extract topological features of each organism's interactome, such as proteins acting as hubs and as bridges between modules. For each organism, one of the most frequent biological processes was selected, and the number of proteins and pairs present in it was compared between the interactome available in the HINT database for that organism and the one predicted by PredPrIn. This comparison showed that PredPrIn covered the proteins and pairs present in HINT and also enriched these processes for almost all organisms.
Conclusions: In this work, we have demonstrated the efficiency of the PredPrIn workflow for protein interaction prediction in seven different organisms through scalability, performance, and transfer learning analyses. We also made cross-species interactome comparisons showing the most frequent biological processes for each organism, as well as the topological features of each organism's interactome, which are consistent with hypotheses about biological networks. Finally, we described the enrichment provided by PredPrIn in selected biological processes, showing that its predictions add information about these organisms' interactomes.

Author: Yasmmin C Martins

Date Published: 7th Jun 2023

Publication Type: Journal Article

DOI: 10.1101/2023.06.05.543725

Citation: biorxiv;2023.06.05.543725v1,[Preprint]

Created: 23rd Oct 2023 at 15:23, Last updated: 23rd Oct 2023 at 15:24

Collection of wing images for conservation of honey bees (Apis mellifera) biodiversity in Europe

Apis-wings

Abstract

Identification of honey bees (Apis mellifera) from various parts of the world is essential for the protection of their biodiversity. The identification can be based on wing measurements, which is inexpensive …

Authors: Andrzej Oleksa, Eliza Căuia, Adrian Siceanu, Zlatko Puškadija, Marin Kovačić, M. Alice Pinto, Pedro João Rodrigues, Fani Hatjina, Leonidas Charistos, Maria Bouga, Janez Prešern, Irfan Kandemir, Slađan Rašić, Szilvia Kusza, Adam Tofilski

Date Published: 1st Oct 2022

Publication Type: Journal Article

DOI: 10.5281/zenodo.7244070

Citation:

Created: 28th Feb 2023 at 12:24, Last updated: 28th Feb 2023 at 14:26

Dataset: Computer software for identification of honey bee subspecies and evolutionary lineages

Apis-wings

Abstract

Coordinates of 19 landmarks from honey bee (Apis mellifera) worker wings. They represent 1832 workers, 187 colonies, 25 subspecies and four evolutionary lineages. The material was obtained from the …

Authors: Anna Nawrocka, Irfan Kandemir, Stefan Fuchs, Adam Tofilski

Date Published: 1st Apr 2018

Publication Type: Journal Article

DOI: 10.5281/zenodo.7567336

Citation:

Created: 28th Feb 2023 at 14:25, Last updated: 28th Feb 2023 at 14:27

DSCrank: A Method for Selection and Ranking of Datasets

yPublish - Bioinfo tools

Abstract

Considerable efforts have been made to build the Web of Data. One of the main challenges has to do with how to identify the most related datasets to connect to. Another challenge is to publish a local …

Authors: Yasmmin Cortes Martins, Fábio Faria da Mota, Maria Cláudia Cavalcanti

Date Published: 2016

Publication Type: Journal Article

DOI: 10.1007/978-3-319-49157-8_29

Citation: Metadata and Semantics Research 672:333-344,Springer International Publishing

Created: 23rd Oct 2023 at 14:59, Last updated: 23rd Oct 2023 at 15:04

EpiCurator: an immunoinformatic workflow to predict and prioritize SARS-CoV-2 epitopes

yPublish - Bioinfo tools

Abstract

The ongoing coronavirus 2019 (COVID-19) pandemic, triggered by the emerging SARS-CoV-2 virus, represents a global public health challenge. Therefore, the development of effective vaccines is an urgent …

Authors: Cristina S. Ferreira, Yasmmin C. Martins, Rangel Celso Souza, Ana Tereza R. Vasconcelos

Date Published: 2021

Publication Type: Journal Article

DOI: 10.7717/peerj.12548

Citation: PeerJ 9:e12548

Created: 23rd Oct 2023 at 15:04, Last updated: 23rd Oct 2023 at 15:06

EURYALE: A versatile Nextflow pipeline for taxonomic classification and functional annotation of metagenomics data

Dalmolin Systems Biology Group

Abstract

Not specified

Authors: João Vitor F. Cavalcante, Iara Dantas de Souza, Diego A. A. Morais, Rodrigo J. S. Dalmolin

Date Published: 27th Aug 2024

Publication Type: Conference Paper

DOI: 10.1109/cibcb58642.2024.10702116

Citation: In: 2024 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB). IEEE, Natal, Brazil, pp 1-7

Created: 15th Dec 2025 at 17:19

FAIR Computational Workflows

Testing

Abstract

Not specified

Authors: Carole Goble, Sarah Cohen-Boulakia, Stian Soiland-Reyes, Daniel Garijo, Yolanda Gil, Michael R. Crusoe, Kristian Peters, Daniel Schober

Date Published: 2020

Publication Type: Journal Article

DOI: 10.1162/dint_a_00033

Citation: Data Intelligence 2(1-2):108-121

Created: 2nd Dec 2021 at 10:16, Last updated: 16th Jan 2023 at 13:34

Framework para a Construção de Redes Filogenéticas em Ambiente de Computação de Alto Desempenho [Framework for the Construction of Phylogenetic Networks in a High-Performance Computing Environment]

HP2NET - Framework for construction of phylogenetic networks on High Performance Computing (HPC) environment

Abstract

This article presents a performance evaluation of a phylogenetic network framework on the Santos Dumont supercomputer. The work reinforces the benefits of parallelizing the …

Authors: Rafael Terra, Kary Ocaña, Carla Osthoff, Lucas Cruz, Philippe Navaux, Diego Carvalho

Date Published: 19th Oct 2022

Publication Type: Conference Paper

DOI: 10.5753/wscad.2022.226366

Citation: Anais do XXIII Simpósio em Sistemas Computacionais de Alto Desempenho (WSCAD 2022),pp.73-84,Sociedade Brasileira de Computação

Created: 9th Jan 2024 at 12:54, Last updated: 9th Jan 2024 at 12:57

Framework para execução de workflows de redes filogenéticas em ambientes de computação de alto desempenho [Framework for running phylogenetic network workflows in high-performance computing environments]

HP2NET - Framework for construction of phylogenetic networks on High Performance Computing (HPC) environment

Abstract

In recent years, the development of technologies such as next-generation sequencing and high-performance computing has enabled the execution of highly complex and computationally intensive Bioinformatics experiments. Several Bioinformatics fields need high-performance computing platforms to take advantage of parallelism and task distribution through specialized scientific workflow management systems. One such field is phylogeny, which expresses the evolutionary relationships between genes and organisms, establishing which of them are most closely related. Phylogeny is used in several applications, such as species classification, the discovery of kinship between individuals, the identification of pathogen origins, and even conservation biology. One way of representing these phylogenetic relationships is with phylogenetic networks. However, constructing these networks relies on computationally intensive algorithms that require constant manipulation of different input data. This work aims to develop a framework for the construction of explicit phylogenetic networks, modeling a scientific workflow that combines different network construction methods with the required input data treatment. The framework was developed to run multiple flows of the workflow in an automated, parallel, and distributed manner in a single execution, and to be executable in high-performance computing environments, which is a challenging task because the tools used were not developed with this environment in mind. To orchestrate the workflow tasks, the scalable parallel programming library Parsl was used, which allowed optimizations in the execution of the workflow's tasks and better management of resources.
Two versions of the framework were developed, called Single Partition and Multi Partition, which differ in how resources are used. In the tests performed, execution time improved by about five times compared with the sequential execution of a flow without the optimizations. The framework was validated using public Dengue virus genome data, which were processed, annotated, and run in the framework on the Santos Dumont supercomputer. The construction of the genomes' explicit phylogenetic networks indicates that the framework is a functional, efficient, and easy-to-use tool.

Authors: Rafael Terra, Kary Ocaña, Carla Osthoff, Diego Carvalho

Date Published: 18th Feb 2022

Publication Type: Master's Thesis

Citation: TERRA, R. S. Framework para execução de workflows de redes filogenéticas em ambientes de computação de alto desempenho. 2022. 71 f. Tese. (Programa de Pós-Graduação em Modelagem Computacional) - Laboratório Nacional de Computação Científica, Petrópolis, 2022.

Created: 9th Jan 2024 at 13:16
