Bioserver

What kind of scientists are those who do not invent?
Taste the fruits of our labour.

OUR SOFTWARE

We hope you will like our tools

ALGA (ALgorithm for Genome Assembly) is a genome-scale de novo sequence assembler based on the overlap graph approach. The method accepts as input reads from next-generation DNA sequencing, paired or not. It can be used without setting any parameters: ALGA adjusts them internally on the basis of the input data. Only one optional parameter is left to the user, the maximum allowed error rate in overlaps of reads, with a default (and suggested) value of 0.

ALGA incorporates several new ideas that result in more exact contigs produced in acceptable time. Among them are the creation of a sparse but quite informative graph, a reduction of the graph that includes a procedure referring to the minimum spanning tree problem on a local subgraph, and a graph traversal combined with simultaneous analysis of the contigs stored so far. The algorithm is one of the tools used to process data in the ongoing national project Genomic Map of Poland.

More information can be found using the URL below and in the article: S. Swat, A. Laskowski, J. Badura, W. Frohmberg, P. Wojciechowski, A. Świercz, M. Kasprzak, J. Błażewicz, "Genome-scale de novo assembly using ALGA", Bioinformatics 37(12), 2021, 1644-1651.
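To illustrate the overlap graph idea and the role of a maximum error rate in overlaps, here is a minimal Python sketch. It is not ALGA's implementation: it compares all pairs of reads naively, counts only mismatches (no insertions or deletions), and the names `best_overlap`, `overlap_graph` and `min_len` are hypothetical, introduced only for this example.

```python
from itertools import permutations


def best_overlap(a, b, min_len=3, max_error_rate=0.0):
    """Length of the longest suffix of `a` that matches a prefix of `b`.

    A candidate overlap is accepted when its number of mismatches does not
    exceed max_error_rate * overlap length. Returns 0 if nothing qualifies.
    """
    for length in range(min(len(a), len(b)) - 1, min_len - 1, -1):
        suffix, prefix = a[-length:], b[:length]
        mismatches = sum(x != y for x, y in zip(suffix, prefix))
        if mismatches <= max_error_rate * length:
            return length
    return 0


def overlap_graph(reads, min_len=3, max_error_rate=0.0):
    """Map each ordered pair of read indices (i, j) to its overlap length."""
    graph = {}
    for i, j in permutations(range(len(reads)), 2):
        length = best_overlap(reads[i], reads[j], min_len, max_error_rate)
        if length:
            graph[(i, j)] = length
    return graph


reads = ["ACGTAC", "GTACGG", "ACGGTT"]
graph = overlap_graph(reads)  # error rate 0: only exact overlaps become arcs
print(graph)
```

With the error rate left at 0, only exact suffix-prefix matches produce arcs; raising it admits overlaps with a proportional number of mismatches. A real assembler would avoid the quadratic all-pairs comparison used here.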

RNAfitme is a webserver to reconstruct a full-atomic RNA 3D structure based on the sequence and a fixed sugar-phosphate backbone, remodel the structure, and reduce steric clashes.

RNAvista is a webserver to assess RNA secondary structure with non-canonical base pairs.

Introduction

GRASShopPER (GPU overlap GRaph ASSembler using Paired End Reads) is a novel assembly method that follows the overlap–layout–consensus (OLC) approach. The method uses a very efficient GPU implementation of the exact read alignment algorithm to calculate the scores and shifts on the arcs of the graph. A two-part fork detection strategy has been introduced, which greatly reduces the misassembly rate in the resulting contigs. The first part is carried out during the graph traversal. In the second part, a greedy hyper-heuristic identifies undetected forks on the basis of paired-end read information. The results of computational experiments show high coverage of the tested genome.

Download

GRASShopPER can be downloaded at https://sourceforge.net/projects/grasshopper-assembler/
For the complete list of parameters, please see the Readme.txt file under the download link.

System requirements

GRASShopPER requires a computer with graphics processing units, and possibly an environment for running the program in parallel. Resources used in the assembly process depend on the size of the input library. For example, a bacterial genome of length 2 Mbp requires 17 GB of RAM, while one of the human chromosomes requires 82 GB.

Publication

To reference GRASShopPER, please cite: A. Swiercz, W. Frohmberg, M. Kierzynka, P. Wojciechowski, P. Zurkowski, J. Badura, A. Laskowski, M. Kasprzak, J. Blazewicz, "GRASShopPER. An algorithm for de novo assembly based on GPU alignments", PLOS ONE 13(8): e0202355, 2018. https://doi.org/10.1371/journal.pone.0202355

CLAIM-MS - CLAIM Multi Source, an expanded version of CLAIM.

Authors: Marek Blazewicz (1,2), Giovanni Felici (3), Aleksandra Swiercz (1,4), Daniele Santoni (3), Marcin Jaroszewski (1), Agnieszka Zmienko (1,4), Marta Kasprzak (1,4)

CLAIM-MS is a method for finding functionally related genes. Its novelty lies in its flexibility: the method integrates information from many input data sources of different types. We successfully validated it on gene expression data produced by different technologies (microarray, RNA-seq) and experiment setups (case-control or multi-class, single-time-point or time-series), on protein-protein interaction networks, and on Gene Ontology annotations. For each dataset, a gene-gene distance metric needs to be derived in accordance with the nature of the data and the experiment setup.

This approach expands our previous work with, among others: the ability to handle more than two data sources at once; a new, robustly converging clustering algorithm (a neural gas method); a more efficient clique detection algorithm; and a deep analysis of the underlying distance matrices, which allows the evaluation of gene clusters to be tuned to a particular biological dataset. This last procedure significantly improves the overall quality of the outcomes.

Instructions on how to run the application can be found at: README

The research was supported by grant No. 2012/05/B/ST6/03026 from the National Science Centre, Poland. A publication presenting both the method and the results is in preparation.

1 Institute of Computing Science, Poznan University of Technology, Poznan, Poland.
2 Poznan Supercomputing and Networking Center, Poznan, Poland.
3 Institute for Systems Analysis and Computer Science “Antonio Ruberti”, National Research Council of Italy, Rome, Italy.
4 Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland.

RNAssess is a computational server for comparison of RNA 3D models with the reference structure and for discrimination between the correct and incorrect models.

RNApdbee is a webserver to derive secondary structures from PDB files of knotted and unknotted RNAs.

M. Antczak, T. Zok, M. Popenda, P. Lukasiak, R.W. Adamiak, J. Blazewicz, M. Szachniuk. RNApdbee - a webserver to derive secondary structures from pdb files of knotted and unknotted RNAs, Nucleic Acids Research 42(W1), 2014, W368-W372 (doi:10.1093/nar/gku330).

M. Antczak, M. Popenda, T. Zok, M. Zurkowski, R.W. Adamiak, M. Szachniuk. New algorithms to represent complex pseudoknotted RNA structures in dot-bracket notation, Bioinformatics 34(8), 2018, 1304-1312 (doi:10.1093/bioinformatics/btx783).

T. Zok, M. Antczak, M. Zurkowski, M. Popenda, J. Blazewicz, R.W. Adamiak, M. Szachniuk. RNApdbee 2.0: multifunctional tool for RNA structure annotation, Nucleic Acids Research 46(W1), 2018 (doi:10.1093/nar/gky314).
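For readers unfamiliar with the dot-bracket notation named in the references above, the following Python sketch shows how such a string can be turned into a list of base pairs. It is an illustration only, not code from RNApdbee: the function name `parse_dot_bracket` is hypothetical, and it assumes the common convention in which `(` and `)` mark nested pairs while `[]`, `{}` and `<>` mark pseudoknotted pairs. Other symbols are rejected.

```python
def parse_dot_bracket(structure):
    """Return sorted (i, j) index pairs (0-based) encoded by a dot-bracket string."""
    opener_of = {")": "(", "]": "[", "}": "{", ">": "<"}
    # One stack per bracket type, so that pairs of different types may cross.
    stacks = {opener: [] for opener in opener_of.values()}
    pairs = []
    for index, symbol in enumerate(structure):
        if symbol in stacks:
            stacks[symbol].append(index)
        elif symbol in opener_of:
            stack = stacks[opener_of[symbol]]
            if not stack:
                raise ValueError(f"unmatched '{symbol}' at position {index}")
            pairs.append((stack.pop(), index))
        elif symbol != ".":
            raise ValueError(f"unsupported symbol '{symbol}' at position {index}")
    if any(stacks.values()):
        raise ValueError("unmatched opening bracket")
    return sorted(pairs)


# A hairpin stem "((...))" crossed by a second helix written with "[[...]]".
pairs = parse_dot_bracket("((..[[..))..]]")
print(pairs)
```

Keeping a separate stack per bracket type is what allows crossing (pseudoknotted) pairs to be recovered; a single stack would only handle fully nested structures.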

RNAlyzer is a computational method for comparison of RNA 3D models with the reference structure and for discrimination between the correct and incorrect models.

P. Lukasiak, M. Antczak, T. Ratajczak, J.M. Bujnicki, M. Szachniuk, M. Popenda, R.W. Adamiak, J. Blazewicz. RNAlyzer - novel approach for quality analysis of RNA structural models, Nucleic Acids Research, 2013, 1-13 (doi:10.1093/nar/gkt318).

Web server: http://rnassess.cs.put.poznan.pl/

RNAComposer is a tool for fully automated prediction of large RNA 3D structures. It is freely available online.

M. Popenda, M. Szachniuk, M. Antczak, K.J. Purzycka, P. Lukasiak, N. Bartol, J. Blazewicz, R.W. Adamiak. Automated 3D structure composition for large RNAs, Nucleic Acids Research 40(14), 2012, e112 (doi:10.1093/nar/gks339).

M. Antczak, M. Popenda, T. Zok, J. Sarzynska, T. Ratajczak, K. Tomczyk, R.W. Adamiak, M. Szachniuk. New functionality of RNAComposer: an application to shape the axis of miR160 precursor structure, Acta Biochimica Polonica 63(4), 2016, 737-744 (doi:10.18388/abp.2016_1329).

RNA FRABASE is an engine with a database to search for three-dimensional fragments within 3D RNA structures, using as input the sequence(s) and/or secondary structure(s) given in the dot-bracket notation.

M. Popenda, M. Blazewicz, M. Szachniuk, R.W. Adamiak. RNA FRABASE version 1.0: an engine with a database to search for the three-dimensional fragments within RNA structures, Nucleic Acids Research 36, 2008, D386-D391 (published online on October 5, 2007, doi:10.1093/nar/gkm786).

M. Popenda, M. Szachniuk, M. Blazewicz, S. Wasik, E.K. Burke, J. Blazewicz, R.W. Adamiak. RNA FRABASE 2.0: an advanced web-accessible database with the capacity to search the three-dimensional fragments within RNA structures, BMC Bioinformatics, 2010, 11:231 (published online on May 6, 2010, doi:10.1186/1471-2105-11-231).

CLAIM - coupling co-expression data and protein-protein interaction networks for functional protein analysis

CLAIM (CLuster Analysis Integration Method) is a new method for integrating co-expression data obtained through microarray (MA) experiments with protein-protein interaction (PPI) network data. Microarray and PPI data are clustered separately; the clusters are then merged in a special graph, and the cliques of this graph identify groups of functionally related proteins. The biological insight provided by these groups is analyzed on the basis of co-localization and mRNA developmental expression, pointing out the new information that can be obtained by this method.

CLAIM can also be used to assign proteins whose functional role is unknown to pathways, using the cliques that are strongly associated with known pathways. The basic assumption is that if a protein belongs to a clique and the other proteins in that clique are in a known pathway, then that protein is likely to belong to that pathway. Based on this assumption, pathway assignment was performed through a score prediction function based on the presence of a protein in pathway-enriched cliques.

The prediction power of the algorithm appears to be sufficiently high to make this method a useful semi-automated tool for protein functional analysis. CLAIM has been tested on the model organism Arabidopsis thaliana.

For more detailed information please read the CLAIM README.

Daniele Santoni (3), Aleksandra Swiercz (1,4), Agnieszka Żmieńko (1,4), Marta Kasprzak (1,4), Marek Blazewicz (1,2), Paola Bertolazzi (3), Giovanni Felici (3). An Integrated Approach (CLuster Analysis Integration Method) to Combine Expression Data and Protein–Protein Interaction Networks in Agrigenomics: Application on Arabidopsis thaliana, OMICS: A Journal of Integrative Biology 18(2), January 2014, 155-165 (doi:10.1089/omi.2013.0050).

1 Institute of Computing Science, Poznan University of Technology, Poznan, Poland.
2 Poznan Supercomputing and Networking Center, Poznan, Poland.
3 Institute for Systems Analysis and Computer Science “Antonio Ruberti”, National Research Council of Italy, Rome, Italy.
4 Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland.
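The pathway-assignment assumption described for CLAIM above (a protein whose clique partners belong to a known pathway probably belongs to it too) can be sketched in Python as follows. The actual score prediction function is not given on this page, so the scoring rule below is an assumption made purely for illustration: each clique in which at least `min_fraction` of the other members are in the pathway adds that fraction to the protein's score. The names `pathway_scores` and `min_fraction` and the toy protein identifiers are hypothetical.

```python
def pathway_scores(cliques, pathway, min_fraction=0.5):
    """Score proteins outside `pathway` by the pathway-enriched cliques they are in.

    Assumed rule (not CLAIM's published function): a clique supports a protein
    when at least `min_fraction` of its other members are in the pathway, and
    the support added to the score equals that fraction.
    """
    scores = {}
    for clique in cliques:
        for protein in clique:
            if protein in pathway:
                continue  # already assigned to the pathway
            others = clique - {protein}
            if not others:
                continue
            fraction = len(others & pathway) / len(others)
            if fraction >= min_fraction:
                scores[protein] = scores.get(protein, 0.0) + fraction
    return scores


# Toy data: P1..P4 are known pathway members; X, Y and Z are unassigned.
cliques = [
    frozenset({"P1", "P2", "P3", "X"}),
    frozenset({"P1", "P4", "X"}),
    frozenset({"P2", "Y", "Z"}),
]
pathway = {"P1", "P2", "P3", "P4"}
scores = pathway_scores(cliques, pathway)
print(scores)
```

In this toy example, X sits in two cliques made up entirely of pathway members and so receives the highest score, while Y and Z are supported only weakly by a single half-enriched clique.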

MCQ4Structures is a tool for structural similarity computation based on molecule tertiary structure representation in torsional angle space. Although it has been primarily designed to work with RNA structures, MCQ4Structures can also be applied to proteins (however, their representation is restricted to the backbone angles).

MCQ4Structures was created using Java technology. In order to use the software, one should have Java Runtime Environment 1.7 installed (JRE is freely available for download).

More information about this tool is available in the article: T. Zok, M. Popenda, M. Szachniuk. MCQ4Structures to compute similarity of molecule structures, Central European Journal of Operations Research 22(3), 2014, 457-474 (doi:10.1007/s10100-013-0296-5).

MCQ4Structures is available as a free Java Webstart application and can be downloaded from:

A tabu search algorithm for DNA sequencing by hybridization problem with a partial multiplicity information (a test set included)

SR-ASM algorithm

The SR-ASM (Short Reads ASseMbly) algorithm is designed for DNA assembly of the short sequences coming from 454 sequencers. Here you can download the source code of the algorithm, together with the sample data. The algorithm was implemented in C++ and tested under a UNIX system (SunOS 5.9). To build the source, unpack the archive and type 'make' in the directory where the source files were unpacked. See the file "readme.txt" for more information.

The usefulness of the algorithm has been proven in tests on raw data generated during sequencing of the whole 1.84 Mbp genome of the bacterium Prochlorococcus marinus. The tests of the SR-ASM algorithm were carried out on a SUN Fire 6800 at the Poznan Supercomputing and Networking Center.

sr_asm.tar.gz: 23.84 KB
readme.txt: 1.57 KB
sample.tar.gz: 59.55 KB

Detailed information about the algorithm is available here. The paper with its description and computational results is:

J. Blazewicz, M. Bryja, M. Figlerowicz, P. Gawron, M. Kasprzak, E. Kirton, D. Platt, J. Przybytek, A. Swiercz, L. Szajkowski, "Whole genome assembly from 454 sequencing output via modified DNA graph concept", Computational Biology and Chemistry 33 (2009) 224-230.

The newest version of the algorithm, which can optionally be compiled for GPU: Download the source code

The paper including implementation details is to be published in 2013 in the journal Foundations of Computing and Decision Sciences.

Instructions on how to compile the program are given in readme.txt. The algorithm can be run with different heuristics for searching for the solution ('greedy', 'flow' or 'acyclic'). Greedy is the default one; to choose another, execute the program with the parameter --path-algorithm <heuristic>.

For example, to run the program on the data file 'dataset.fasta' using GPU and the 'flow' algorithm:

    cd algorithm
    ./configure
    make GPU=YES
    ./alignment.exe --path-algorithm=flow dataset.fasta

Run the program with no parameters to display help.