The Norwegian EMBnet node: AGM 2010 report
The Norwegian EMBnet node has 65 members, and offers a range of life science computing services:
- official mirror of the EMBL, UniProt and GenBank databases
- MRS 4 install (command line and web)
- EMBOSS (command line and web)
- GCG (holders of license for legacy users)
- R tool
- dedicated application hosting (provision of relational databases and web services for science groups)
We have our own dedicated machine room, with nightly incremental backup, offering a variety of systems to our members:
- several memory-intensive systems (up to 64 GB RAM)
- total of 40 CPU cores
- multi-terabyte filesystem
- GPU cluster (work in progress, not yet operational)
At present, the node employs two staff members and is active in the EMBnet TM PC, specifically with:
- technical issues
- HTS IT (published the HTS IT draft report, as well as articles in EMBnet News)
Our users are affiliated with universities all over Norway, as well as with private companies working in the life sciences. As such, our services support both university-level research and the development of commercial products.
At present, we are offering a course on sequence mining using MRS and EMBOSS, with the aim of:
- introducing students to some commonly used sequence databases
- introducing students to sequence mining tools, primarily MRS and EMBOSS
- familiarizing the students with the command line interfaces to the tools, and the possibilities that they open up with regard to creating pipelines.
This course was offered at the Mexican EMBnet node in March 2010, and will be offered at the University of Oslo later this summer. Feedback from the course in Mexico has been very favorable. The course material has been made available online, as have video recordings of the course.
In the past 3 years, we have also offered courses on the following subjects:
- Bioperl (July 2008)
- R (January 2009)
- EMBOSS/GCG (June 2009)
Among new areas of research, we regard GPU computing as particularly exciting, and expect it to provide significantly more computational power for the price than CPU-based systems. Certain types of bioinformatics algorithms can be run in massively parallel mode on GPU cores and thus benefit from this technology.
To enable us to work in this new field, we applied for, and received, funding to purchase a small GPU cluster. We have acquired a server equipped with four NVIDIA Tesla C1060 GPU cards, for a total of 960 processing cores. Two of these cards will be replaced by next-generation Fermi C2050 cards as soon as these are available on the market, bringing the total to 1376 cores. The current configuration gives a processing power of approximately 3600 GFLOPS; after the upgrade, this will increase to approximately 4400 GFLOPS. This system will both enable us to build competence in this important field and allow our members to run their algorithms on an adequately powerful system. Our plan is to port various bioinformatics algorithms to the GPU processors and make them available to the node members by the end of 2010. We also plan to share our expertise with the EMBnet community.
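The core totals quoted above can be checked with a short calculation. The sketch below is illustrative only: the per-card core counts (240 for the Tesla C1060, 448 for the Fermi C2050) are assumptions taken from NVIDIA's published specifications rather than from this report, and the names `CORES_PER_CARD` and `total_cores` are made up for the example.

```python
# Per-card core counts are assumptions based on NVIDIA's published
# specifications; they are not stated in the report itself.
CORES_PER_CARD = {"Tesla C1060": 240, "Tesla C2050": 448}


def total_cores(cards):
    """Sum the processing cores over a mapping of card model -> card count."""
    return sum(CORES_PER_CARD[model] * count for model, count in cards.items())


# Current configuration: four C1060 cards.
current = {"Tesla C1060": 4}
# Planned configuration: two C1060 cards replaced by two C2050 cards.
planned = {"Tesla C1060": 2, "Tesla C2050": 2}

print(total_cores(current))  # 960
print(total_cores(planned))  # 1376
```

Under these assumptions, both results agree with the totals given in the text.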
The Norwegian EMBnet node wishes to acknowledge its user base and the Molecular Life Science committee of the University of Oslo for providing funding to achieve these goals.