From February 6th to February 10th, France-BioImaging organised a group meeting on the project “FBI.data” in Bordeaux. For a week, participants focused on the architecture and the implementation of image data management tools. A user-friendly response to the challenges of never-ending data production. 

New imaging technologies are very greedy in terms of image processing and data management. Beside the image itself, biological imaging generates a huge amount of metadata. The FBI.data project, one of the key missions of France-BioImaging, addresses the questions related to the computational analysis and handling of image data. 

Speeding up the implementation of tools across the infrastructure

Although the distributed FBI.data team meets once per week, the FBI.data Sprint aims at only focusing on data management scenarii and accelerating the project. Two essential aspects have been discussed:

  • First of all, the data management system architecture must be simple for them to be implemented across France-BioImaging nodes. It also has to be compatible with long-term data storage and of course, to be user-friendly – we want to keep it easy for our users! 
  • The second point is all about anticipating as many data management cases as possible. Running through all the needs of bioimaging experts and users, the team lists the specific features of each case and considers the perfect solutions for all of them.

Working for the FAIRisation of data

The FAIRisation of data for Open Science is an initiative fully endorsed by France-BioImaging. Meaning that data are Findable, Accessible, Interoperable and Reusable, the benefits for the bioimaging community are numerous. It improves transparency and reproducibility, enhances quality of results, accelerates scientific progress and method development and finally boosts collaboration within the scientific community. 

OMERO, developed by the University of Dundee & Open Microscopy Environment teams, on which the FBI.data group is working, is one of the software making user data FAIR. Being a microscopy image data management decentralised platform, it helps organise, access and archive data. Besides, it combines image and metadata storage, a viewer and data analysis resources. Furthermore, OMERO is linked to the most valuable tools for bioimaging experts (ImageJ, Napari, QPath, etc.). And users can access their data from anywhere and keep them safe.

But much more has to be set up to have functional solutions: to ease user authentication and management, manage big data transfer, and have an adequate metadata scheme. Accompanying users is one of the mission of the FBI.data team, and the FBI.data Sprint is also the occasion to join efforts from the training working group led by the training mission officer Fabrice Cordelières (Bordeaux Imaging Center) to produce adequate training material on data management.

Sharing efforts and helping the community

The FBI.data working group is composed of:

By joining their skills and experience, they are working together on setting up tools and good practices for the management and FAIRisation of data inside France-BioImaging nodes but also for the entire bioimaging community. With this in mind, the project has collaboration with, among others, the Institut Français de Bioinformatique (IFB) and the Centre National de Ressources Biologiques Marines (EMBRC), and other infrastructure through the MUDIS4LS Equipex+ project. Moreover, the FBI.data project has an open GitLab, providing image data management codes in open source, and a blog with tutorials, recommendations and so much more!

Check the France-BioImaging OMERO web portal: https://omero-fbi.fr/

And its gitlab FBI data · GitLab (in2p3.fr)

Learn more about our missions and working packages: https://france-bioimaging.org/about/work-packages/

Developed by the Serpico Inria-CNRS-Institut Curie Joint Team, member of the IPDM-BioImage Informatics node of France-BioImaging (FBI), this open-source framework could be a huge step forward in bioimaging management and analysis.

Bioimaging has a broad range of applications, addressing a variety of biological models at diverse scales of life. Thus, descriptions of novel computational approaches are often focused on target case studies. To tackle any scenario in biological imaging is a major challenge, that needs the conception and the development of a unified solution.

With this in mind, the BioImageIT project aims at providing a middleware that integrates data management with analysis using existing softwares (Omero, BioFormats, Fiji, napari, Scipy, pytorch…). The mission of BioImageIT was to design a graphical user interface (GUI) that allows any scientist without coding skills to annotate and analyze datasets using various software. By being user-centered, open-source and cross-platform (Windows, MacOS, Linux), BioImageIT created a management tool that is definitely accessible and well documented.

Started in late 2019, the project, funded by France-BioImaging, is now being deployed in 10 FBI imaging facilities. As it is a first step, the BioImageIT project have the ambition to expand the dissemination of the middleware throughout France and even further, Europe.

BioImageIT overview. a, Schematic view of BioImageIT architecture. The BioImageIT core is composed of data management and data processing functionalities. Users can access plugins by a script editor, Jupyter or the BioImageIT graphical interface (GUI). Data management functionalities exploit local files, remote files or databases such as OMERO. Data processing can perform computations in remote jobs, containers, or local runners. Image analysis is provided by plugins written in different languages. Developers can implement their own plugins in BioImageIT and design their own Graphical Interface. (b-i) LLSM processing workflow gathered in BioImageIT. Hela cell line expressing CD-M6PR-eGFP were stained with Tubulin TrackerTM Deep Red for Microtubules. b, Due to the geometry of LLS scanning, raw 3D images are skewed. c, g, First, realignment (deskew) of raw stacks is performed using Pycudadecon. d, h, Richardson Lucy deconvolution is performed using Pycudadecon. e, CD-M6PR-eGFP vesicles are tracked using Trackmate(FiJi). f, i, Deconvolved stacks and tracks are rendered using napari.

Prigent, S., Valades-Cruz, C.A., Leconte, L. et al. BioImageIT: Open-source framework for integration of image data management with analysis. Nat Methods (2022).
https://doi.org/10.1038/s41592-022-01642-9

Perrine Paul-Gilloteaux, bio-image analyst, CNRS research engineer and project manager of our Bio-image informatics node, received last month the 2017 CNRS Crystal Prize, awarding her contributions to French research.

A perfect occasion to highlight her career and her work with France BioImaging. What is eC-CLEM? How can our field deal with the massive amount of data produced? What future developments can we expect in the realm of bio-image informatics? Read the interview below to find out more.


Perrine Paul-Gilloteaux
Perrine Paul-Gilloteaux

Could you introduce yourself briefly?

My name is Perrine Paul-Gilloteaux, I’m a CNRS Research Engineer. I have a background in electrical engineering, signal and image processing, and did my PhD in augmented reality for neurosurgery through surgical microscope. I started working in bio-image analysis for microscopy in Ireland, and joined the Curie Institute on the PICT IBISA facility in 2010. I moved to Nantes in 2015, and now work in a biomedical research institute. I am also the project manager of the France BioImaging node Bio-Image Informatics IPDM (Image Processing & Data Management), and work closely with the national coordination on the aspect of data management.

I define myself as a bio-image analyst, meaning that I do my best to bridge the gap between microscopy, image analysis and biology. This means that I’m involved in data management, data processing and data analysis projects, that I provide as a service in facilities or work on as research topics.

How long have you been involved with FBI, and what main projects have you carried out with us?

I’ve been involved in FBI from its inception. I started working within the transversal IPDM working group, where we first defined the state of our management systems and worked on the interoperability of our data bases. I managed the setup of the Curie Image Database, supported by France Bio Imaging, based on the OpenImadis system. In 2015, I was nominated project manager of the IPDM node, led by Jean-Christophe Olivo-Marin and Charles Kervrann. One important part of my mission is to work with the national coordination on the data management aspect in FBI. For this, we started by making a survey of resources and management system on site. This question of data management is now central, and FBI collaborates with other infrastructures at the European level: EuroBioImaging and ELIXIR, but also at the national level with other national infrastructures in biology using microscopies, and with the French Institute of BioInformatics (ELIXIR French node).

You have developed a software called eC-CLEM. Could you explain what it consists of?

For this project, I’ve worked closely with Xavier Heiligenstein (Curie Institute, FBI working group on multimodal imaging). Ec-CLEM (for Easy-Cell-Correlative Light to Electron Microscopy) is a software designed to help correlative microscopies. The purpose is to help the fusion of information obtained by different modalities of microscopy on the same sample (for example electronic microscopy, photonic microscopy, atomic force microscopy, etc.). The software allows to register, i.e. align in the same system of coordinates, multidimensional images with big scale and resolution differences, either with a manual input of the user, either automatically when possible. In addition, it provides an estimation of the error of alignment, based on statistical methods, and detects the deformations that the sample may have undergone. I’ve developed a set of algorithms implemented as plugins for the ICY platform. [note: Perrine has published a paper about eC-CLEM in Nature Methods] During the development of this set of tools, I was greatly helped by the ICY coding parties (Hackatons) organized at Pasteur with the support of FBI, and I would encourage developers to attend such events, as there are always new things to be learnt.

Perrine Paul-Gilloteaux presents her work during the Crystal Prize Ceremony.
Perrine presents her work during the Crystal Prize Ceremony
at Nantes University in December 2017.

Bio-image informatics have taken the center stage lately, as more and more people realize how crucial image processing is for research. Could you expand a bit on that?

It is entirely true, and this is the reason why FBI has had a transversal node on that activity since its creation. I’ve cofounded a network of bio-image analysts (NEUBIAS) for that exact purpose also. The size and the number of data to be processed, the large amount of different questions to be answered from imaging and the interplaying between acquisition and processing to generate imaging data and analysis data, have led to a complexity of analysis which requires expert tools but also expert people. Bio-image informatics is a field of research by itself, and it is now recognized as such. It is bridging the gap between image processing research and biology research based on imaging. It can be seen also from the recent Nobel prizes in chemistry: in cryo-tomography or in super resolution light microscopy, both developments were relying on image processing as an essential part.

What are going to be, according to you, the next big steps and developments in the realm of image processing and data management?

The novelties in our field is two-sided: from one side we have data exploding in size and number, and on the other side, machine learning -and in particular deep learning- benefits from progress in hardware and opens the way to big progress in analysis and in particular in feature recognition (segmentation and tracking).

Regarding data management, the big issues to be solved need to involve the whole imaging community, but also to seek expertise from other fields with the same problems. Technical solutions, both from the software side (with management software such as OpenImadis, Omero, Bioemergences in France BioImaging nodes) or the hardware side (optimized hardware systems, optimized protocols of data transfer) are on their way, but will not be useful if the biologists do not put effort in data curation and data selection.

Up to this day, even with machine learning, tuning a software or a protocol to respond to a particular problem and a particular set of data requires a lot of effort, either to set up the learning network or train it in the case of machine learning, either to combine algorithm for a specific question in adapted workflows or to develop more performing algorithm. It means that we need well trained expert able to master both the image processing aspect and the biological questions behind.

On what will your FBI working group focus in 2018? What can we expect from you? (in terms of new developments, priorities, events etc.)

The priority is definitely to deal with the explosion of data we are facing. In addition to the directions exposed in the previous question (software and hardware solutions), one direction taken by IPDM on data management is the definition of quality data. For this multi-faceted topic, we have already started to set up tools to measure the quality of the data produced in term of resolution for example, based on the expertise in metrology of our facilities members, that we want to demonstrate in 2018. We will also concretize the collaborations between FBI and the other national infrastructures by running tests, for example on the speed of data transfers between node, in order to make sure that at the end of 2018, each user of the FBI nodes can easily access and process her/his data from anywhere.  A technical catalogue of software and hardware resources is under construction, to allow FBI nodes and beyond to benefit wisely from the tools and networks created by FBI. In the first semester of 2018, we will be organizing an event to discuss and define the changes in bio-image informatics that deep learning could bring about (further information to come soon, please refer to the FBI site).

References:

http://www.cnrs.fr/fr/recherche/prix/cristal.htm (in French)

Relevant publications:

Paul-Gilloteaux, Perrine, Heiligenstein Xavier et al. “EC-CLEM: flexible multidimensional registration software for correlative microscopies.” Nature Methods, vol. 14, no. 2, 2017, pp. 102–103., doi:10.1038/nmeth.4170. https://www.nature.com/articles/nmeth.4170