MUCIC - Masaryk University Cell Image Collection

In this web page, we introduce a collection of benchmark data that were generated either synthetically by a computer or created manually. In this sence, the collection serves as a complement to the existing benchmark datasets as it is commonly understood that the validation requires both testing sets: the real and the synthetic one. The real datasets with annotated ground truth offer a realistic variability that is difficult to reach in the synthetic data. Vice versa, the synthetic data can be generated in huge amount and the parameters of the acquisition process can be repeatedly changed with no additional costs. All the generated data are supported by the journal or conference papers.

Colon Tissue (fixed cells)

Here, you can find 30 synthetic images of human colon tissue including ground truth (foreground/background) images. The dataset was generated using the virtual microscope imitating the microscope Zeiss S100 (objective Zeiss 63x/1.40 Oil DIC) attached to confocal unit Atto CARV and CCD camera Micromax 1300-YHS. The image data was saved using three different file formats: ICS, HDF5 and 3D-TIFF. Please, feel free to select the format you prefer. All of them contain the same data.

Example images:

 
3D image 3D mask

The individual image files are aggregated in ZIP archives:

If you use this dataset in your research papers, please refer the following article:

This research was supported by the Ministry of Education of the Czech Republic (Projects No. LC535 and No. 2B06052).


HL60 Cell Line (fixed cells)

Here, you can find 30 synthetic images of nuclei of HL60 cell line including ground truth (foreground/background) images. Each image set contains 20 cell nuclei with specified probability of clustering (0%, 25%, 50%, and 75%). The dataset was generated using the virtual microscope imitating the microscope Zeiss S100 (objective Zeiss 63x/1.40 Oil DIC) attached to confocal unit Atto CARV and CCD camera Micromax 1300-YHS. The image data was saved using three different file formats: ICS, HDF5 and 3D-TIFF. Please, feel free to select the format you prefer. All of them contain the same data.

Example images:

 
3D image 3D labeled image

The individual image files are aggregated in ZIP archives.

If you use this dataset in your research papers, please refer the following article:

This research was supported by the Ministry of Education of the Czech Republic (Projects No. LC535 and No. 2B06052).


Granulocytes (fixed cells)

Here, you can find 30 synthetic images of nuclei of granulocytes including ground truth (foreground/background) images. Each image set contains up to 15 cell nuclei. The dataset was generated using the virtual microscope imitating the microscope Zeiss S100 (objective Zeiss 63x/1.40 Oil DIC) attached to confocal unit Atto CARV and CCD camera Micromax 1300-YHS. The image data was saved using three different file formats: ICS, HDF5 and 3D-TIFF. Please, feel free to select the format you prefer. All of them contain the same data. The individual image files are aggregated in ZIP archives.

Example images:

  
3D image   3D labeled image

If you use this dataset in your research papers, please refer the following article:

This research was supported by the Ministry of Education of the Czech Republic (Projects No. LC535 and No. 2B06052).


Time-lapse Image Sequences (living cells)

In this section, you can find a reference to collection of computer generated time-lapse image sequences of nuclei of HL60 cells stained with Hoescht (both 2D & 3D). The dataset was generated with the help of our virtual microscope. The dataset was prepared for the 1st a the 2nd edition of Cell Tracking Challenge joined to IEEE International Symposium on Biomedical Imaging

Example image sequence:

The data is available for free at this address: http://celltrackingchallenge.net/3d-datasets

If you use this dataset in your research papers, please refer one of the following articles:

This research was supported by the Czech Science Foundation, Grant No. GA14-22461S.


Vasculogenesis (living cells)

Here, you can find the sequence of frames recording the process called vasculogenesis. The endothelial cells, initially spread across the glass slide, tend to attach each other and form the networks with thin and elongated chords. This process forces some cells to be markedly elongated, as can be seen in the figure below. The dataset was generated with the help of our virtual microscope.

Example images:

  
3D image   3D mask

If you use this dataset in your research papers, please refer the following article:

This research was supported by the Czech Science Foundation, Grant No. GA14-22461S.


FiloData3D - Cells with filopodial protrusions (living cells)

This synthetic image data displays A549 lung cancer cells of three morphologically distinct phenotypes: wild-type (WT), CRMP-2-overexpressing (OE), and CRMP-2-phospho-defective (PD). Whereas the WT cells sporadically protrude a few single-branch filopodia that may appear and disappear, the OE cells protrude many of them with short lifetimes. By contrast, the PD cells protrude aberrantly long, branching filopodia with whole-experiment lifetimes. The image set consists of 180 synthetic 3D time-lapse sequences of single A549 lung cancer cells. The dataset includes three sequences for each of the three phenotypes, generated in 20 alternatives with five different signal-to-noise ratios and four different anisotropy ratios. Each sequence, being composed of 30 frames, was generated using FiloGen that imitates transient cell transfection with the green fluorescent protein conjugated to actin and the image formation process using a spinning disk confocal microscope equipped with a water Plan-Apochromatic 63x/1.20 objective lens.

Example images:

The synthetic time-lapse image data of a CRMP-2-phospho-defective cell for the given fluorescence level factor (F=1.0) and anisotropy ratio (AR=1).



The synthetic image data of a CRMP-2-overexpressing cell for varying fluorescence level factors (Fs), which indirectly affect the signal-to-noise ratios of the image data.

ZIP archive with TIFF files

If you use this dataset, being tagged as BBBC046 in the Broad Bioimage Benchmark Collection, in your research papers, please refer to the following articles:

This work was supported by the Czech Science Foundation under the grants number GJ16-03909Y and GA17-05048S, the Czech Ministry of Education, Youth and Sports under the grants number LTC17016 and CZ.02.1.01/0.0/0.0/16_013/0001775, the Russian Science Foundation under the grant number 17-11-01279, and the German Federal Ministry of Research and Education (BMBF) under the grant number 031L0102 (de.NBI).


Full segmentation annotations of 3D time-lapse microscopy images of MDA231 cells

In the field of biomedical image analysis, one of the richest benchmark datasets for objective comparison of cell segmentation and tracking methods is the Cell Tracking Challenge (CTC). The CTC offers two types of 3D reference annotations: Gold reference tracking annotations (TRA) and Silver reference segmentation annotations (ST).

We present fully volumetric manual annotations for two temporal sequences of a 3D CTC dataset, Fluo-C3DL-MDA231, which is characterized by the most complex cell shapes out of all 3D CTC datasets.




A 3D visualization of one cell mask that consists of 5 slices (w.r.t. the z-axis). The first row shows a 3D visualization of the 3 annotations: (A) Gold reference tracking annotations (TRA), (B) Silver reference segmentation annotations (ST), and (C) the proposed full annotations (FA). Image (D) shows all five 2D slices of the cell. Image (D) shows a 3D visualization of the slices (D). It is visible that thin protrusions are absent in images (A) and (B).

ZIP archive with TIF files

The respective microscopy images can be dowloaded from CTC website: http://celltrackingchallenge.net/3d-datasets/

We provide three human annotations for the first sequence and one for the second. For the first sequence, we also provide a result of majority voting.

The full annotations are consistent with CTC tracking annotations (TRA) and will be kept up-to-date in the future. Soon, we plan to provide more annotations and majority voting for the second sequence.

The dataset is used in a respective paper submitted to the ISBI25 conference. The work was founded by the Czech Science Foundation, project no. GA21-20374S.