Deprecated: The each() function is deprecated. This message will be suppressed on further calls in /home/zhenxiangba/zhenxiangba.com/public_html/phproxy-improved-master/index.php on line 456
US12540892B2 - Information processing device, information processing method, program, and information processing system - Google Patents
[go: Go Back, main page]

US12540892B2 - Information processing device, information processing method, program, and information processing system - Google Patents

Information processing device, information processing method, program, and information processing system

Info

Publication number
US12540892B2
US12540892B2 US17/636,013 US202017636013A US12540892B2 US 12540892 B2 US12540892 B2 US 12540892B2 US 202017636013 A US202017636013 A US 202017636013A US 12540892 B2 US12540892 B2 US 12540892B2
Authority
US
United States
Prior art keywords
unit
clustering
spectral data
information processing
pieces
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active, expires
Application number
US17/636,013
Other versions
US20220283073A1 (en
Inventor
Kenji Yamane
Junichiro Enoki
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sony Group Corp
Original Assignee
Sony Group Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Group Corp filed Critical Sony Group Corp
Assigned to Sony Group Corporation reassignment Sony Group Corporation ASSIGNMENT OF ASSIGNOR'S INTEREST Assignors: ENOKI, JUNICHIRO, YAMANE, KENJI
Publication of US20220283073A1 publication Critical patent/US20220283073A1/en
Priority to US19/421,764 priority Critical patent/US20260104343A1/en
Application granted granted Critical
Publication of US12540892B2 publication Critical patent/US12540892B2/en
Active legal-status Critical Current
Adjusted expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N15/00Investigating characteristics of particles; Investigating permeability, pore-volume or surface-area of porous materials
    • G01N15/10Investigating individual particles
    • G01N15/14Optical investigation techniques, e.g. flow cytometry
    • G01N15/1429Signal processing
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/52Use of compounds or compositions for colorimetric, spectrophotometric or fluorometric investigation, e.g. use of reagent paper and including single- and multilayer analytical elements
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/10Pre-processing; Data cleansing
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/231Hierarchical techniques, i.e. dividing or merging pattern sets so as to obtain a dendrogram
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0499Feedforward networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/088Non-supervised learning, e.g. competitive learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/58Extraction of image or video features relating to hyperspectral data
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/60Extraction of image or video features relating to illumination properties, e.g. using a reflectance or lighting model
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/60Type of objects
    • G06V20/69Microscopic objects, e.g. biological cells or cellular parts
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N15/00Investigating characteristics of particles; Investigating permeability, pore-volume or surface-area of porous materials
    • G01N15/10Investigating individual particles
    • G01N2015/1006Investigating individual particles for cytology
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00Indexing scheme relating to image or video recognition or understanding
    • G06V2201/03Recognition of patterns in medical or anatomical images

Definitions

  • the present disclosure relates to an information processing device, an information processing method, a program, and an information processing system.
  • a flow cytometer In the fields of medicine, biochemistry, and the like, in general, a flow cytometer is used in order to quickly measure characteristics of each of a large number of particles.
  • the flow cytometer is a device that irradiates a particle such as a cell or a bead flowing in a flow cell with a light beam to detect fluorescence, scattered light, or the like emitted from the particle, and optically measures characteristics of each particle.
  • the present disclosure proposes an information processing device, an information processing method, a program, and an information processing system capable of analyzing data increased by multicoloring.
  • an information processing device comprises: a dimensional compression unit that executes dimensional compression on each of a plurality of pieces of spectral data including a fluorescent component emitted from each of a plurality of particles labeled with one or more fluorescent dyes; an initial value determination unit that determines an initial value for each of a plurality of nodes on the basis of a result of the dimensional compression; and a clustering unit that executes clustering on the plurality of pieces of spectral data using the initial value.
  • FIG. 1 is a block diagram illustrating a configuration of an information processing system according to a first embodiment.
  • FIG. 2 is a diagram illustrating an example of a fluorescence spectrum.
  • FIG. 3 is a diagram illustrating an example of a pre-processing parameter table.
  • FIG. 4 is a diagram illustrating an example of a spectrum plot.
  • FIG. 5 is a diagram illustrating an example of a clustering result.
  • FIG. 6 is a diagram for explaining fluorescence separation processing.
  • FIG. 7 is a diagram illustrating a display example of an analysis result.
  • FIG. 8 is a flowchart illustrating a flow of pre-processing by a pre-processing unit.
  • FIG. 9 is a flowchart illustrating a flow of processing by a clustering processing unit.
  • FIG. 10 is a flowchart illustrating a flow of fluorescence separation processing by a fluorescence separation unit.
  • FIG. 11 is a schematic diagram for explaining a series of flow from initialization to learning in FlowSOM.
  • FIG. 12 is a block diagram illustrating a configuration of an information processing system according to a second embodiment.
  • FIG. 13 is a flowchart illustrating an operation example of a pre-processing/fluorescence separation unit according to the second embodiment.
  • FIG. 14 is a flowchart illustrating an example of clustering processing executed by a clustering processing unit according to the second embodiment.
  • FIG. 15 is a diagram for explaining operation at the time of initializing a representative node vector of each representative node according to the present embodiment, described in step S 222 of FIG. 14 .
  • FIG. 16 is a flowchart illustrating an example of clustering by batch learning according to the present embodiment, described in step S 223 of FIG. 14 .
  • FIG. 17 is a diagram for explaining operation for determining the number of clusters using consensus clustering, described in step S 224 of FIG. 14 .
  • FIG. 18 is a block diagram illustrating a hardware configuration example of an information processing device.
  • a recent flow cytometer can acquire more abundant information by staining a particle such as a cell with a plurality of fluorescent dyes and measuring many fluorescence signals at one time. Meanwhile, due to such multicoloring of the flow cytometer, a limit of analysis by manual gating as in related art is pointed out. For example, when n types of fluorescent dyes are plotted in a two-dimensional coordinate system, there are nC2 combination patterns of fluorescent dyes. That is, there are 15 combination patterns for six colors, and there are 190 combination patterns for 20 colors.
  • a filter type flow cytometer that detects intensity of one or more specific wavelengths in fluorescence and a spectrum type flow cytometer that can acquire more information by detecting fluorescence emitted from each fluorescent dye as a wavelength spectrum.
  • a spectrum type flow cytometer that can acquire more information by detecting fluorescence emitted from each fluorescent dye as a wavelength spectrum.
  • fluorescence intensity for each wavelength is obtained, it is possible to acquire a large amount of features from one particle (also referred to as a sample). Therefore, by adopting the spectral type flow cytometer, more detailed analysis can be performed.
  • an information processing device an information processing method, a program, and an information processing system capable of making a display result of spectral data and a clustering result correspond to each other more correctly will be described with an example.
  • logicle conversion is performed as pre-processing on spectral data acquired by the flow cytometer.
  • clustering is performed using the spectral data that has been subjected to the logicle conversion, and a clustering result obtained by the clustering is displayed to a user.
  • it can be determined whether or not clustering is performed on a portion having a large value in the spectral data. Therefore, it is possible to perform clustering such that a display result of the spectral data and a clustering result more correctly correspond to each other.
  • the information processing device, the information processing method, the program, and the information processing system according to the present embodiment will be described in detail with reference to the drawings.
  • FIG. 1 is a block diagram illustrating a configuration of an information processing system according to the present embodiment.
  • an information processing system 4 includes an information processing device 1 and a measurement device 3 .
  • the measurement device 3 is a measurement device capable of detecting a fluorescence spectrum of each color from a cell or the like to be measured.
  • the measurement device 3 is, for example, a flow cytometer.
  • a measurement sample measured by the flow cytometer may be a biologically derived particle such as a cell, a microorganism, or a biologically relevant particle.
  • the cell may be an animal cell (for example, a corpuscle-based cell), or a plant cell.
  • the microorganism may be a bacterium such as Escherichia coli , a virus such as tobacco mosaic virus, or a fungus such as yeast.
  • the biologically relevant particle may be a particle constituting a cell such as a chromosome, a liposome, mitochondria, or various organelles.
  • the biologically relevant particle may include a biologically relevant polymer such as a nucleic acid, a protein, a lipid, a sugar chain, or a complex thereof.
  • Each of these biologically derived particles may have either a spherical shape or a non-spherical shape, and is not particularly limited in size and mass.
  • the measurement sample may be an industrially synthesized particle such as a latex particle, a gel particle, or an industrial particle.
  • the industrially synthesized particle may be a particle synthesized with an organic resin material such as polystyrene or polymethyl methacrylate, an inorganic material such as glass, silica, or a magnetic body, or a metal such as gold colloid or aluminum.
  • each of these industrially synthesized particles may have either a spherical shape or a non-spherical shape, and is not particularly limited in size and mass.
  • the measurement sample can be labeled (stained) with one or more fluorescent dyes prior to measurement of a fluorescence spectrum.
  • the measurement sample may be labeled with a fluorescent dye by a known method. Specifically, when the measurement sample is a cell, a fluorescently labeled antibody that is selectively bonded to an antigen present on a cell surface is mixed with a cell to be measured, and the fluorescently labeled antibody is bonded to an antigen on a surface of the cell. As a result, the cell to be measured can be labeled with a fluorescent dye. Alternatively, the cell to be measured can be labeled with a fluorescent dye by mixing a fluorescent dye that is selectively taken into a specific cell with the cell to be measured.
  • the fluorescently labeled antibody is an antibody to which a fluorescent dye is bonded as a label.
  • the fluorescently labeled antibody may be an antibody to which a fluorescent dye is directly bonded.
  • the fluorescently labeled antibody may be an antibody obtained by bonding a fluorescent dye to which avidin is bonded to a biotin-labeled antibody by an avidin-biotin reaction. Note that as the antibody, either a polyclonal antibody or a monoclonal antibody can be used.
  • the fluorescent dye for labeling a cell is not particularly limited, and at least one or more known dyes used for staining a cell or the like can be used.
  • the fluorescent dye phycoerythrin (PE), fluorescein isothiocyanate (FITC), PE-Cy5, PE-Cy7, PE-Texas Red (registered trademark), allophycocyanin (APC), APC-Cy7, ethidium bromide, propidium iodide, Hoechst (registered trademark) 33258, Hoechst (registered trademark) 33342, 4′,6-diamidino-2-phenylindole (DAPI), acridineorange, chromomycin, mithramycin, olivomycin, pyronin Y, thiazole orange, rhodamine 101, isothiocyanate, BCECF, BCECF-AM, C.
  • SNARF-1 C. SNARF-1-AMA, aequorin, Indo-1, Indo-1-AM, Fluo-3, Fluo-3-AM, Fura-2, Fura-2-AM, oxonol, Texas Red (registered trademark), Rhodamine 123, 10-N-nony-acridine orange, fluorescein, fluorescein diacetate, carboxyfluorescein, carboxyfluorescein diacetate, carboxydichlorofluorescein, and carboxydichlorofluorescein diacetate can be used.
  • derivatives of the above-described fluorescent dyes and the like can also be used.
  • the flow cytometer includes a laser light source that emits laser light having a wavelength capable of exciting a fluorescent dye with which a measurement sample S is labeled, a flow cell through which the measurement sample S flows in one direction, and a photodetector that receives one or more of fluorescence, phosphorescence, and scattered light from the measurement sample S irradiated with the laser light.
  • the laser light source is, for example, a semiconductor laser light source that emits laser light having a predetermined wavelength.
  • a plurality of laser light sources may be disposed. When the plurality of laser light sources is disposed, positions irradiated with laser light from the laser light sources may be the same as or different from each other in the flow cell. However, in a case where different positions are irradiated with laser light from the plurality of laser light sources, light from the measurement sample S can be detected by different photodetectors. In such a case, even in a case where dyes that emit fluorescent rays having close wavelengths are used, a fluorescence spectrum of each of the fluorescent rays can be measured without color mixing.
  • the laser light emitted from the laser light source may be either pulsed light or continuous light.
  • the laser light source a plurality of semiconductor laser light sources that emits laser light having a wavelength of 480 nm and laser light having a wavelength of 640 nm may be used.
  • the flow cell is a flow path through which the plurality of measurement samples S flows in line in one direction. Specifically, through the flow cell, a sheath liquid enclosing the measurement samples S flows at a high speed as a laminar flow, and the plurality of measurement samples S thereby flows in line in one direction.
  • the flow cell can be formed in a microchip or a cuvette.
  • the photodetector detects light from the measurement sample S irradiated with laser light by photoelectric conversion.
  • the light from the measurement sample S can include at least one of fluorescence, phosphorescence, and scattered light.
  • the photodetector may include a detector that detects scattered light LS including forward scattered light and side scattered light from the measurement sample S, and a light receiving element array that detects fluorescence from the measurement sample S.
  • the detector may be, for example, a known photoelectric conversion element such as a charge coupled device (CCD), a complementary metal oxide semiconductor (CMOS), or a photodiode.
  • the light receiving element array can be constituted by, for example, arranging a plurality of independent detection channels having different wavelength ranges of light to be detected.
  • the light receiving element array may be a light receiving element array in which a plurality of photo multiplier tubes (PMTs) or photodiodes having different wavelength ranges to be detected are arranged one-dimensionally or the like, an image sensor in which pixels are arranged in a two-dimensional lattice pattern, or the like.
  • the light receiving element array photoelectrically converts fluorescence from the measurement sample S spectrally dispersed for each wavelength by a spectroscopic element such as a prism or a grating.
  • the measurement sample S flowing in the flow cell is irradiated with laser light emitted from the laser light source.
  • the measurement sample S emits scattered light and fluorescence (or phosphorescence) by being irradiated with the laser light.
  • the scattered light emitted from the measurement sample S is detected by the detector.
  • the fluorescence emitted from the measurement sample S is spectrally dispersed into light for each wavelength by the spectroscopic element and then received by the light receiving element array. As a result, a spectrum of the fluorescence emitted from the measurement sample S is detected.
  • the measurement device 3 is, for example, a spectrum type flow cytometer that causes a fluorescent-stained cell to flow through a flow cell at a high speed and irradiates the flowing cell with a light beam to detect a fluorescence spectrum for each fluorescent dye emitted from the cell.
  • FIG. 2 is a diagram illustrating an example of the fluorescence spectrum. As illustrated in FIG. 2 , the fluorescence spectrum is expressed by fluorescence intensity for each channel corresponding to a wavelength.
  • the measurement device 3 outputs a detected fluorescence spectrum as measurement data 2 .
  • the measurement data 2 includes spectral data of fluorescence for each cell.
  • the measurement device 3 transfers the measurement data 2 to, for example, the information processing device 1 .
  • the information processing device 1 acquires and analyzes the measurement data 2 measured by the measurement device 3 , and displays an analysis result. Note that the information processing device 1 and the measurement device 3 may be connected to each other via a network, and the information processing device 1 may acquire the measurement data 2 via the network.
  • the information processing device 1 includes a pre-processing unit 11 , a pre-processing parameter table 12 , a spectrum output unit 13 , a clustering processing unit 14 , a clustering result presentation unit 15 , a fluorescence separation unit 16 , and a normal analysis presentation unit 17 .
  • a pre-processing unit 11 a pre-processing parameter table 12 , a spectrum output unit 13 , a clustering processing unit 14 , a clustering result presentation unit 15 , a fluorescence separation unit 16 , and a normal analysis presentation unit 17 .
  • the pre-processing parameter table 12 , the clustering processing unit 14 , and the fluorescence separation unit 16 may be performed in a cloud. In this case, the measurement data 2 is also transferred to the cloud.
  • the pre-processing unit 11 performs pre-processing on spectral data according to a selected parameter.
  • the pre-processing is coordinate conversion for performing display from an actual observation value.
  • the conversion may be, for example, simple log 10 conversion or logicle conversion considering characteristics of an observation device.
  • the parameter includes W, T, M, and A.
  • W is a value that linearly displays a value near zero.
  • T is a maximum value of fluorescence intensity, and is, for example, 10 4 .
  • M is a maximum value of a display coordinate after conversion.
  • A is a minimum negative value to be converted.
  • the pre-processing parameter table 12 is a table that stores a pre-processing parameter.
  • FIG. 3 is a diagram illustrating an example of the pre-processing parameter table 12 .
  • the pre-processing parameter table 12 stores a plurality of combinations (also referred to as parameter sets) of values of W, T, M, and A.
  • a parameter ID is an identifier that identifies a combination of parameters.
  • the pre-processing unit 11 performs logicle conversion using a combination of W, T, M, and A selected from the pre-processing parameter table 12 by a user. Note that the pre-processing unit 11 may use a default value instead of selecting a parameter from the pre-processing parameter table 12 . Alternatively, a user can designate a value other than a value stored in the pre-processing parameter table 12 .
  • the pre-processing unit 11 performs pre-processing every time a user changes a pre-processing parameter. Note that the pre-processing unit 11 may perform conversion of non-linear processing such as log conversion or bi-exponential conversion instead of logicle conversion.
  • the spectrum output unit 13 generates an image of a spectrum plot using spectral data that has been pre-processed by the pre-processing unit 11 , and displays the generated image.
  • FIG. 4 is a diagram illustrating an example of the spectral plot.
  • the spectral plot indicates a detection wavelength (wavelength) on the horizontal axis and fluorescence intensity on the vertical axis, and expresses information (population information) regarding the number of fine particles (the number of events or the density) by different color shades, color tones, and the like.
  • LD 488 on the vertical axis indicates fluorescence when laser light having a wavelength of 488 nm (nanometre) is emitted, and “A” indicates that a measured value is cumulative intensity.
  • information regarding the number of fine particles is expressed by shading, but in an actual screen, the information regarding the number of fine particles is expressed by color.
  • the clustering processing unit 14 perform clustering on a cell using spectral data that has been pre-processed by the pre-processing unit 11 .
  • K is designated, for example, as K-means, and the clustering processing unit 14 classifies the spectral data into K clusters.
  • the clustering processing unit 14 may automatically determine the number of divisions as in Flow self-organizing map (FlowSOM).
  • FlowSOM Flow self-organizing map
  • the clustering processing unit 14 may perform dimensional compression and perform gating on a result of the dimension compression to perform clustering.
  • the clustering processing unit 14 may perform two-stage clustering such as meta clustering and use two cluster definitions such as a cluster ID and a meta cluster ID.
  • the meta cluster is a collection of clusters.
  • the clustering processing unit 14 can easily make a display result of the spectral data correspond to a clustering result.
  • the clustering result presentation unit 15 displays a clustering result by the clustering processing unit 14 on a display device.
  • the clustering result presentation unit 15 visualizes the number of classifications or which cell group belongs to which group.
  • FIG. 5 is a diagram illustrating an example of the clustering result.
  • FIG. 5 illustrates a case where clustering is performed using FlowSOM.
  • circles indicate clusters, and the clusters are classified into meta clusters M #1 to M #5 with different shadings.
  • the meta clusters M #1 to M #5 are displayed in different colors.
  • the fluorescence separation unit 16 acquires the measurement data 2 and performs fluorescence separation processing (also referred to as unmixing).
  • FIG. 6 is a diagram for explaining fluorescence separation processing.
  • FIG. 6 ( a ) illustrates a measured fluorescence spectrum.
  • the measured fluorescence spectrum is obtained by superimposing fluorescence spectra of, for example, three fluorescent rays.
  • the fluorescence separation unit 16 separates the spectrum into three spectra of fluorescence #1 to #3 using a reference spectrum illustrated in FIG. 6 ( c ).
  • the reference spectrum is a fluorescence spectrum for each fluorescence.
  • the spectra of the separated fluorescence #1 to #3 are illustrated in FIG. 6 ( d ) .
  • the fluorescence separation unit 16 calculates fluorescence intensity by, for example, taking a weighted average using the spectrum separated for each fluorescence.
  • FIG. 6 ( e ) illustrates the intensities of the fluorescence #1 to #3 calculated by the fluorescence separation unit 16 .
  • FIG. 7 is a diagram illustrating a display example of the analysis result.
  • FIG. 7 illustrates a two-dimensional plot with two axes, APC-Cy7::CD 24 and PE-Dazzle 594::CD 38.
  • APC-Cy7::CD 24 and PE-Dazzle 594::CD 38 are fluorescent dye-labeled antibodies used for measurement of fluorescence intensity.
  • “APC-Cy7” and “PE-Dazzle 594” are fluorescent dyes
  • “CD 24” and “CD 38” are antibodies.
  • the information processing device 1 may perform pre-processing and clustering on a cell group included in a partial region of the two-dimensional plot in response to selection of the partial region by a user, and display a clustering result.
  • the information processing device 1 may perform pre-processing and clustering on a cell group included in a partial region of the spectral plot in response to selection of the partial region by a user, and display a clustering result.
  • a characteristic portion such as dense portion in the spectral plot is often selected by a user.
  • FIG. 8 is a flowchart illustrating a flow of pre-processing by the pre-processing unit 11 .
  • the pre-processing unit 11 selects a pre-processing parameter on the basis of a user's instruction (step S 1 ).
  • the pre-processing unit 11 performs pre-processing on spectral data using the pre-processing parameter (step S 2 ). Then, the pre-processing unit 11 determines whether or not a user has changed the pre-processing parameter (step S 3 ). If the user has changed the pre-processing parameter, the pre-processing unit 11 changes the pre-processing parameter and performs pre-processing on the spectral data (step S 4 ). The process returns to step S 3 .
  • the pre-processing unit 11 determines whether or not all target cells have been processed (step S 5 ). If there is a target cell that has not been processed, the process returns to step S 2 , and another cell is processed. Meanwhile, if all target cells have been processed, the pre-processing unit 11 instructs the spectrum output unit 13 to present the pre-processed spectral data to a user (step S 6 ). In addition, the pre-processing unit 11 transmits the pre-processed spectral data to the clustering processing unit 14 .
  • the pre-processing unit 11 performs pre-processing on the spectral data, and the clustering processing unit 14 can thereby make the clustering result correspond to the display result of the spectral data.
  • FIG. 9 is a flowchart illustrating a flow of processing by the clustering processing unit 14 .
  • the clustering processing unit 14 acquires the pre-processed spectral data from the pre-processing unit 11 (step S 11 ), and performs clustering processing (step S 12 ). Then, the clustering processing unit 14 instructs the clustering result presentation unit 15 to present a clustering result (step S 13 ).
  • the clustering processing unit 14 performs clustering using the pre-processed spectral data, it is possible to make the clustering result correspond to the display result of the spectral data.
  • FIG. 10 is a flowchart illustrating a flow of fluorescence separation processing by the fluorescence separation unit 16 .
  • the fluorescence separation unit 16 acquires a reference spectrum (step S 21 ). Then, the fluorescence separation unit 16 acquires spectral data for one cell (step S 22 ) and performs fluorescence separation processing (step S 23 ). Then, the fluorescence separation unit 16 determines whether or not all cells have been processed (step S 24 ). If there is a cell that has not been processed, the process returns to step S 22 . If all cells have been processed, fluorescence separation processing ends.
  • the normal analysis presentation unit 17 can analyze the fluorescence data and display the analysis result.
  • the pre-processing unit 11 acquires spectral data and performs logicle conversion as pre-processing. Then, the clustering processing unit 14 performs clustering using spectral data that has been subjected to logicle conversion. Then, the clustering result presentation unit 15 displays ae clustering result on a display device. Therefore, the information processing device 1 can prevent clustering from being performed on a portion having a large value in the spectral data. Therefore, the information processing device 1 can perform clustering such that the display result of the spectral data and the clustering result correspond to each other.
  • the recent flow cytometer has become multispectral, in which a particle such as a cell is stained with a plurality of fluorescent dyes.
  • a method for classifying data by clustering such as FlowSOM and then analyzing the data is considered as described above.
  • FIG. 11 is a schematic diagram for explaining a series of flow from initialization to learning in FlowSOM.
  • the example illustrated in FIG. 11 indicates a case where 100 vectors (coordinate values) included in a data group 900 to be analyzed are classified into nine representative nodes N 11 to N 33 arranged in three rows and three columns in a two-dimensional coordinate system in an initial state.
  • a predetermined number (nine in this example) of pieces of data are selected from the data group 900 by random sampling, and the selected pieces of data are set as initial node vectors N_ij of the representative nodes N 11 to N 33 (step S 1 ).
  • step S 2 learning to update the representative node vector N_ij of each of the representative nodes N 11 to N 33 is executed. Specifically, first, one piece of data is acquired from the data group 900 by random sampling (step S 2 ).
  • represents a learning ratio.
  • the learning ratio ⁇ may be set on the basis of an empirical rule or the like, and may be, for example, a value such as 0.1 or 0.01.
  • New representative node vector N _ ij (1 ⁇ ) ⁇ old representative node vector N _ ij + ⁇ vector of acquired data (1)
  • ⁇ (r) may be a parameter of an algorithm determined on the basis of an empirical rule, and may be, for example, a value such as 0.1 or 0.01. In addition, ⁇ (r) may be a fixed value or may be changed depending on the number of times of learning.
  • a ⁇ (r) value may be changed between first learning and second learning.
  • the same may apply to a.
  • New peripheral representative node vector N _ ij (1 ⁇ ( r )) ⁇ old peripheral representative node vector+ ⁇ ( r ) ⁇ vector of acquired data (2)
  • steps S 2 to S 4 is repeatedly executed until the processing for all the pieces of data included in the data group 900 is completed.
  • step S 2 to S 4 since learning is performed sequentially by random sampling (steps S 2 to S 4 ), the processing is inseparable processing in which updating of a vector of an SOM node cannot be parallelized (divided). As a result, there is a problem that processing efficiency or processing speed cannot be improved.
  • the number of pieces of data to be analyzed has increased due to an improvement in measurement speed of the flow cytometer.
  • the number of pieces of data increases, there is not only a problem that drawing time in data analysis increases, but also a problem that analysis time increases due to a simple increase in the number of targeted clusters in addition to the increase in the drawing time when a cluster generated by a user is additionally analyzed.
  • an information processing device an information processing method, a program, and an information processing system capable of efficiently executing clustering capable of suppressing variations in results depending on execution will be described with an example.
  • FIG. 12 is a block diagram illustrating a configuration of an information processing system according to the present embodiment.
  • an information processing system 200 includes an information processing device 201 and a measurement device 3 .
  • the measurement device 3 and a measurement sample may be similar to the measurement device 3 and the measurement sample described in the first embodiment, and therefore detailed description thereof is omitted here.
  • the information processing device 201 includes a pre-processing/fluorescence separation unit 211 , a pre-processing parameter table 12 , a spectrum output unit 13 , a clustering processing unit 214 , a clustering result presentation unit 15 , and a normal analysis presentation unit 17 .
  • the pre-processing parameter table 12 , the clustering result presentation unit 15 , and the normal analysis presentation unit 17 may be similar to those according to the first embodiment.
  • the pre-processing/fluorescence separation unit 211 may have the functions of both the pre-processing unit 11 and the fluorescence separation unit 16 according to the first embodiment.
  • the spectrum output unit 13 is omitted in FIG. 12 , but a spectrum output unit 13 similar to that of the first embodiment may be added.
  • the pre-processing/fluorescence separation unit 211 separates measurement data 2 , which is spectral data, into a fluorescence spectrum for each fluorescent dye by using a reference spectrum. Then, the pre-processing/fluorescence separation unit 211 executes pre-processing such as Logicle conversion on an unmixed fluorescence spectrum. The pre-processed fluorescence spectrum for each fluorescent dye is presented to a user by the normal analysis presentation unit 17 .
  • a parameter set of conversion parameters used in the pre-processing may be selected by a user's designation of a parameter set to be used from parameter sets managed in the pre-processing parameter table 12 (see FIG. 3 ) as in the first embodiment.
  • the parameter sets managed in the pre-processing parameter table 12 may be finely adjustable by a user.
  • the clustering processing unit 214 executes clustering processing on the pre-processed fluorescence spectrum output from the pre-processing/fluorescence separation unit 211 .
  • a clustering result generated by the clustering processing unit 214 is presented to a user by the clustering result presentation unit 15 .
  • FIG. 13 is a flowchart illustrating an operation example of the pre-processing/fluorescence separation unit according to the present embodiment.
  • the pre-processing/fluorescence separation unit 211 first selects a pre-processing parameter on the basis of a user's instruction as in step S 1 of FIG. 8 (step S 201 ).
  • the pre-processing/fluorescence separation unit 211 executes fluorescence separation processing using a reference spectrum on spectral data of all the cells included in the measurement data 2 (step S 202 ).
  • the pre-processing/fluorescence separation unit 211 executes pre-processing using a pre-processing parameter selected or changed by a user on the spectral data of all the cells included in the measurement data 2 (steps S 203 to S 206 ).
  • the pre-processing/fluorescence separation unit 211 inputs the pre-processed spectral data to the clustering processing unit 214 (step S 207 ), and ends this operation.
  • the pre-processing/fluorescence separation unit 211 may instruct the spectrum output unit 13 to present the pre-processed spectral data to a user, for example, as in step S 6 of FIG. 8 .
  • the spectral data of all the cells included in the measurement data 2 is targeted, but the target of this operation is not limited thereto, and only needs to be spectral data of the number of cells equal to or larger than the number of node divisions (hereinafter, also referred to as a necessary number) in clustering described later. The same may apply to operation of a clustering processing unit described later.
  • a basic flow of operation executed by the clustering processing unit 214 according to the present embodiment may be, for example, similar to the operation described with reference to FIG. 9 in the first embodiment. However, in the present embodiment, the clustering processing executed in step S 12 of FIG. 9 is replaced with clustering processing described later with reference to FIG. 14 .
  • FIG. 14 is a flowchart illustrating an example of the clustering processing executed by the clustering processing unit according to the present embodiment.
  • the clustering processing unit 214 according to the present embodiment can also function as one or more of a dimensional compression unit, an initial value determination unit, a clustering unit, an average value calculation unit, one or more allocation units, an update unit, a node number changing unit, a meta clustering unit, a division unit, and a node setting unit in claims.
  • the clustering processing unit 214 first executes main component analysis on spectral data of all the cells (or the number of cells equal to or larger than the necessary number) included in the measurement data 2 (step S 221 ).
  • the spectral data targeted in step S 221 may be spectral data pre-processed after fluorescence separation.
  • the number of main components is two, but is not limited thereto, and may be three or more.
  • step S 221 not only the main component analysis but also various types of dimensional compression using a statistical data analysis method such as T-SNE may be executed.
  • the clustering processing unit 214 determines an initial value of a representative node vector of an SOM node on the basis of a first main component and a second main component determined in step S 221 and the values (vectors) of all the cells (or the number of cells equal to or larger than the necessary number) included in the measurement data 2 (step S 222 ). Note that details of step S 222 will be described later with reference to FIG. 15 .
  • step S 223 the clustering processing unit 214 executes SOM clustering processing by performing batch learning. Note that details of step S 223 will be described later with reference to FIG. 16 .
  • the clustering processing unit 214 executes processing of determining the number of clusters (hereinafter, referred to as meta clustering) such as consensus clustering on a result of the SOM clustering processing executed in step S 223 (step S 224 ). As a result, the number of clusters in the clustering processing is determined. Note that details of step S 223 in a case of using consensus clustering will be described later with reference to FIG. 17 .
  • the clustering processing unit 214 instructs the clustering result presentation unit 15 to present a clustering result to a user (step S 225 ).
  • a minimum spanning tree (MST) method can be used for visualization for presenting the clustering result to the user.
  • FIG. 15 is a diagram for explaining operation at the time of initializing a representative node vector of each representative node according to the present embodiment, described in step S 222 of FIG. 14 .
  • the example illustrated in FIG. 15 illustrates a case where 100 pieces of processed spectral data (vectors (coordinate values)) on which the pre-processing/fluorescence separation unit 211 has executed fluorescence separation and pre-processing are classified into nine representative nodes N 11 to N 33 arranged in three rows and three columns in a two-dimensional coordinate system in an initial state, similarly to the example illustrated in FIG. 11 .
  • the clustering processing unit 214 calculates an average value in each dimension for a data group including spectral data after fluorescence separation and pre-processing (hereinafter, referred to as processed spectral data) (hereinafter, simply referred to as a data group) 302 (step S 301 ). For example, in a case where the number of dimensions of data is ten, for each of one to ten dimensions, values in the dimensions in all pieces of the processed spectral data included in the data group 302 are summed, and an average thereof is calculated.
  • processed spectral data hereinafter, simply referred to as a data group
  • the number of dimensions is a value corresponding to the number of types of data, and can correspond to the number of channels in a case of spectral data, for example. Therefore, for example, in a case where a light receiving element array has 32 channels, that is, a light receiving element array detects fluorescence by dividing an entire detection range into 32 wavelength bands, the number of dimensions of spectral data acquired by the light receiving element array is 32.
  • the clustering processing unit 214 performs dimensional compression on all pieces of the processed spectral data included in the data group 302 to determine a first eigenvalue, a second eigenvalue, variance of the first eigenvalue, and variance of the second eigenvalue (step S 302 ).
  • a statistical data analysis method such as main component analysis can be used.
  • the first eigenvalue and the second eigenvalue may be selected according to a predetermined rule or arbitrarily from a dimension after the dimensional compression, and the number of the eigenvalues is not limited to two of the first eigenvalue and the second eigenvalue, and may be one or three or more, for example.
  • the clustering processing unit 214 calculates an initial node vector of each of the representative nodes N 11 to N 33 using the following formula (3) (step S 303 ).
  • an initial value (initial node vector) of a representative node vector of a representative node Nij having coordinates (i, j) is represented by N_ij.
  • the number of row divisions is the number of representative nodes arranged in a row direction, and is three in this example.
  • the number of column divisions is the number of representative nodes arranged in a column direction, and is three in this example.
  • Initial node vector N _ ij average value in each dimension+(first eigenvalue/number of row divisions) ⁇ i +(second eigenvalue/number of column divisions) ⁇ j (3)
  • FIG. 16 is a flowchart illustrating an example of clustering by batch learning according to the present embodiment described in step S 223 of FIG. 14 .
  • the example illustrated in FIG. 16 illustrates a case where 100 pieces of processed spectral data (vectors (coordinate values)) on which the pre-processing/fluorescence separation unit 211 has executed fluorescence separation and pre-processing are classified into nine representative nodes N 11 to N 33 arranged in three rows and three columns in a two-dimensional coordinate system in an initial state, similarly to the example illustrated in FIG. 15 .
  • the clustering processing unit 214 executes processing of selecting processed spectral data piece by piece from the data group 302 and allocating the selected piece of processed spectral data to a representative node closest thereto (step S 304 ). This allocation processing is repeated until allocation of all pieces (or the number of pieces equal to or larger than the necessary number) of processed spectral data in the data group 302 is completed.
  • the clustering processing unit 214 executes processing of selecting processed spectral data piece by piece from the data group 302 and allocating the selected piece of processed spectral data to a representative node closest thereto (step S 304 ). This allocation processing is repeated until allocation of all pieces (or the number of pieces equal to or larger than the necessary number) of processed spectral data in the data group 302 is completed.
  • the clustering processing unit 214 executes processing of selecting processed spectral data piece by piece from the data group 302 and allocating the selected piece of processed spectral data to a representative node closest thereto (step S 304 ). This allocation processing is repeated until allocation of
  • the pieces of processed spectral data with cell IDs ‘1’ and ‘3’ are allocated to a representative node N 11
  • the piece of processed spectral data with a cell ID ‘2’ is allocated to a representative node N 12
  • the cell ID may be an identifier for uniquely identifying a cell corresponding to the processed spectral data registered in the data group 302 .
  • the order in selecting the processed spectral data from the data group 302 is not particularly limited, and may be, for example, various orders such as ascending order of cell IDs for uniquely identifying cells of the processed spectral data registered in the data group 302 and registration order in the data group 302 .
  • the clustering processing unit 214 executes processing of updating the representative node vector of each of the representative nodes N 11 to N 33 on the basis of the processed spectral data allocated to each of the representative nodes N 11 to N 33 (step S 305 ).
  • a new representative node vector after the update is calculated on the basis of the following formula (4).
  • new representative node vector N _ ij (1 ⁇ ) ⁇ old representative node vector N _ ij + ⁇ average value of allocated processed spectral data (4)
  • the clustering processing unit 214 executes processing of further multiplying the change by the formula (4) by ⁇ , represented by the following formula (5), on the peripheral representative node located around the representative node having the representative node vector N_ij updated in step S 305 , and thereby updates the peripheral representative node vector N_ij of the peripheral representative node (step S 306 ).
  • the representative node vector of each of the representative nodes N 11 to N 33 is updated by repeating the processing in steps S 304 to S 306 described above a predetermined number of times (for example, one or more times). Note that in the repetition of the processing in steps S 304 to S 306 , the learning ratio ⁇ and/or ⁇ may change (for example, decrease or increase) every time the processing is repeated.
  • processing from the allocation of the processed spectral data to the update of the representative node vector can be subdivided and executed by different information processing devices.
  • the processed spectral data included in the data group 302 can be divided into a plurality of groups, and the spectral data belonging to each group can be allocated to the representative node by different information processing devices.
  • the clustering processing unit 214 includes an allocation unit that allocates the plurality of pieces of processed spectral data to any of the representative nodes N 11 to N 33 , the number of allocation units included in the clustering processing unit 214 may be equal to or less than the number of groups, and the allocation units may be executed in different information processing devices.
  • FIG. 17 is a diagram for explaining operation for determining the number of clusters using consensus clustering described in step S 224 of FIG. 14 .
  • the example illustrated in FIG. 17 illustrates a case where 100 pieces of processed spectral data (vectors (coordinate values)) on which the pre-processing/fluorescence separation unit 211 has executed fluorescence separation and pre-processing are classified into nine representative nodes N 11 to N 33 arranged in three rows and three columns in a two-dimensional coordinate system in an initial state, similarly to the examples illustrated in FIGS. 15 and 16 .
  • step S 224 of FIG. 14 by executing meta clustering such as consensus clustering, a more suitable number of divisions and a combination of representative nodes constituting a representative node after meta clustering (hereinafter, referred to as a meta representative node) are determined as illustrated in (b) of FIG. 17 .
  • meta representative node a more suitable number of divisions and a combination of representative nodes constituting a representative node after meta clustering
  • FIG. 17 exemplifies a case where, as a result of meta clustering, one meta representative node NN 1 is constituted by representative nodes N 11 , N 12 , and N 21 , one meta representative node NN 2 is constituted by representative nodes N 22 , N 31 , and N 32 , and one meta representative node NN 3 is constituted by representative nodes N 13 , N 23 , and N 33 .
  • the number of node divisions (nine in this example) set at the time of execution of clustering may be the number of divisions determined by default or the number of divisions set by a user.
  • the information processing device 201 may further include an operation input unit that functions as a node setting unit that causes a user to set the number of node divisions.
  • the initial node vector (initial value) of each representative node is determined on the basis of an average value of the processed spectral data for each dimension and an eigenvalue obtained by dimensional compression of the processed spectral data, it is possible to set the same initial node vector all the time for the data group 302 having the same population. As a result, it is possible to avoid that a clustering result varies depending on execution.
  • processing from the allocation of the processed spectral data to the update of the representative node vector can be subdivided and executed by different information processing devices.
  • the clustering target is not limited thereto, and for example, the measurement data 2 acquired by the measurement device 3 may be subjected to clustering by the clustering processing unit 214 .
  • the pre-processing/fluorescence separation unit 211 executes pre-processing on the measurement data 2 , and inputs spectral data before fluorescence separation obtained by the pre-processing to the clustering processing unit 214 .
  • the clustering processing unit 214 executes the above-described clustering processing on the input spectral data before fluorescence separation.
  • the pre-processing/fluorescence separation unit 211 executes pre-processing on a fluorescence spectrum for each fluorescent dye obtained by executing fluorescence separation processing on spectral data included in the measurement data 2
  • the operation of the pre-processing/fluorescence separation unit 211 is not limited thereto, and for example, the pre-processing/fluorescence separation unit 211 may execute pre-processing on spectral data included in the measurement data 2 first, and then may execute fluorescence separation processing.
  • FIG. 18 is a block diagram illustrating a hardware configuration example of the information processing devices according to the embodiments of the present disclosure. Note that, in the following description, the information processing device 1 will be exemplified, but the same applies also to the information processing device 201 similarly.
  • the information processing device 1 includes a central processing unit (CPU) 901 , a read only memory (ROM) 903 , and a random access memory (RAM) 905 .
  • the information processing device 1 includes a host bus 907 , a bridge 909 , an external bus 911 , an interface 913 , an input device 915 , an output device 917 , a storage device 919 , a drive 921 , a connection port 925 , and a communication device 929 .
  • the information processing device 1 may include a processing circuit called a digital signal processor (DSP) or an application specific integrated circuit (ASIC) instead of or in addition to the CPU 901 .
  • DSP digital signal processor
  • ASIC application specific integrated circuit
  • the CPU 901 functions as an arithmetic processing device and a control device, and controls the overall operation or a part thereof in the information processing device 1 according to various programs recorded in the ROM 903 , the RAM 905 , the storage device 919 , or a removable recording medium 923 .
  • the CPU 901 controls the overall operation of each functional unit included in the information processing device 1 in the above embodiment.
  • the ROM 903 stores a program, an operation parameter, and the like used by the CPU 901 .
  • the RAM 905 primarily stores a program used in execution of the CPU 901 , a parameter that appropriately changes in the execution, and the like.
  • the CPU 901 , the ROM 903 , and the RAM 905 are connected to each other by the host bus 907 constituted by an internal bus such as a CPU bus. Furthermore, the host bus 907 is connected to the external bus 911 such as a peripheral component interconnect/interface (PCI) bus via the bridge 909 .
  • PCI peripheral component interconnect/interface
  • the input device 915 is a device operated by a user, such as a mouse, a keyboard, a touch panel, a button, a switch, or a lever.
  • the input device 915 may be, for example, a remote control device using an infrared ray or another radio wave, or an external connection device 927 such as a mobile phone corresponding to operation of the information processing device 1 .
  • the input device 915 includes an input control circuit that generates an input signal on the basis of information input by a user and outputs the input signal to the CPU 901 . By operating the input device 915 , a user inputs various types of data to the information processing device 1 or instructs the information processing device 1 to perform processing operation.
  • the output device 917 is constituted by a device capable of visually or aurally notifying a user of acquired information.
  • the output device 917 can be, for example, a display device such as an LCD, a PDP, or an OELD, a sound output device such as a speaker or a headphone, or a printer device.
  • the output device 917 outputs a result obtained by processing of the information processing device 1 as a video such as a text or an image, or as a sound such as audio.
  • the storage device 919 is a data storage device constituted as an example of a storage of the information processing device 1 .
  • the storage device 919 is constituted by, for example, a magnetic storage device such as a hard disk drive (HDD), a semiconductor storage device, an optical storage device, or a magneto-optical storage device.
  • the storage device 919 stores a program and various types of data executed by the CPU 901 , various types of data acquired from the outside, and the like.
  • the drive 921 is a reader/writer for the removable recording medium 923 such as a magnetic disk, an optical disk, a magneto-optical disk, or a semiconductor memory, and is built in or externally attached to the information processing device 1 .
  • the drive 921 reads information recorded in the attached removable recording medium 923 and outputs the information to the RAM 905 .
  • the drive 921 writes a record in the attached removable recording medium 923 .
  • the connection port 925 is a port for directly connecting a device to the information processing device 1 .
  • the connection port 925 can be, for example, a universal serial bus (USB) port, an IEEE 1394 port, or a small computer system interface (SCSI) port.
  • the connection port 925 may be an RS-232C port, an optical audio terminal, a high-definition multimedia interface (HDMI) (registered trademark) port, or the like.
  • HDMI high-definition multimedia interface
  • the communication device 929 is, for example, a communication interface constituted by a communication device or the like for connection to a communication network NW.
  • the communication device 929 can be, for example, a communication card for wired or wireless local area network (LAN), Bluetooth (registered trademark), or wireless USB (WUSB).
  • the communication device 929 may be a router for optical communication, a router for asymmetric digital subscriber line (ADSL), a modem for various types of communication, or the like.
  • the communication device 929 transmits and receives a signal and the like to and from the Internet or another communication device using a predetermined protocol such as TCP/IP.
  • the communication network NW connected to the communication device 929 is a network connected in a wired or wireless manner, and is, for example, the Internet, a home LAN, infrared communication, radio wave communication, or satellite communication.
  • the information processing system 4 includes the information processing device 1 or 201 and the measurement device 3 , but the present technology is not limited to such an example.
  • the information processing device 1 or 201 may have a function (measurement function) of the measurement device 3 .
  • the information processing system 4 is implemented by the information processing device 1 or 201 .
  • the measurement device 3 may have the functions of the information processing device 1 or 201 .
  • the information processing system 4 is implemented by the measurement device 3 .
  • the measurement device 3 may have some of the functions of the information processing device 1 or 201
  • the information processing device 1 or 201 may have some of the functions of the measurement device 3 .
  • An information processing device including:
  • the information processing device further including an average value calculation unit that calculates an average value of the plurality of pieces of spectral data for each dimension, in which
  • the information processing device according to any one of (1) to (7), further including
  • the information processing device further including
  • the information processing device according to any one of (1) to (14), further including
  • the information processing device according to any one of (11) to (14), further including
  • An information processing method including:
  • An information processing system comprising:

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • Chemical & Material Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Immunology (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Analytical Chemistry (AREA)
  • Pathology (AREA)
  • Biochemistry (AREA)
  • Hematology (AREA)
  • Software Systems (AREA)
  • Multimedia (AREA)
  • Dispersion Chemistry (AREA)
  • Signal Processing (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Computational Linguistics (AREA)
  • Urology & Nephrology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Biophysics (AREA)
  • Biotechnology (AREA)
  • Cell Biology (AREA)
  • Microbiology (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Food Science & Technology (AREA)
  • Medicinal Chemistry (AREA)
  • Investigating, Analyzing Materials By Fluorescence Or Luminescence (AREA)

Abstract

Data increased by multicoloring is analyzed. An information processing device according to an embodiment includes: a dimensional compression unit (214) that executes dimensional compression on each of a plurality of pieces of spectral data including a fluorescent component emitted from each of a plurality of particles labeled with one or more fluorescent dyes; an initial value determination unit (214) that determines an initial value for each of a plurality of nodes on the basis of a result of the dimensional compression; and a clustering unit (214) that executes clustering on the plurality of pieces of spectral data using the initial value.

Description

CROSS-REFERENCE TO RELATED APPLICATIONS
This application claims the benefit under 35 U.S.C. § 371 as a U.S. National Stage Entry of International Application No. PCT/JP2020/027196, filed in the Japanese Patent Office as a Receiving Office on Jul. 13, 2020, which claims priority to Japanese Patent Application Number JP2020-093877, filed in the Japanese Patent Office on May 29, 2020 and JP2019-152593, filed in the Japanese Patent Office on Aug. 23, 2019, each of which is hereby incorporated by reference in its entirety.
FIELD
The present disclosure relates to an information processing device, an information processing method, a program, and an information processing system.
BACKGROUND
In the fields of medicine, biochemistry, and the like, in general, a flow cytometer is used in order to quickly measure characteristics of each of a large number of particles. The flow cytometer is a device that irradiates a particle such as a cell or a bead flowing in a flow cell with a light beam to detect fluorescence, scattered light, or the like emitted from the particle, and optically measures characteristics of each particle.
CITATION LIST Patent Literature
    • Patent Literature 1: JP 2007-132921 A
    • Patent Literature 2: JP 2016-511397 A
SUMMARY Technical Problem
In a recent flow cytometer, multicoloring in which a particle such as a cell is stained with a plurality of fluorescent dyes has progressed. However, as the multicoloring progresses, the number of fluorescent substances that can be measured at one time increases, and combination explosion occurs. As a result, there is a problem that the amount of data to be processed increases and analysis is difficult.
Therefore, the present disclosure proposes an information processing device, an information processing method, a program, and an information processing system capable of analyzing data increased by multicoloring.
Solution to Problem
To solve the above-described problem, an information processing device according to one aspect of the present disclosure comprises: a dimensional compression unit that executes dimensional compression on each of a plurality of pieces of spectral data including a fluorescent component emitted from each of a plurality of particles labeled with one or more fluorescent dyes; an initial value determination unit that determines an initial value for each of a plurality of nodes on the basis of a result of the dimensional compression; and a clustering unit that executes clustering on the plurality of pieces of spectral data using the initial value.
Note that the above effects are not necessarily limited, and any of the effects described in the present specification or other effects that can be grasped from the present specification may be exhibited together with or instead of the above effects.
BRIEF DESCRIPTION OF DRAWINGS
FIG. 1 is a block diagram illustrating a configuration of an information processing system according to a first embodiment.
FIG. 2 is a diagram illustrating an example of a fluorescence spectrum.
FIG. 3 is a diagram illustrating an example of a pre-processing parameter table.
FIG. 4 is a diagram illustrating an example of a spectrum plot.
FIG. 5 is a diagram illustrating an example of a clustering result.
FIG. 6 is a diagram for explaining fluorescence separation processing.
FIG. 7 is a diagram illustrating a display example of an analysis result.
FIG. 8 is a flowchart illustrating a flow of pre-processing by a pre-processing unit.
FIG. 9 is a flowchart illustrating a flow of processing by a clustering processing unit.
FIG. 10 is a flowchart illustrating a flow of fluorescence separation processing by a fluorescence separation unit.
FIG. 11 is a schematic diagram for explaining a series of flow from initialization to learning in FlowSOM.
FIG. 12 is a block diagram illustrating a configuration of an information processing system according to a second embodiment.
FIG. 13 is a flowchart illustrating an operation example of a pre-processing/fluorescence separation unit according to the second embodiment.
FIG. 14 is a flowchart illustrating an example of clustering processing executed by a clustering processing unit according to the second embodiment.
FIG. 15 is a diagram for explaining operation at the time of initializing a representative node vector of each representative node according to the present embodiment, described in step S222 of FIG. 14 .
FIG. 16 is a flowchart illustrating an example of clustering by batch learning according to the present embodiment, described in step S223 of FIG. 14 .
FIG. 17 is a diagram for explaining operation for determining the number of clusters using consensus clustering, described in step S224 of FIG. 14 .
FIG. 18 is a block diagram illustrating a hardware configuration example of an information processing device.
DESCRIPTION OF EMBODIMENTS
Hereinafter, embodiments of the present disclosure will be described in detail with reference to the drawings. In addition, the present disclosure will be described according to the following item order.
    • 1. First Embodiment
    • 1.1 Configuration of information processing system
    • 1.2 Operation of information processing device
    • 1.3 Action and effect
    • 2. Second Embodiment
    • 2.1 Configuration of information processing system
    • 2.2 Operation example of information processing device
    • 2.2.1 Operation example of pre-processing/fluorescence separation unit
    • 2.2.2 Operation example of clustering processing unit
    • 2.2.3 Initialization of representative node vector
    • 2.2.4 Clustering by batch learning
    • 2.2.5 Determination of the number of clusters using consensus clustering
    • 2.3 Action and effect
    • 3. Hardware configuration of information processing device
1. First Embodiment
A recent flow cytometer can acquire more abundant information by staining a particle such as a cell with a plurality of fluorescent dyes and measuring many fluorescence signals at one time. Meanwhile, due to such multicoloring of the flow cytometer, a limit of analysis by manual gating as in related art is pointed out. For example, when n types of fluorescent dyes are plotted in a two-dimensional coordinate system, there are nC2 combination patterns of fluorescent dyes. That is, there are 15 combination patterns for six colors, and there are 190 combination patterns for 20 colors.
As a method for analyzing such increased data, a method for adopting automatic classification by clustering instead of conventional manual gating, classifying data by clustering, and then analyzing the data is considered.
However, in general clustering, a certain piece of data is classified into any cluster. Therefore, even data that is located at a boundary between two or more clusters and is difficult to determine is classified into any cluster. Therefore, when data acquired by the flow cytometer is classified by clustering, there is a possibility that a certain degree of misclassification may occur.
Meanwhile, there are a filter type flow cytometer that detects intensity of one or more specific wavelengths in fluorescence and a spectrum type flow cytometer that can acquire more information by detecting fluorescence emitted from each fluorescent dye as a wavelength spectrum. In the spectral type flow cytometer, since fluorescence intensity for each wavelength is obtained, it is possible to acquire a large amount of features from one particle (also referred to as a sample). Therefore, by adopting the spectral type flow cytometer, more detailed analysis can be performed.
However, in spectral data obtained by the spectral type flow cytometer, the fluorescence intensity for each wavelength changes exponentially instead of linearly. Therefore, in order to display the spectral data easily recognizable to a user, it is necessary to perform scale conversion on the spectral data. Meanwhile, in clustering, classification is performed on the basis of a distance between pieces of data. Therefore, if the spectral data in which the fluorescence intensity for each wavelength changes exponentially is subjected to clustering as it is, a difference on a portion having large fluorescence intensity strongly affects the entire distance between pieces of data, and erroneous classification may occur. As a result, there is a possibility that a clustering result does not correctly correspond to spectral data to be subjected to scale conversion and displayed.
Therefore, in the present embodiment, an information processing device, an information processing method, a program, and an information processing system capable of making a display result of spectral data and a clustering result correspond to each other more correctly will be described with an example.
Specifically, logicle conversion is performed as pre-processing on spectral data acquired by the flow cytometer. Then, clustering is performed using the spectral data that has been subjected to the logicle conversion, and a clustering result obtained by the clustering is displayed to a user. As a result, in the present embodiment, it can be determined whether or not clustering is performed on a portion having a large value in the spectral data. Therefore, it is possible to perform clustering such that a display result of the spectral data and a clustering result more correctly correspond to each other. Hereinafter, the information processing device, the information processing method, the program, and the information processing system according to the present embodiment will be described in detail with reference to the drawings.
However, the technology according to the present disclosure does not need to solve all the above-described problems at the same time. Therefore, it is understood that those solving some or all of the above-described problems by including some or all of the configurations described later are included in the technical scope of the present disclosure.
1.1 Configuration of Information Processing System
FIG. 1 is a block diagram illustrating a configuration of an information processing system according to the present embodiment. As illustrated in FIG. 1 , an information processing system 4 includes an information processing device 1 and a measurement device 3.
The measurement device 3 is a measurement device capable of detecting a fluorescence spectrum of each color from a cell or the like to be measured. The measurement device 3 is, for example, a flow cytometer. A measurement sample measured by the flow cytometer may be a biologically derived particle such as a cell, a microorganism, or a biologically relevant particle. For example, the cell may be an animal cell (for example, a corpuscle-based cell), or a plant cell. For example, the microorganism may be a bacterium such as Escherichia coli, a virus such as tobacco mosaic virus, or a fungus such as yeast. The biologically relevant particle may be a particle constituting a cell such as a chromosome, a liposome, mitochondria, or various organelles. Note that the biologically relevant particle may include a biologically relevant polymer such as a nucleic acid, a protein, a lipid, a sugar chain, or a complex thereof. Each of these biologically derived particles may have either a spherical shape or a non-spherical shape, and is not particularly limited in size and mass.
In addition, the measurement sample may be an industrially synthesized particle such as a latex particle, a gel particle, or an industrial particle. For example, the industrially synthesized particle may be a particle synthesized with an organic resin material such as polystyrene or polymethyl methacrylate, an inorganic material such as glass, silica, or a magnetic body, or a metal such as gold colloid or aluminum. Similarly, each of these industrially synthesized particles may have either a spherical shape or a non-spherical shape, and is not particularly limited in size and mass.
The measurement sample can be labeled (stained) with one or more fluorescent dyes prior to measurement of a fluorescence spectrum. The measurement sample may be labeled with a fluorescent dye by a known method. Specifically, when the measurement sample is a cell, a fluorescently labeled antibody that is selectively bonded to an antigen present on a cell surface is mixed with a cell to be measured, and the fluorescently labeled antibody is bonded to an antigen on a surface of the cell. As a result, the cell to be measured can be labeled with a fluorescent dye. Alternatively, the cell to be measured can be labeled with a fluorescent dye by mixing a fluorescent dye that is selectively taken into a specific cell with the cell to be measured.
The fluorescently labeled antibody is an antibody to which a fluorescent dye is bonded as a label. The fluorescently labeled antibody may be an antibody to which a fluorescent dye is directly bonded. Alternatively, the fluorescently labeled antibody may be an antibody obtained by bonding a fluorescent dye to which avidin is bonded to a biotin-labeled antibody by an avidin-biotin reaction. Note that as the antibody, either a polyclonal antibody or a monoclonal antibody can be used.
The fluorescent dye for labeling a cell is not particularly limited, and at least one or more known dyes used for staining a cell or the like can be used. For example, as the fluorescent dye, phycoerythrin (PE), fluorescein isothiocyanate (FITC), PE-Cy5, PE-Cy7, PE-Texas Red (registered trademark), allophycocyanin (APC), APC-Cy7, ethidium bromide, propidium iodide, Hoechst (registered trademark) 33258, Hoechst (registered trademark) 33342, 4′,6-diamidino-2-phenylindole (DAPI), acridineorange, chromomycin, mithramycin, olivomycin, pyronin Y, thiazole orange, rhodamine 101, isothiocyanate, BCECF, BCECF-AM, C. SNARF-1, C. SNARF-1-AMA, aequorin, Indo-1, Indo-1-AM, Fluo-3, Fluo-3-AM, Fura-2, Fura-2-AM, oxonol, Texas Red (registered trademark), Rhodamine 123, 10-N-nony-acridine orange, fluorescein, fluorescein diacetate, carboxyfluorescein, carboxyfluorescein diacetate, carboxydichlorofluorescein, and carboxydichlorofluorescein diacetate can be used. In addition, derivatives of the above-described fluorescent dyes and the like can also be used.
The flow cytometer includes a laser light source that emits laser light having a wavelength capable of exciting a fluorescent dye with which a measurement sample S is labeled, a flow cell through which the measurement sample S flows in one direction, and a photodetector that receives one or more of fluorescence, phosphorescence, and scattered light from the measurement sample S irradiated with the laser light.
The laser light source is, for example, a semiconductor laser light source that emits laser light having a predetermined wavelength. A plurality of laser light sources may be disposed. When the plurality of laser light sources is disposed, positions irradiated with laser light from the laser light sources may be the same as or different from each other in the flow cell. However, in a case where different positions are irradiated with laser light from the plurality of laser light sources, light from the measurement sample S can be detected by different photodetectors. In such a case, even in a case where dyes that emit fluorescent rays having close wavelengths are used, a fluorescence spectrum of each of the fluorescent rays can be measured without color mixing. Note that the laser light emitted from the laser light source may be either pulsed light or continuous light. For example, as the laser light source, a plurality of semiconductor laser light sources that emits laser light having a wavelength of 480 nm and laser light having a wavelength of 640 nm may be used.
The flow cell is a flow path through which the plurality of measurement samples S flows in line in one direction. Specifically, through the flow cell, a sheath liquid enclosing the measurement samples S flows at a high speed as a laminar flow, and the plurality of measurement samples S thereby flows in line in one direction. The flow cell can be formed in a microchip or a cuvette.
The photodetector detects light from the measurement sample S irradiated with laser light by photoelectric conversion. The light from the measurement sample S can include at least one of fluorescence, phosphorescence, and scattered light.
For example, the photodetector may include a detector that detects scattered light LS including forward scattered light and side scattered light from the measurement sample S, and a light receiving element array that detects fluorescence from the measurement sample S.
The detector may be, for example, a known photoelectric conversion element such as a charge coupled device (CCD), a complementary metal oxide semiconductor (CMOS), or a photodiode. The light receiving element array can be constituted by, for example, arranging a plurality of independent detection channels having different wavelength ranges of light to be detected. Specifically, the light receiving element array may be a light receiving element array in which a plurality of photo multiplier tubes (PMTs) or photodiodes having different wavelength ranges to be detected are arranged one-dimensionally or the like, an image sensor in which pixels are arranged in a two-dimensional lattice pattern, or the like. The light receiving element array photoelectrically converts fluorescence from the measurement sample S spectrally dispersed for each wavelength by a spectroscopic element such as a prism or a grating.
In the flow cytometer having the above configuration, first, the measurement sample S flowing in the flow cell is irradiated with laser light emitted from the laser light source. The measurement sample S emits scattered light and fluorescence (or phosphorescence) by being irradiated with the laser light. The scattered light emitted from the measurement sample S is detected by the detector. Meanwhile, the fluorescence emitted from the measurement sample S is spectrally dispersed into light for each wavelength by the spectroscopic element and then received by the light receiving element array. As a result, a spectrum of the fluorescence emitted from the measurement sample S is detected.
Note that in the following description, it is assumed that a cell is to be measured. The measurement device 3 is, for example, a spectrum type flow cytometer that causes a fluorescent-stained cell to flow through a flow cell at a high speed and irradiates the flowing cell with a light beam to detect a fluorescence spectrum for each fluorescent dye emitted from the cell. FIG. 2 is a diagram illustrating an example of the fluorescence spectrum. As illustrated in FIG. 2 , the fluorescence spectrum is expressed by fluorescence intensity for each channel corresponding to a wavelength.
The measurement device 3 outputs a detected fluorescence spectrum as measurement data 2. The measurement data 2 includes spectral data of fluorescence for each cell. The measurement device 3 transfers the measurement data 2 to, for example, the information processing device 1.
The information processing device 1 acquires and analyzes the measurement data 2 measured by the measurement device 3, and displays an analysis result. Note that the information processing device 1 and the measurement device 3 may be connected to each other via a network, and the information processing device 1 may acquire the measurement data 2 via the network.
The information processing device 1 includes a pre-processing unit 11, a pre-processing parameter table 12, a spectrum output unit 13, a clustering processing unit 14, a clustering result presentation unit 15, a fluorescence separation unit 16, and a normal analysis presentation unit 17. Note that all or some of these functional units may be performed in a cloud. For example, the pre-processing parameter table 12, the clustering processing unit 14, and the fluorescence separation unit 16 may be performed in a cloud. In this case, the measurement data 2 is also transferred to the cloud.
The pre-processing unit 11 performs pre-processing on spectral data according to a selected parameter. Here, the pre-processing is coordinate conversion for performing display from an actual observation value. The conversion may be, for example, simple log 10 conversion or logicle conversion considering characteristics of an observation device. The parameter includes W, T, M, and A.
W is a value that linearly displays a value near zero. T is a maximum value of fluorescence intensity, and is, for example, 104. M is a maximum value of a display coordinate after conversion. A is a minimum negative value to be converted.
The pre-processing parameter table 12 is a table that stores a pre-processing parameter. FIG. 3 is a diagram illustrating an example of the pre-processing parameter table 12. As illustrated in FIG. 3 , the pre-processing parameter table 12 stores a plurality of combinations (also referred to as parameter sets) of values of W, T, M, and A. A parameter ID is an identifier that identifies a combination of parameters.
The pre-processing unit 11 performs logicle conversion using a combination of W, T, M, and A selected from the pre-processing parameter table 12 by a user. Note that the pre-processing unit 11 may use a default value instead of selecting a parameter from the pre-processing parameter table 12. Alternatively, a user can designate a value other than a value stored in the pre-processing parameter table 12. The pre-processing unit 11 performs pre-processing every time a user changes a pre-processing parameter. Note that the pre-processing unit 11 may perform conversion of non-linear processing such as log conversion or bi-exponential conversion instead of logicle conversion.
The spectrum output unit 13 generates an image of a spectrum plot using spectral data that has been pre-processed by the pre-processing unit 11, and displays the generated image. FIG. 4 is a diagram illustrating an example of the spectral plot. The spectral plot indicates a detection wavelength (wavelength) on the horizontal axis and fluorescence intensity on the vertical axis, and expresses information (population information) regarding the number of fine particles (the number of events or the density) by different color shades, color tones, and the like.
In FIG. 4 , “LD 488” on the vertical axis indicates fluorescence when laser light having a wavelength of 488 nm (nanometre) is emitted, and “A” indicates that a measured value is cumulative intensity. In addition, in FIG. 4 , information regarding the number of fine particles is expressed by shading, but in an actual screen, the information regarding the number of fine particles is expressed by color.
As illustrated in the vertical axis in FIG. 4 , in the spectrum plot, the vertical axis corresponding to fluorescence intensity is subjected to logicle conversion and displayed. For this reason, arrow 91 and arrow 92 are displayed with the same length in FIG. 4 , but ranges indicated by the arrows are largely different from each other. That is, when the vertical axis is linear, the lengths are completely different from each other, and the length of arrow 91 is much longer than that of arrow 92.
The clustering processing unit 14 perform clustering on a cell using spectral data that has been pre-processed by the pre-processing unit 11. K is designated, for example, as K-means, and the clustering processing unit 14 classifies the spectral data into K clusters. Alternatively, the clustering processing unit 14 may automatically determine the number of divisions as in Flow self-organizing map (FlowSOM).
Alternatively, for example, as in T-SNE, the clustering processing unit 14 may perform dimensional compression and perform gating on a result of the dimension compression to perform clustering. Alternatively, the clustering processing unit 14 may perform two-stage clustering such as meta clustering and use two cluster definitions such as a cluster ID and a meta cluster ID. Here, the meta cluster is a collection of clusters.
By performing clustering using spectral data that has been subjected to logicle conversion, the clustering processing unit 14 can easily make a display result of the spectral data correspond to a clustering result.
The clustering result presentation unit 15 displays a clustering result by the clustering processing unit 14 on a display device. The clustering result presentation unit 15 visualizes the number of classifications or which cell group belongs to which group. FIG. 5 is a diagram illustrating an example of the clustering result. FIG. 5 illustrates a case where clustering is performed using FlowSOM. In FIG. 5 , circles indicate clusters, and the clusters are classified into meta clusters M #1 to M #5 with different shadings. In actual display, the meta clusters M #1 to M #5 are displayed in different colors. By display of the clustering result, a user can be notified of a distribution of clusters and meta clusters.
The fluorescence separation unit 16 acquires the measurement data 2 and performs fluorescence separation processing (also referred to as unmixing). FIG. 6 is a diagram for explaining fluorescence separation processing. FIG. 6(a) illustrates a measured fluorescence spectrum. As illustrated in FIG. 6(b), the measured fluorescence spectrum is obtained by superimposing fluorescence spectra of, for example, three fluorescent rays.
Therefore, the fluorescence separation unit 16 separates the spectrum into three spectra of fluorescence #1 to #3 using a reference spectrum illustrated in FIG. 6(c). Here, the reference spectrum is a fluorescence spectrum for each fluorescence. The spectra of the separated fluorescence #1 to #3 are illustrated in FIG. 6(d). The fluorescence separation unit 16 calculates fluorescence intensity by, for example, taking a weighted average using the spectrum separated for each fluorescence. FIG. 6(e) illustrates the intensities of the fluorescence #1 to #3 calculated by the fluorescence separation unit 16.
The normal analysis presentation unit 17 performs analysis using the fluorescence intensity separated by the fluorescence separation unit 16 and displays an analysis result on a display device. FIG. 7 is a diagram illustrating a display example of the analysis result. FIG. 7 illustrates a two-dimensional plot with two axes, APC-Cy7::CD 24 and PE-Dazzle 594::CD 38. Here, APC-Cy7::CD 24 and PE-Dazzle 594::CD 38 are fluorescent dye-labeled antibodies used for measurement of fluorescence intensity. “APC-Cy7” and “PE-Dazzle 594” are fluorescent dyes, and “CD 24” and “CD 38” are antibodies. By the two-dimensional plot, a user can be notified of a distribution of cells with respect to the two fluorescent dyes.
Note that the information processing device 1 may perform pre-processing and clustering on a cell group included in a partial region of the two-dimensional plot in response to selection of the partial region by a user, and display a clustering result. In addition, the information processing device 1 may perform pre-processing and clustering on a cell group included in a partial region of the spectral plot in response to selection of the partial region by a user, and display a clustering result. In particular, a characteristic portion such as dense portion in the spectral plot is often selected by a user.
1.2 Operation of Information Processing Device
Next, operation of the information processing device 1 will be described with reference to FIGS. 8 to 10 . FIG. 8 is a flowchart illustrating a flow of pre-processing by the pre-processing unit 11. As illustrated in FIG. 8 , the pre-processing unit 11 selects a pre-processing parameter on the basis of a user's instruction (step S1).
Then, the pre-processing unit 11 performs pre-processing on spectral data using the pre-processing parameter (step S2). Then, the pre-processing unit 11 determines whether or not a user has changed the pre-processing parameter (step S3). If the user has changed the pre-processing parameter, the pre-processing unit 11 changes the pre-processing parameter and performs pre-processing on the spectral data (step S4). The process returns to step S3.
Meanwhile, if the user does not change the pre-processing parameter, the pre-processing unit 11 determines whether or not all target cells have been processed (step S5). If there is a target cell that has not been processed, the process returns to step S2, and another cell is processed. Meanwhile, if all target cells have been processed, the pre-processing unit 11 instructs the spectrum output unit 13 to present the pre-processed spectral data to a user (step S6). In addition, the pre-processing unit 11 transmits the pre-processed spectral data to the clustering processing unit 14.
As described above, the pre-processing unit 11 performs pre-processing on the spectral data, and the clustering processing unit 14 can thereby make the clustering result correspond to the display result of the spectral data.
FIG. 9 is a flowchart illustrating a flow of processing by the clustering processing unit 14. As illustrated in FIG. 9 , the clustering processing unit 14 acquires the pre-processed spectral data from the pre-processing unit 11 (step S11), and performs clustering processing (step S12). Then, the clustering processing unit 14 instructs the clustering result presentation unit 15 to present a clustering result (step S13).
As described above, since the clustering processing unit 14 performs clustering using the pre-processed spectral data, it is possible to make the clustering result correspond to the display result of the spectral data.
FIG. 10 is a flowchart illustrating a flow of fluorescence separation processing by the fluorescence separation unit 16. As illustrated in FIG. 10 , the fluorescence separation unit 16 acquires a reference spectrum (step S21). Then, the fluorescence separation unit 16 acquires spectral data for one cell (step S22) and performs fluorescence separation processing (step S23). Then, the fluorescence separation unit 16 determines whether or not all cells have been processed (step S24). If there is a cell that has not been processed, the process returns to step S22. If all cells have been processed, fluorescence separation processing ends.
As described above, since the fluorescence separation unit 16 performs fluorescence separation processing, the normal analysis presentation unit 17 can analyze the fluorescence data and display the analysis result.
1.3 Action and Effect
As described above, according to an embodiment, the pre-processing unit 11 acquires spectral data and performs logicle conversion as pre-processing. Then, the clustering processing unit 14 performs clustering using spectral data that has been subjected to logicle conversion. Then, the clustering result presentation unit 15 displays ae clustering result on a display device. Therefore, the information processing device 1 can prevent clustering from being performed on a portion having a large value in the spectral data. Therefore, the information processing device 1 can perform clustering such that the display result of the spectral data and the clustering result correspond to each other.
2. Second Embodiment
As described above, the recent flow cytometer has become multispectral, in which a particle such as a cell is stained with a plurality of fluorescent dyes.
As the flow cytometer becomes multispectral, the number of fluorescent substances that can be measured at one time increases, and as a result, combination explosion occurs. Therefore, it is difficult to perform analysis by a human hand. For example, in a case where N (N is a natural number) colors are measured at one time, when each of the colors is treated two-dimensionally, there are n (n−1)/2 combinations, and the number of combinations increases in proportion to a half of the square of the number of colors N.
As a method for analyzing data increased by such combination explosion, for example, a method for classifying data by clustering such as FlowSOM and then analyzing the data is considered as described above.
FIG. 11 is a schematic diagram for explaining a series of flow from initialization to learning in FlowSOM. The example illustrated in FIG. 11 indicates a case where 100 vectors (coordinate values) included in a data group 900 to be analyzed are classified into nine representative nodes N11 to N33 arranged in three rows and three columns in a two-dimensional coordinate system in an initial state.
In the example illustrated in FIG. 11 , first, a predetermined number (nine in this example) of pieces of data are selected from the data group 900 by random sampling, and the selected pieces of data are set as initial node vectors N_ij of the representative nodes N11 to N33 (step S1).
Next, learning to update the representative node vector N_ij of each of the representative nodes N11 to N33 is executed. Specifically, first, one piece of data is acquired from the data group 900 by random sampling (step S2).
Subsequently, processing represented by the following formula (1) is executed on a representative node closest to the acquired data (vector), and the representative node vector N_ij of the representative node is thereby updated (step S3). Note that in formula (1), α represents a learning ratio. The learning ratio α may be set on the basis of an empirical rule or the like, and may be, for example, a value such as 0.1 or 0.01.
New representative node vector N_ij=(1−α)×old representative node vector N_ij+α×vector of acquired data  (1)
Subsequently, on a representative node located around the representative node having the representative node vector N_ij updated in step S3 (hereinafter, a peripheral representative node), processing of further multiplying the change by the formula (1) by β, represented by the following formula (2), is executed, and a representative node vector of the peripheral representative node (hereinafter, referred to as a peripheral representative node vector) N_ij is thereby updated (step S4). Note that in formula (2), β(r) may be a parameter of an algorithm determined on the basis of an empirical rule, and may be, for example, a value such as 0.1 or 0.01. In addition, β(r) may be a fixed value or may be changed depending on the number of times of learning. For example, a β(r) value may be changed between first learning and second learning. The same may apply to a.
New peripheral representative node vector N_ij=(1−β(r))×old peripheral representative node vector+β(r)×vector of acquired data  (2)
Note that, in formula (2), r may be a distance from a node to be updated, or may be a variable for weighting β with a function related to the distance. For example, β(r)=0.9 can be set for a node adjacent to the node to be updated, and β(r)=0.5 can be set for a node adjacent to the node adjacent to the node to be updated. As described above, β(r) may be changed depending on the number of times of learning, and for example, the value of β may be reduced as the number of times of learning increases.
Thereafter, the processing in steps S2 to S4 is repeatedly executed until the processing for all the pieces of data included in the data group 900 is completed.
However, in such FlowSOM, since the SOM algorithm is used, there are the following problems.
First, in initialization of a representative node of SOM (hereinafter, also referred to as an SOM node), as described above, since an initial node vector is determined by random sampling of data (step S1), there is a problem that a clustering result varies depending on execution.
Second, in the SOM algorithm, since clustering is executed while learning is executed by random sampling (steps S2 to S4), there is a problem that a learning result varies depending on sampling order, and as a result, a clustering result varies depending on execution.
Third, as a derivative from the second problem, since learning is performed sequentially by random sampling (steps S2 to S4), the processing is inseparable processing in which updating of a vector of an SOM node cannot be parallelized (divided). As a result, there is a problem that processing efficiency or processing speed cannot be improved.
In addition, although the multispectral flow cytometer makes more detailed analysis possible, the amount of data handled increases at the same time. Therefore, there is also a problem that an increase in processing time or a curse of dimensions easily occurs when data is classified by clustering.
Furthermore, in recent years, the number of pieces of data to be analyzed has increased due to an improvement in measurement speed of the flow cytometer. However, when the number of pieces of data increases, there is not only a problem that drawing time in data analysis increases, but also a problem that analysis time increases due to a simple increase in the number of targeted clusters in addition to the increase in the drawing time when a cluster generated by a user is additionally analyzed.
Therefore, in the present embodiment, an information processing device, an information processing method, a program, and an information processing system capable of efficiently executing clustering capable of suppressing variations in results depending on execution will be described with an example.
However, the technology according to the present disclosure does not need to solve all the above-described problems at the same time. Therefore, it is understood that those solving some or all of the above-described problems by including some or all of the configurations described later are included in the technical scope of the present disclosure.
In addition, in the following description, the same reference numerals are given to similar configurations and operations to those of the above-described first embodiment, and detailed description thereof is omitted.
2.1 Configuration of Information Processing System
FIG. 12 is a block diagram illustrating a configuration of an information processing system according to the present embodiment. As illustrated in FIG. 12 , an information processing system 200 includes an information processing device 201 and a measurement device 3. In the present embodiment, the measurement device 3 and a measurement sample may be similar to the measurement device 3 and the measurement sample described in the first embodiment, and therefore detailed description thereof is omitted here.
The information processing device 201 includes a pre-processing/fluorescence separation unit 211, a pre-processing parameter table 12, a spectrum output unit 13, a clustering processing unit 214, a clustering result presentation unit 15, and a normal analysis presentation unit 17. In this configuration, the pre-processing parameter table 12, the clustering result presentation unit 15, and the normal analysis presentation unit 17 may be similar to those according to the first embodiment. In addition, the pre-processing/fluorescence separation unit 211 may have the functions of both the pre-processing unit 11 and the fluorescence separation unit 16 according to the first embodiment. Note that the spectrum output unit 13 is omitted in FIG. 12 , but a spectrum output unit 13 similar to that of the first embodiment may be added.
Similarly to the fluorescence separation unit 16 according to the first embodiment, the pre-processing/fluorescence separation unit 211 separates measurement data 2, which is spectral data, into a fluorescence spectrum for each fluorescent dye by using a reference spectrum. Then, the pre-processing/fluorescence separation unit 211 executes pre-processing such as Logicle conversion on an unmixed fluorescence spectrum. The pre-processed fluorescence spectrum for each fluorescent dye is presented to a user by the normal analysis presentation unit 17.
Note that a parameter set of conversion parameters used in the pre-processing may be selected by a user's designation of a parameter set to be used from parameter sets managed in the pre-processing parameter table 12 (see FIG. 3 ) as in the first embodiment. In addition, the parameter sets managed in the pre-processing parameter table 12 may be finely adjustable by a user.
The clustering processing unit 214 according to the present embodiment executes clustering processing on the pre-processed fluorescence spectrum output from the pre-processing/fluorescence separation unit 211. A clustering result generated by the clustering processing unit 214 is presented to a user by the clustering result presentation unit 15.
2.2 Operation Example of Information Processing Device
Next, operation of the information processing device 201 according to the present embodiment will be described.
2.2.1 Operation Example of Pre-Processing/Fluorescence Separation Unit
First, an operation example of the pre-processing/fluorescence separation unit 211 according to the present embodiment will be described. FIG. 13 is a flowchart illustrating an operation example of the pre-processing/fluorescence separation unit according to the present embodiment. As illustrated in FIG. 13 , in the present embodiment, the pre-processing/fluorescence separation unit 211 first selects a pre-processing parameter on the basis of a user's instruction as in step S1 of FIG. 8 (step S201).
Next, similarly to the fluorescence separation processing described with reference to FIG. 10 in the first embodiment, the pre-processing/fluorescence separation unit 211 executes fluorescence separation processing using a reference spectrum on spectral data of all the cells included in the measurement data 2 (step S202).
Next, as in steps S2 to S5 of FIG. 8 , the pre-processing/fluorescence separation unit 211 executes pre-processing using a pre-processing parameter selected or changed by a user on the spectral data of all the cells included in the measurement data 2 (steps S203 to S206).
Thereafter, if the pre-processing on the spectral data of all the cells is completed (YES in step S206), the pre-processing/fluorescence separation unit 211 inputs the pre-processed spectral data to the clustering processing unit 214 (step S207), and ends this operation. Note that the pre-processing/fluorescence separation unit 211 may instruct the spectrum output unit 13 to present the pre-processed spectral data to a user, for example, as in step S6 of FIG. 8 .
Note that in this operation, the spectral data of all the cells included in the measurement data 2 is targeted, but the target of this operation is not limited thereto, and only needs to be spectral data of the number of cells equal to or larger than the number of node divisions (hereinafter, also referred to as a necessary number) in clustering described later. The same may apply to operation of a clustering processing unit described later.
2.2.2 Operation Example of Clustering Processing Unit
Next, an operation example of the clustering processing unit 214 according to the present embodiment will be described. A basic flow of operation executed by the clustering processing unit 214 according to the present embodiment may be, for example, similar to the operation described with reference to FIG. 9 in the first embodiment. However, in the present embodiment, the clustering processing executed in step S12 of FIG. 9 is replaced with clustering processing described later with reference to FIG. 14 .
FIG. 14 is a flowchart illustrating an example of the clustering processing executed by the clustering processing unit according to the present embodiment. Note that, as can be seen from operation of the clustering processing unit 214 illustrated in FIG. 14 , the clustering processing unit 214 according to the present embodiment can also function as one or more of a dimensional compression unit, an initial value determination unit, a clustering unit, an average value calculation unit, one or more allocation units, an update unit, a node number changing unit, a meta clustering unit, a division unit, and a node setting unit in claims.
As illustrated in FIG. 14 , in the present embodiment, the clustering processing unit 214 first executes main component analysis on spectral data of all the cells (or the number of cells equal to or larger than the necessary number) included in the measurement data 2 (step S221).
Note that the spectral data targeted in step S221 may be spectral data pre-processed after fluorescence separation. In addition, in the present description, the number of main components is two, but is not limited thereto, and may be three or more. Furthermore, in step S221, not only the main component analysis but also various types of dimensional compression using a statistical data analysis method such as T-SNE may be executed.
Next, the clustering processing unit 214 determines an initial value of a representative node vector of an SOM node on the basis of a first main component and a second main component determined in step S221 and the values (vectors) of all the cells (or the number of cells equal to or larger than the necessary number) included in the measurement data 2 (step S222). Note that details of step S222 will be described later with reference to FIG. 15 .
Next, the clustering processing unit 214 executes SOM clustering processing by performing batch learning (step S223). Note that details of step S223 will be described later with reference to FIG. 16 .
Next, the clustering processing unit 214 executes processing of determining the number of clusters (hereinafter, referred to as meta clustering) such as consensus clustering on a result of the SOM clustering processing executed in step S223 (step S224). As a result, the number of clusters in the clustering processing is determined. Note that details of step S223 in a case of using consensus clustering will be described later with reference to FIG. 17 .
Thereafter, the clustering processing unit 214 instructs the clustering result presentation unit 15 to present a clustering result to a user (step S225). For visualization for presenting the clustering result to the user, for example, a minimum spanning tree (MST) method can be used.
2.2.3 Initialization of Representative Node Vector
FIG. 15 is a diagram for explaining operation at the time of initializing a representative node vector of each representative node according to the present embodiment, described in step S222 of FIG. 14 . Note that the example illustrated in FIG. 15 illustrates a case where 100 pieces of processed spectral data (vectors (coordinate values)) on which the pre-processing/fluorescence separation unit 211 has executed fluorescence separation and pre-processing are classified into nine representative nodes N11 to N33 arranged in three rows and three columns in a two-dimensional coordinate system in an initial state, similarly to the example illustrated in FIG. 11 .
As illustrated in FIG. 15 , in initialization of a representative node vector according to the present embodiment, first, the clustering processing unit 214 calculates an average value in each dimension for a data group including spectral data after fluorescence separation and pre-processing (hereinafter, referred to as processed spectral data) (hereinafter, simply referred to as a data group) 302 (step S301). For example, in a case where the number of dimensions of data is ten, for each of one to ten dimensions, values in the dimensions in all pieces of the processed spectral data included in the data group 302 are summed, and an average thereof is calculated. Note that the number of dimensions is a value corresponding to the number of types of data, and can correspond to the number of channels in a case of spectral data, for example. Therefore, for example, in a case where a light receiving element array has 32 channels, that is, a light receiving element array detects fluorescence by dividing an entire detection range into 32 wavelength bands, the number of dimensions of spectral data acquired by the light receiving element array is 32.
Subsequently, the clustering processing unit 214 performs dimensional compression on all pieces of the processed spectral data included in the data group 302 to determine a first eigenvalue, a second eigenvalue, variance of the first eigenvalue, and variance of the second eigenvalue (step S302). For the dimensional compression of data, for example, a statistical data analysis method such as main component analysis can be used. Note that the first eigenvalue and the second eigenvalue may be selected according to a predetermined rule or arbitrarily from a dimension after the dimensional compression, and the number of the eigenvalues is not limited to two of the first eigenvalue and the second eigenvalue, and may be one or three or more, for example.
Subsequently, the clustering processing unit 214 calculates an initial node vector of each of the representative nodes N11 to N33 using the following formula (3) (step S303). Note that, in formula (3), an initial value (initial node vector) of a representative node vector of a representative node Nij having coordinates (i, j) is represented by N_ij. In formula (3), the number of row divisions is the number of representative nodes arranged in a row direction, and is three in this example. Similarly, the number of column divisions is the number of representative nodes arranged in a column direction, and is three in this example.
Initial node vector N_ij=average value in each dimension+(first eigenvalue/number of row divisions)×i+(second eigenvalue/number of column divisions)×j  (3)
By determining the initial node vector (initial value) of each representative node as described above, it is possible to set the same initial node vector all the time for the data group 302 having the same population. As a result, it is possible to avoid that a clustering result varies depending on execution (corresponding to the first problem described above).
2.2.4 Clustering by Batch Learning
FIG. 16 is a flowchart illustrating an example of clustering by batch learning according to the present embodiment described in step S223 of FIG. 14 . Note that the example illustrated in FIG. 16 illustrates a case where 100 pieces of processed spectral data (vectors (coordinate values)) on which the pre-processing/fluorescence separation unit 211 has executed fluorescence separation and pre-processing are classified into nine representative nodes N11 to N33 arranged in three rows and three columns in a two-dimensional coordinate system in an initial state, similarly to the example illustrated in FIG. 15 .
As illustrated in FIG. 16 , in clustering by batch learning according to the present embodiment, first, the clustering processing unit 214 executes processing of selecting processed spectral data piece by piece from the data group 302 and allocating the selected piece of processed spectral data to a representative node closest thereto (step S304). This allocation processing is repeated until allocation of all pieces (or the number of pieces equal to or larger than the necessary number) of processed spectral data in the data group 302 is completed. In the example illustrated in FIG. 16 , the pieces of processed spectral data with cell IDs ‘1’ and ‘3’ are allocated to a representative node N11, the piece of processed spectral data with a cell ID ‘2’ is allocated to a representative node N12, and the piece of processed spectral data with a cell ID ‘100’ is allocated to a representative node N33 (cell IDs=‘4’ to ‘99’ are not described).
Note that the cell ID may be an identifier for uniquely identifying a cell corresponding to the processed spectral data registered in the data group 302. In addition, the order in selecting the processed spectral data from the data group 302 is not particularly limited, and may be, for example, various orders such as ascending order of cell IDs for uniquely identifying cells of the processed spectral data registered in the data group 302 and registration order in the data group 302.
Next, the clustering processing unit 214 executes processing of updating the representative node vector of each of the representative nodes N11 to N33 on the basis of the processed spectral data allocated to each of the representative nodes N11 to N33 (step S305). In the update of the representative node vector, for example, a new representative node vector after the update is calculated on the basis of the following formula (4).
new representative node vector N_ij=(1−α)×old representative node vector N_ij+α×average value of allocated processed spectral data  (4)
Next, the clustering processing unit 214 executes processing of further multiplying the change by the formula (4) by β, represented by the following formula (5), on the peripheral representative node located around the representative node having the representative node vector N_ij updated in step S305, and thereby updates the peripheral representative node vector N_ij of the peripheral representative node (step S306). Note that in formula (5), β(r) may be similar to β(r) in formula (2) described above.
new peripheral representative node vector N_ij=(1−β(r))×old peripheral representative node vector N_ij+β(r)×average value of processed spectral data allocated to representative node  (5)
In the present embodiment, the representative node vector of each of the representative nodes N11 to N33 is updated by repeating the processing in steps S304 to S306 described above a predetermined number of times (for example, one or more times). Note that in the repetition of the processing in steps S304 to S306, the learning ratio α and/or β may change (for example, decrease or increase) every time the processing is repeated.
As described above, by allocating all pieces of the processed spectral data to be analyzed to any of the representative nodes N11 to N33, and then updating the representative node vector of each of the representative nodes N11 to N33 using the processed spectral data allocated to each of the representative nodes N11 to N33, random sampling at the time of learning can be eliminated. Therefore, it is possible to avoid that a clustering result varies depending on execution (corresponding to the second problem).
In addition, since the update of the representative node vector is executed after the allocation of all pieces of the processed spectral data is completed, processing from the allocation of the processed spectral data to the update of the representative node vector can be subdivided and executed by different information processing devices.
For example, the processed spectral data included in the data group 302 can be divided into a plurality of groups, and the spectral data belonging to each group can be allocated to the representative node by different information processing devices. For example, the clustering processing unit 214 according to the present embodiment includes an allocation unit that allocates the plurality of pieces of processed spectral data to any of the representative nodes N11 to N33, the number of allocation units included in the clustering processing unit 214 may be equal to or less than the number of groups, and the allocation units may be executed in different information processing devices.
As a result, since the processing of allocating the processed spectral data can be executed in parallel, it is possible to improve processing efficiency, processing speed, and the like.
In addition, since it is possible to reduce the amount of data of which each information processing device is in charge by subdividing the processing, it is also possible to suppress occurrence of a problem such as an increase in processing time or a curse of dimensions at the time of clustering.
2.2.5 Determination of the Number of Clusters Using Consensus Clustering
FIG. 17 is a diagram for explaining operation for determining the number of clusters using consensus clustering described in step S224 of FIG. 14 . Note that the example illustrated in FIG. 17 illustrates a case where 100 pieces of processed spectral data (vectors (coordinate values)) on which the pre-processing/fluorescence separation unit 211 has executed fluorescence separation and pre-processing are classified into nine representative nodes N11 to N33 arranged in three rows and three columns in a two-dimensional coordinate system in an initial state, similarly to the examples illustrated in FIGS. 15 and 16 .
In the above description, as illustrated in (a) of FIG. 17 , the case where the number of classification groups of the processed spectral data included in the data group 302, that is, the number of representative nodes is set to nine in initial setting has been exemplified. However, the number (nine) of the initially set representative nodes is not necessarily optimal depending on the processed spectral data included in the data group 302.
Therefore, in the present embodiment, in step S224 of FIG. 14 , by executing meta clustering such as consensus clustering, a more suitable number of divisions and a combination of representative nodes constituting a representative node after meta clustering (hereinafter, referred to as a meta representative node) are determined as illustrated in (b) of FIG. 17 .
The example illustrated in FIG. 17 exemplifies a case where, as a result of meta clustering, one meta representative node NN1 is constituted by representative nodes N11, N12, and N21, one meta representative node NN2 is constituted by representative nodes N22, N31, and N32, and one meta representative node NN3 is constituted by representative nodes N13, N23, and N33.
Note that, in the present embodiment, the number of node divisions (nine in this example) set at the time of execution of clustering may be the number of divisions determined by default or the number of divisions set by a user. In a case where a user is caused to set the initial number of node divisions, the information processing device 201 may further include an operation input unit that functions as a node setting unit that causes a user to set the number of node divisions.
2.3 Action and Effect
As described above, according to the present embodiment, since the initial node vector (initial value) of each representative node is determined on the basis of an average value of the processed spectral data for each dimension and an eigenvalue obtained by dimensional compression of the processed spectral data, it is possible to set the same initial node vector all the time for the data group 302 having the same population. As a result, it is possible to avoid that a clustering result varies depending on execution.
In addition, according to the present embodiment, by allocating all pieces of the processed spectral data to be analyzed to any of the representative nodes N11 to N33, and then updating the representative node vector of each of the representative nodes N11 to N33 using the processed spectral data allocated to each of the representative nodes N11 to N33, it is possible to avoid that a clustering result varies depending on execution.
Furthermore, in the present embodiment, since the update of the representative node vector is executed after the allocation of all pieces of the processed spectral data is completed, processing from the allocation of the processed spectral data to the update of the representative node vector can be subdivided and executed by different information processing devices.
Furthermore, since it is possible to reduce the amount of data of which each information processing device is in charge by subdividing the processing, it is also possible to suppress occurrence of a problem such as an increase in processing time or a curse of dimensions at the time of clustering.
Note that in the present embodiment, the case where the spectral data after fluorescence separation is subjected to clustering by the clustering processing unit 214 has been exemplified, but the clustering target is not limited thereto, and for example, the measurement data 2 acquired by the measurement device 3 may be subjected to clustering by the clustering processing unit 214. In this case, the pre-processing/fluorescence separation unit 211 executes pre-processing on the measurement data 2, and inputs spectral data before fluorescence separation obtained by the pre-processing to the clustering processing unit 214. Then, the clustering processing unit 214 executes the above-described clustering processing on the input spectral data before fluorescence separation.
In addition, in the present embodiment, the case where the pre-processing/fluorescence separation unit 211 executes pre-processing on a fluorescence spectrum for each fluorescent dye obtained by executing fluorescence separation processing on spectral data included in the measurement data 2 has been exemplified, but the operation of the pre-processing/fluorescence separation unit 211 is not limited thereto, and for example, the pre-processing/fluorescence separation unit 211 may execute pre-processing on spectral data included in the measurement data 2 first, and then may execute fluorescence separation processing.
Other configurations, operations, and effects may be similar to those of the above-described first embodiment, and therefore detailed description thereof is omitted here.
3. Hardware Configuration of Information Processing Device
Next, a hardware configuration of the information processing devices 1 and 201 according to the above-described embodiments will be described with reference to FIG. 18 . FIG. 18 is a block diagram illustrating a hardware configuration example of the information processing devices according to the embodiments of the present disclosure. Note that, in the following description, the information processing device 1 will be exemplified, but the same applies also to the information processing device 201 similarly.
As illustrated in FIG. 18 , the information processing device 1 includes a central processing unit (CPU) 901, a read only memory (ROM) 903, and a random access memory (RAM) 905. In addition, the information processing device 1 includes a host bus 907, a bridge 909, an external bus 911, an interface 913, an input device 915, an output device 917, a storage device 919, a drive 921, a connection port 925, and a communication device 929. The information processing device 1 may include a processing circuit called a digital signal processor (DSP) or an application specific integrated circuit (ASIC) instead of or in addition to the CPU 901.
The CPU 901 functions as an arithmetic processing device and a control device, and controls the overall operation or a part thereof in the information processing device 1 according to various programs recorded in the ROM 903, the RAM 905, the storage device 919, or a removable recording medium 923. For example, the CPU 901 controls the overall operation of each functional unit included in the information processing device 1 in the above embodiment. The ROM 903 stores a program, an operation parameter, and the like used by the CPU 901. The RAM 905 primarily stores a program used in execution of the CPU 901, a parameter that appropriately changes in the execution, and the like. The CPU 901, the ROM 903, and the RAM 905 are connected to each other by the host bus 907 constituted by an internal bus such as a CPU bus. Furthermore, the host bus 907 is connected to the external bus 911 such as a peripheral component interconnect/interface (PCI) bus via the bridge 909.
The input device 915 is a device operated by a user, such as a mouse, a keyboard, a touch panel, a button, a switch, or a lever. The input device 915 may be, for example, a remote control device using an infrared ray or another radio wave, or an external connection device 927 such as a mobile phone corresponding to operation of the information processing device 1. The input device 915 includes an input control circuit that generates an input signal on the basis of information input by a user and outputs the input signal to the CPU 901. By operating the input device 915, a user inputs various types of data to the information processing device 1 or instructs the information processing device 1 to perform processing operation.
The output device 917 is constituted by a device capable of visually or aurally notifying a user of acquired information. The output device 917 can be, for example, a display device such as an LCD, a PDP, or an OELD, a sound output device such as a speaker or a headphone, or a printer device. The output device 917 outputs a result obtained by processing of the information processing device 1 as a video such as a text or an image, or as a sound such as audio.
The storage device 919 is a data storage device constituted as an example of a storage of the information processing device 1. The storage device 919 is constituted by, for example, a magnetic storage device such as a hard disk drive (HDD), a semiconductor storage device, an optical storage device, or a magneto-optical storage device. The storage device 919 stores a program and various types of data executed by the CPU 901, various types of data acquired from the outside, and the like.
The drive 921 is a reader/writer for the removable recording medium 923 such as a magnetic disk, an optical disk, a magneto-optical disk, or a semiconductor memory, and is built in or externally attached to the information processing device 1. The drive 921 reads information recorded in the attached removable recording medium 923 and outputs the information to the RAM 905. In addition, the drive 921 writes a record in the attached removable recording medium 923.
The connection port 925 is a port for directly connecting a device to the information processing device 1. The connection port 925 can be, for example, a universal serial bus (USB) port, an IEEE 1394 port, or a small computer system interface (SCSI) port. In addition, the connection port 925 may be an RS-232C port, an optical audio terminal, a high-definition multimedia interface (HDMI) (registered trademark) port, or the like. By connecting the external connection device 927 to the connection port 925, various types of data can be exchanged between the information processing device 1 and the external connection device 927.
The communication device 929 is, for example, a communication interface constituted by a communication device or the like for connection to a communication network NW. The communication device 929 can be, for example, a communication card for wired or wireless local area network (LAN), Bluetooth (registered trademark), or wireless USB (WUSB). In addition, the communication device 929 may be a router for optical communication, a router for asymmetric digital subscriber line (ADSL), a modem for various types of communication, or the like. The communication device 929 transmits and receives a signal and the like to and from the Internet or another communication device using a predetermined protocol such as TCP/IP. In addition, the communication network NW connected to the communication device 929 is a network connected in a wired or wireless manner, and is, for example, the Internet, a home LAN, infrared communication, radio wave communication, or satellite communication.
Note that the technical scope of the present disclosure is not limited to the above-described embodiments as they are, and various modifications can be made without departing from the gist of the present disclosure. In addition, components of different embodiments and modifications may be appropriately combined with each other.
For example, in the above embodiments, the information processing system 4 includes the information processing device 1 or 201 and the measurement device 3, but the present technology is not limited to such an example. For example, the information processing device 1 or 201 may have a function (measurement function) of the measurement device 3. In this case, the information processing system 4 is implemented by the information processing device 1 or 201. In addition, the measurement device 3 may have the functions of the information processing device 1 or 201. In this case, the information processing system 4 is implemented by the measurement device 3. In addition, the measurement device 3 may have some of the functions of the information processing device 1 or 201, and the information processing device 1 or 201 may have some of the functions of the measurement device 3.
Note that the present technology can also have the following configurations.
(1)
An information processing device including:
    • a dimensional compression unit that executes dimensional compression on each of a plurality of pieces of spectral data including a fluorescent component emitted from each of a plurality of particles labeled with one or more fluorescent dyes;
    • an initial value determination unit that determines an initial value for each of a plurality of nodes on the basis of a result of the dimensional compression; and
    • a clustering unit that executes clustering on the plurality of pieces of spectral data using the initial value.
      (2)
The information processing device according to (1), further including an average value calculation unit that calculates an average value of the plurality of pieces of spectral data for each dimension, in which
    • the initial value determination unit determines the initial value of each of the plurality of nodes on the basis of the average value of the plurality of pieces of spectral data for each dimension in addition to a result of the dimensional compression.
      (3)
The information processing device according to (1) or (2), in which
    • the clustering unit includes:
    • an allocation unit that allocates each of the plurality of pieces of spectral data to any one of the plurality of nodes; and
    • an update unit that updates a node vector of each of the plurality of nodes on the basis of the spectral data allocated to each of the plurality of nodes.
      (4)
The information processing device according to (3), in which
    • the clustering unit includes:
    • a node number changing unit that changes the number of nodes by executing consensus clustering on the node vector of each of the plurality of nodes updated by the update unit; and
    • a meta clustering unit that executes meta clustering based on the node vector of each of the plurality of nodes before change on a node after change by the node number changing unit.
      (5)
The information processing device according to (4), in which
    • the clustering unit further includes a division unit that divides the plurality of pieces of spectral data into two or more groups, and
    • the allocation unit allocates each of the plurality of pieces of spectral data to any one of the plurality of nodes for each of the two or more groups.
      (6)
The information processing device according to (5), in which
    • the clustering unit includes the allocation units as many as or less than the number of groups, and
    • the allocation units are respectively arranged in different information processing devices.
      (7)
The information processing device according to any one of (1) to (6), in which
    • the clustering unit executes clustering using a self-organizing map (SOM) algorithm.
      (8)
The information processing device according to any one of (1) to (7), further including
    • a node setting unit that causes a user to set the number of nodes.
      (9)
The information processing device according to any one of (1) to (8), in which
    • the dimensional compression unit executes main component analysis on the plurality of pieces of spectral data as the dimensional compression.
      (10)
The information processing device according to any one of (1) to (9), in which
    • the spectral data is spectral data measured by a spectrum type flow cytometer.
      (11)
The information processing device according to any one of (1) to (10), further including
    • a pre-processing unit that executes scale conversion on each of the plurality of pieces of spectral data, in which
    • the dimensional compression unit executes the dimensional compression on each of the plurality of pieces of spectral data on which the scale conversion has been executed.
      (12)
The information processing device according to (11), further including
    • a fluorescence separation unit that separates each of the plurality of pieces of spectral data into a fluorescence spectrum for each of the fluorescent dyes, in which
    • the pre-processing unit executes the scale conversion on each of the plurality of fluorescence spectra.
      (13)
The information processing device according to (11) or (12), in which
    • the pre-processing unit performs conversion of non-linear processing as the scale conversion.
      (14)
The information processing device according to (13), in which
    • the pre-processing unit performs logicle conversion, log conversion, or bi-exponential conversion as the scale conversion.
      (15)
The information processing device according to any one of (1) to (14), further including
    • a display control unit that displays a result of the clustering performed by the clustering unit.
      (16)
The information processing device according to any one of (11) to (14), further including
    • a display control unit that displays a result of the clustering performed by the clustering unit, in which
    • the display control unit displays data on which scale conversion has been performed by the pre-processing unit.
      (17)
An information processing method including:
    • executing dimensional compression on each of a plurality of pieces of spectral data including a fluorescent component emitted from each of a plurality of particles labeled with one or more fluorescent dyes;
    • an initial value is determined for each of a plurality of nodes on the basis of a result of the dimensional compression; and
    • executing clustering on the plurality of pieces of spectral data using the initial value.
      (18)
A program for causing a computer to execute:
    • a step of executing dimensional compression on each of a plurality of pieces of spectral data including a fluorescent component emitted from each of a plurality of particles labeled with one or more fluorescent dyes;
    • a step of determining an initial value for each of a plurality of nodes on the basis of a result of the dimensional compression; and
    • a step of executing clustering on the plurality of pieces of spectral data using the initial value.
      (19)
An information processing system comprising:
    • a measurement device that detects each of a plurality of pieces of spectral data including a fluorescent component emitted from each of a plurality of particles labeled with one or more fluorescent dyes; and
    • an information processing device that performs clustering on the plurality of pieces of spectral data detected by the measurement device, in which
    • the information processing device includes:
    • a dimensional compression unit that executes dimensional compression on each of the plurality of pieces of spectral data detected by the measurement device;
    • an initial value determination unit that determines an initial value for each of a plurality of nodes on the basis of a result of the dimensional compression; and
    • a clustering unit that executes clustering on the plurality of pieces of spectral data using the initial value.
REFERENCE SIGNS LIST
    • 1, 201 Information Processing Device
    • 2 Measurement Data
    • 3 Measurement Device
    • 4, 200 Information Processing System
    • 11 Pre-Processing Unit
    • 12 Pre-Processing Parameter Table
    • 13 Spectrum Output Unit
    • 14, 214 Clustering Processing Unit
    • 15 Clustering Result Presentation Unit
    • 16 Fluorescence Separation Unit
    • 17 Normal Analysis Presentation Unit
    • 211 Pre-Processing/Fluorescence Separation Unit
    • 302 Data Group

Claims (19)

The invention claimed is:
1. An information processing device comprising:
a pre-processing unit that converts actual observation values from a measurement device according to a user-selected parameter;
a spectrum output unit that generates an image of a spectrum plot on at least one display device using spectral data that has been pre-processed by the pre-processing unit;
a dimensional compression unit that executes dimensional compression on each of a plurality of pieces of spectral data including a fluorescent component emitted from each of a plurality of particles labeled with one or more fluorescent dyes,
wherein a statistical data analysis method used by the dimensional compression unit comprises t-distributed Stochastic Neighbor Embedding (T-SNE);
an initial value determination unit that determines an initial value for each of a plurality of nodes on a basis of a result of the dimensional compression;
a clustering unit that executes clustering on the plurality of pieces of spectral data using the initial value; and
a clustering result presentation unit that displays a clustering result by the clustering unit on the at least one display device.
2. The information processing device according to claim 1, further comprising
an average value calculation unit that calculates an average value of the plurality of pieces of spectral data for each dimension, wherein
the initial value determination unit determines the initial value of each of the plurality of nodes on a basis of the average value of the plurality of pieces of spectral data for each dimension in addition to a result of the dimensional compression.
3. The information processing device according to claim 1, wherein
the clustering unit includes:
an allocation unit that allocates each of the plurality of pieces of spectral data to any one of the plurality of nodes; and
an update unit that updates a node vector of each of the plurality of nodes on a basis of the spectral data allocated to each of the plurality of nodes.
4. The information processing device according to claim 3, wherein
the clustering unit includes:
a node number changing unit that changes the number of nodes by executing consensus clustering on the node vector of each of the plurality of nodes updated by the update unit; and
a meta clustering unit that executes meta clustering based on the node vector of each of the plurality of nodes before change on a node after change by the node number changing unit.
5. The information processing device according to claim 4, wherein
the clustering unit further includes a division unit that divides the plurality of pieces of spectral data into two or more groups, and
the allocation unit allocates each of the plurality of pieces of spectral data to any one of the plurality of nodes for each of the two or more groups.
6. The information processing device according to claim 5, wherein
the clustering unit includes the allocation units as many as or less than the number of groups, and
the allocation units are respectively arranged in different information processing devices.
7. The information processing device according to claim 1, wherein
the clustering unit executes clustering using a self-organizing map (SOM) algorithm.
8. The information processing device according to claim 1, further comprising
a node setting unit that causes a user to set the number of nodes.
9. The information processing device according to claim 1, wherein
the dimensional compression unit executes main component analysis on the plurality of pieces of spectral data as the dimensional compression.
10. The information processing device according to claim 1, wherein
the spectral data is spectral data measured by a spectrum type flow cytometer.
11. The information processing device according to claim 1, further comprising
a pre-processing unit that executes scale conversion on each of the plurality of pieces of spectral data, wherein
the dimensional compression unit executes the dimensional compression on each of the plurality of pieces of spectral data on which the scale conversion has been executed.
12. The information processing device according to claim 11, further comprising
a fluorescence separation unit that separates each of the plurality of pieces of spectral data into a fluorescence spectrum for each of the fluorescent dyes, wherein
the pre-processing unit executes the scale conversion on each of the plurality of fluorescence spectra.
13. The information processing device according to claim 11, wherein
the pre-processing unit performs conversion of non-linear processing as the scale conversion.
14. The information processing device according to claim 13, wherein
the pre-processing unit performs logicle conversion, log conversion, or bi-exponential conversion as the scale conversion.
15. The information processing device according to claim 11, further comprising
a display control unit that displays a result of the clustering performed by the clustering unit, wherein
the display control unit displays data on which scale conversion has been performed by the pre-processing unit.
16. The information processing device according to claim 1, further comprising
a display control unit that displays a result of the clustering performed by the clustering unit.
17. An information processing method comprising:
converting actual observation values from a measurement device according to a user-selected parameter;
generating an image of a spectrum plot on at least one display device using spectral data that has been pre-processed by a pre-processing unit;
executing dimensional compression on each of a plurality of pieces of spectral data including a fluorescent component emitted from each of a plurality of particles labeled with one or more fluorescent dyes,
wherein a statistical data analysis method used the dimensional compression unit comprises t-distributed Stochastic Neighbor Embedding (T-SNE);
determining an initial value for each of a plurality of nodes on a basis of a result of the dimensional compression;
executing clustering on the plurality of pieces of spectral data using the initial value; and
displaying a clustering result by the clustering unit on the at least one display device.
18. A non-transitory computer readable medium storing instructions that, when executed by a computer, cause the computer to execute the processes of:
converting actual observation values from a measurement device according to a user-selected parameter;
generating an image of a spectrum plot on at least one display device using spectral data that has been pre-processed by a pre-processing unit;
executing dimensional compression on each of a plurality of pieces of spectral data including a fluorescent component emitted from each of a plurality of particles labeled with one or more fluorescent dyes,
wherein a statistical data analysis method used the dimensional compression unit comprises t-distributed Stochastic Neighbor Embedding (T-SNE);
determining an initial value for each of a plurality of nodes on a basis of a result of the dimensional compression; and
executing clustering on the plurality of pieces of spectral data using the initial value; and
displaying a clustering result by the clustering unit on the at least one display device.
19. An information processing system comprising:
a measurement device that detects a plurality of pieces of spectral data including a fluorescent component emitted from each of a plurality of particles labeled with one or more fluorescent dyes; and
an information processing device that performs clustering on the plurality of pieces of spectral data detected by the measurement device, wherein
the information processing device includes:
a pre-processing unit that converts actual observation values from the measurement device according to a user-selected parameter;
a spectrum output unit that generates an image of a spectrum plot on at least one display device using spectral data that has been pre-processed by the pre-processing unit;
a dimensional compression unit that executes dimensional compression on each of the plurality of pieces of spectral data detected by the measurement device,
wherein a statistical data analysis method used by the dimensional compression unit comprises t-distributed Stochastic Neighbor Embedding (T-SNE);
an initial value determination unit that determines an initial value for each of a plurality of nodes on a basis of a result of the dimensional compression;
a clustering unit that executes clustering on the plurality of pieces of spectral data using the initial value; and
a clustering result presentation unit that displays a clustering result by the clustering unit on the at least one display device.
US17/636,013 2019-08-23 2020-07-13 Information processing device, information processing method, program, and information processing system Active 2042-12-25 US12540892B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US19/421,764 US20260104343A1 (en) 2019-08-23 2025-12-16 Information processing device, information processing method, program, and information processing system

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
JP2019152593 2019-08-23
JP2019152593 2019-08-23
JP2020093877A JP7516870B2 (en) 2019-08-23 2020-05-29 Information processing device, information processing method, program, and information processing system
JP2020-093877 2020-05-29
PCT/JP2020/027196 WO2021039158A1 (en) 2019-08-23 2020-07-13 Information processing device, information processing method, program, and information processing system

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2020/027196 A-371-Of-International WO2021039158A1 (en) 2019-08-23 2020-07-13 Information processing device, information processing method, program, and information processing system

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US19/421,764 Continuation US20260104343A1 (en) 2019-08-23 2025-12-16 Information processing device, information processing method, program, and information processing system

Publications (2)

Publication Number Publication Date
US20220283073A1 US20220283073A1 (en) 2022-09-08
US12540892B2 true US12540892B2 (en) 2026-02-03

Family

ID=74685441

Family Applications (2)

Application Number Title Priority Date Filing Date
US17/636,013 Active 2042-12-25 US12540892B2 (en) 2019-08-23 2020-07-13 Information processing device, information processing method, program, and information processing system
US19/421,764 Pending US20260104343A1 (en) 2019-08-23 2025-12-16 Information processing device, information processing method, program, and information processing system

Family Applications After (1)

Application Number Title Priority Date Filing Date
US19/421,764 Pending US20260104343A1 (en) 2019-08-23 2025-12-16 Information processing device, information processing method, program, and information processing system

Country Status (5)

Country Link
US (2) US12540892B2 (en)
EP (1) EP4019937A4 (en)
JP (2) JP7516870B2 (en)
CN (1) CN114270168B (en)
WO (1) WO2021039158A1 (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP7608038B2 (en) * 2018-09-21 2025-01-06 浜松ホトニクス株式会社 Laser device and laser waveform control method
WO2023026742A1 (en) * 2021-08-25 2023-03-02 浜松ホトニクス株式会社 Dye image acquisition method, dye image acquisition device, and dye image acquisition program
JP2023053726A (en) * 2021-10-01 2023-04-13 トヨタ自動車株式会社 Analysis device, analysis method and analysis program
GB202206324D0 (en) * 2022-04-29 2022-06-15 King S College London Characterisation method
JP7564408B1 (en) * 2023-02-24 2024-10-08 浜松ホトニクス株式会社 Pigment data acquisition method, pigment data acquisition device, and pigment data acquisition program
CN117151229B (en) * 2023-11-01 2024-01-26 青岛聚远网络科技有限公司 Cloud reasoning method and system based on cloud side architecture
WO2025253892A1 (en) * 2024-06-05 2025-12-11 ソニーグループ株式会社 Information processing method, information processing device, and information processing system
WO2026004349A1 (en) * 2024-06-24 2026-01-02 ソニーグループ株式会社 Information processing method, information processing device, and information processing system
WO2026009599A1 (en) * 2024-07-04 2026-01-08 ソニーグループ株式会社 Information processing method, information processing device, information processing system, and program

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH063252A (en) 1992-06-19 1994-01-11 Toa Medical Electronics Co Ltd Method and device for analyzing particle
JP2007132921A (en) 2005-11-10 2007-05-31 Idexx Lab Inc Method of identifying discrete population (for example, cluster) of data in flow sight meter multidimensional data set
WO2009046185A2 (en) * 2007-10-02 2009-04-09 The Regents Of The University Of Michigan Method and apparatus for clustering and visualization of multicolor cytometry data
WO2011154258A1 (en) 2010-06-09 2011-12-15 Galderma Research & Development Method for analysis of sebum samples
CN103049581A (en) 2013-01-21 2013-04-17 北京航空航天大学 Web text classification method based on consistency clustering
US20130266959A1 (en) 2010-10-12 2013-10-10 Deutsches Rheuma-Forschungszentrum Berlin Principle component analysis (pca) - based analysis of discontinuous emission spectra in multichromatic flow cytometry
JP2016511397A (en) 2013-01-31 2016-04-14 ユニベルシテ ドゥ モンペリエ How to identify rare events
CN106548205A (en) 2016-10-21 2017-03-29 北京信息科技大学 A kind of fast automatic point of group of flow cytometry data and circle door method
CN107239791A (en) 2017-05-12 2017-10-10 东北大学 A kind of higher-dimension K means cluster centre method for optimizing based on LSH
US20170371886A1 (en) 2016-06-22 2017-12-28 Agency For Science, Technology And Research Methods for identifying clusters in a dataset, methods of analyzing cytometry data with the aid of a computer and methods of detecting cell sub-populations in a plurality of cells
US9934364B1 (en) 2017-02-28 2018-04-03 Anixa Diagnostics Corporation Methods for using artificial neural network analysis on flow cytometry data for cancer diagnosis

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102375983B (en) * 2010-08-17 2015-11-25 上海宝康电子控制工程有限公司 Based on the image segmentation processing method of area matching optimization K-means clustering algorithm
US10354204B2 (en) * 2016-04-21 2019-07-16 Sas Institute Inc. Machine learning predictive labeling system
CN106055883B (en) * 2016-05-25 2022-09-02 中国电力科学研究院 Transient stability evaluation input feature validity analysis method based on improved Sammon mapping
CN109460777B (en) * 2018-10-11 2021-09-07 北京朗镜科技有限责任公司 Picture classification method and device and computer readable storage medium

Patent Citations (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH063252A (en) 1992-06-19 1994-01-11 Toa Medical Electronics Co Ltd Method and device for analyzing particle
US5703959A (en) 1992-06-19 1997-12-30 Toa Medical Electronics Co., Ltd. Method and device for analyzing particles
JP2007132921A (en) 2005-11-10 2007-05-31 Idexx Lab Inc Method of identifying discrete population (for example, cluster) of data in flow sight meter multidimensional data set
WO2009046185A2 (en) * 2007-10-02 2009-04-09 The Regents Of The University Of Michigan Method and apparatus for clustering and visualization of multicolor cytometry data
US20090097733A1 (en) 2007-10-02 2009-04-16 The Regents Of The University Of Michigan Method and apparatus for clustering and visualization of multicolor cytometry data
US7853432B2 (en) * 2007-10-02 2010-12-14 The Regents Of The University Of Michigan Method and apparatus for clustering and visualization of multicolor cytometry data
WO2011154258A1 (en) 2010-06-09 2011-12-15 Galderma Research & Development Method for analysis of sebum samples
US20130266959A1 (en) 2010-10-12 2013-10-10 Deutsches Rheuma-Forschungszentrum Berlin Principle component analysis (pca) - based analysis of discontinuous emission spectra in multichromatic flow cytometry
CN103049581A (en) 2013-01-21 2013-04-17 北京航空航天大学 Web text classification method based on consistency clustering
JP2016511397A (en) 2013-01-31 2016-04-14 ユニベルシテ ドゥ モンペリエ How to identify rare events
US20170371886A1 (en) 2016-06-22 2017-12-28 Agency For Science, Technology And Research Methods for identifying clusters in a dataset, methods of analyzing cytometry data with the aid of a computer and methods of detecting cell sub-populations in a plurality of cells
CN106548205A (en) 2016-10-21 2017-03-29 北京信息科技大学 A kind of fast automatic point of group of flow cytometry data and circle door method
US9934364B1 (en) 2017-02-28 2018-04-03 Anixa Diagnostics Corporation Methods for using artificial neural network analysis on flow cytometry data for cancer diagnosis
CN107239791A (en) 2017-05-12 2017-10-10 东北大学 A kind of higher-dimension K means cluster centre method for optimizing based on LSH

Non-Patent Citations (26)

* Cited by examiner, † Cited by third party
Title
[No Author Listed], Linear vs. Logarithmic Scaling. FlowJo University. Nov. 12, 2018. https://www.bdj.co.jp/biosciences/support/flowjo-university/contents/start/linear_logarithmic.html and https://www.flowjo.com/learn/flowjo-university/flowjo/before-flowjo/59 (non-official translation, last accessed Feb. 14, 2022), 9 pages.
Akash Dubey: "The Mathematics Behind Principal Component Analysis Towards Data Science", Dec. 20, 2018 (Dec. 20, 2018), XP93047502, retrieved on May 16, 2023] * Section Mathematics Behind PCA *.
Anonymous: "Dimensionality Reduction—Google Search", Jul. 22, 2024 (Jul. 22, 2024), XP093188330.
Deng Xuran et al., "Overview of clustering center initial value selection method", Chinese Journal of Electronic Science Research, Apr. 30, 2019, pp. 354-359, vol. 14, No. 4.
Di-Yuan Tzeng et al: "A review of principal component analysis and its applications to color technology", Color Research & Application, John Wiley & Sons, Inc, Jan. 28, 2005, pp. 84-98, vol. 30, No. 2, XP071624742.
Enrico Lugli, et al., "Subject Classification Obtained by Cluster Analysis and Principal Component Analysis Applied to Flow Cytometric Data", Cytometry Part A, Mar. 12, 2007, pp. 334-344, vol. 17A, No. 5.
Extended European Search Report issued May 26, 2023 in connection with European Application No. 17806771.6.
International Preliminary Report on Patentability and English translation thereof mailed Mar. 3, 2022 in connection with International Application No. PCT/JP2020/027196.
International Search Report and English translation thereof mailed Sep. 29, 2020 in connection with International Application No. PCT/JP2020/027196.
International Written Opinion and English translation thereof mailed Sep. 29, 2020 in connection with International Application No. PCT/JP2020/027196.
Lenhardt et al., Artificial neural networks for processing fluorescence spectroscopy data in skin cancer diagnostics. Physica Scripta. Nov. 15, 2013;2013(T157):014057. 4 pages.
Van Gassen Sofie et al: "FlowSOM: Using Self-Organizing Maps for Visualization and Interpretation of Cytometry Data", Cytometry A, Jan. 8, 2015 (Jan. 8, 2015), vol. 87, No. 7, pp. 636-645, XP055794113, Hoboken, USA ISSN: 1552-4922, DOI:10.1002/cyto.a.22625.
Ye et al., Ultrafast clustering of single-cell flow cytometry data using FlowGrid. BMC Syst Biol. Apr. 5, 2019;13(Suppl 2):35. doi: 10.1186/s12918-019-0690-2.
[No Author Listed], Linear vs. Logarithmic Scaling. FlowJo University. Nov. 12, 2018. https://www.bdj.co.jp/biosciences/support/flowjo-university/contents/start/linear_logarithmic.html and https://www.flowjo.com/learn/flowjo-university/flowjo/before-flowjo/59 (non-official translation, last accessed Feb. 14, 2022), 9 pages.
AKASH DUBEY: "The Mathematics Behind Principal Component Analysis | by Akash Dubey | Towards Data Science", 20 December 2018 (2018-12-20), XP093047502, Retrieved from the Internet <URL:https://towardsdatascience.com/the-mathematics-behind-principal-component-analysis-fff2d7f4b643> [retrieved on 20230516]
ANONYMOUS: "dimensionality reduction - Google Search", 22 July 2024 (2024-07-22), XP093188330, Retrieved from the Internet <URL:https://www.google.com/search?q=dimenssionality+reduction&rlz=1C1GCEA_enNL999NL999&oq=dimenssionality+reduction&gs_lcrp=EgZjaHJvbWUyBggAEEUYOTIJCAEQABgNGIAEMgkIAhAAGA0YgAQyCQgDEAAYDRiABDIJCAQQABgNGIAEMgkIBRAAGA0YgAQyCQgGEAAYDRiABDIJCAcQABgNGIAEMgkICBAAGA0YgAQyCQgJEAAYDRiABNIBCDQwNTRqMGo3qAIAsAIA&sourceid=chrome&ie=UTF-8>
Deng Xuran et al., "Overview of clustering center initial value selection method", Chinese Journal of Electronic Science Research, Apr. 30, 2019, pp. 354-359, vol. 14, No. 4.
DI‐YUAN TZENG; ROY S. BERNS: "A review of principal component analysis and its applications to color technology", COLOR RESEARCH & APPLICATION, JOHN WILEY & SONS, INC., US, vol. 30, no. 2, 28 January 2005 (2005-01-28), US, pages 84 - 98, XP071624742, ISSN: 0361-2317, DOI: 10.1002/col.20086
Enrico Lugli, et al., "Subject Classification Obtained by Cluster Analysis and Principal Component Analysis Applied to Flow Cytometric Data", Cytometry Part A, Mar. 12, 2007, pp. 334-344, vol. 17A, No. 5.
Extended European Search Report issued May 26, 2023 in connection with European Application No. 17806771.6.
International Preliminary Report on Patentability and English translation thereof mailed Mar. 3, 2022 in connection with International Application No. PCT/JP2020/027196.
International Search Report and English translation thereof mailed Sep. 29, 2020 in connection with International Application No. PCT/JP2020/027196.
International Written Opinion and English translation thereof mailed Sep. 29, 2020 in connection with International Application No. PCT/JP2020/027196.
Lenhardt et al., Artificial neural networks for processing fluorescence spectroscopy data in skin cancer diagnostics. Physica Scripta. Nov. 15, 2013;2013(T157):014057. 4 pages.
VAN GASSEN SOFIE, CALLEBAUT BRITT, VAN HELDEN MARY J., LAMBRECHT BART N., DEMEESTER PIET, DHAENE TOM, SAEYS YVAN: "FlowSOM: Using self-organizing maps for visualization and interpretation of cytometry data : FlowSOM", CYTOMETRY A, WILEY-LISS, HOBOKEN, USA, vol. 87, no. 7, 1 July 2015 (2015-07-01), Hoboken, USA, pages 636 - 645, XP055794113, ISSN: 1552-4922, DOI: 10.1002/cyto.a.22625
Ye et al., Ultrafast clustering of single-cell flow cytometry data using FlowGrid. BMC Syst Biol. Apr. 5, 2019;13(Suppl 2):35. doi: 10.1186/s12918-019-0690-2.

Also Published As

Publication number Publication date
WO2021039158A1 (en) 2021-03-04
EP4019937A1 (en) 2022-06-29
JP2024133588A (en) 2024-10-02
US20260104343A1 (en) 2026-04-16
JP7835250B2 (en) 2026-03-25
CN114270168A (en) 2022-04-01
US20220283073A1 (en) 2022-09-08
CN114270168B (en) 2026-03-27
JP2021036224A (en) 2021-03-04
JP7516870B2 (en) 2024-07-17
EP4019937A4 (en) 2023-06-28

Similar Documents

Publication Publication Date Title
US12540892B2 (en) Information processing device, information processing method, program, and information processing system
US8779387B2 (en) Method and system for detecting fluorochromes in a flow cytometer
CN113168529B (en) Information processing device, information processing method and program
CN109073536A (en) Information processing unit, information processing method, program and information processing system
JP7367343B2 (en) Preparative separation system and method
US11561161B2 (en) Information processing apparatus, information processing method, and program
US20220276148A1 (en) Information processing device, display method, program, and information processing system
EP3882603B1 (en) Information processing device, information processing method, and computer program
WO2021182031A1 (en) Particle analysis system and particle analysis method
CN102262043B (en) Microparticle analysis device and data display method
US20220317019A1 (en) Particle analysis system having autofluorescence spectrum correction
CN116261657A (en) Server system, information processing system, data acquisition client terminal, data analysis client terminal, and information processing method
JP7564408B1 (en) Pigment data acquisition method, pigment data acquisition device, and pigment data acquisition program
WO2025158915A1 (en) Information processing method, information processing device, and information processing system
WO2026042344A1 (en) Pigment data acquisition method, pigment data acquisition device, and pigment data acquisition program
CN114846316A (en) Information processing device, particle measuring system, particle distributing device, particle distributing system, information processing method, and information processing program

Legal Events

Date Code Title Description
FEPP Fee payment procedure

Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

AS Assignment

Owner name: SONY GROUP CORPORATION, JAPAN

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:YAMANE, KENJI;ENOKI, JUNICHIRO;REEL/FRAME:059956/0468

Effective date: 20220107

STPP Information on status: patent application and granting procedure in general

Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: FINAL REJECTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: ADVISORY ACTION COUNTED, NOT YET MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: ADVISORY ACTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION

STPP Information on status: patent application and granting procedure in general

Free format text: ALLOWED -- NOTICE OF ALLOWANCE NOT YET MAILED

Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS

STPP Information on status: patent application and granting procedure in general

Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT RECEIVED

STPP Information on status: patent application and granting procedure in general

Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT VERIFIED

STPP Information on status: patent application and granting procedure in general

Free format text: WITHDRAW FROM ISSUE AWAITING ACTION

STPP Information on status: patent application and granting procedure in general

Free format text: AWAITING TC RESP., ISSUE FEE NOT PAID

Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS

STPP Information on status: patent application and granting procedure in general

Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT VERIFIED

STCF Information on status: patent grant

Free format text: PATENTED CASE