Multiple particle analysis or 3D classification
Multiple particle analysis (MPA) is typically used to classify particles with different components or different conformations. It can also be used to test different starting reference maps to make sure that a consistent reconstruction is produced.
MPA follow the same steps as single particle analysis, with some modifications to deal with multiple reference maps. Steps 1-5 for SPA are identical, as described here.
1. Multiple reference maps
MPA requires multiple reference maps, which can be obtained in many different ways. Ideally, all the reference maps should have the same quality, such as the same resolution, the same signal-to-noise ratio, etc. However, real reference maps typically have significant differences, not just in structure but also quality. To level the playing field to the best extent, all reference maps should be handled the same way. This may involve low-pass filtering to the same resolution, typically to the match the resolution of the worst map. Also, masking out as much of the background is desirable, and such masking should be doe in the same way for all maps.
1.1. Synthetic reference maps:
In the case of a project with no prior information on the structure of the particle, one starting point is to generate one or more synthetic reference maps. Multiple different synthetic maps can be used to make sure that the same structure is eventually obtained in all cases.1.2. Reference maps from atomic structures:
Multiple atomic structures can be generated in various software packages suitable for molecular manipulation. These structures can be converted into density maps and used as references.1.3. Reconstructing multiple maps using particles with orientations:
Given a SPA project where the particle orientations have already been determined, multiple maps can be calculated in various ways. Different selection schemes are available using the program bpartsel and one of the following options:- -rank 4,1 Select a number of groups from FOM ranking and flag to adjust for defocus.
- -random 23.6 Select: random selection to get the given fraction or percentage.
- -bootstrap 527 Select: random selection with replacement up to the given number.
Multiple maps are then reconstructed based on the selection in the parameter file:
breconstruct -v 3 -resol 15 -select 5 -rescale 0,1 -sym I -recon phi6_ref_05.map -out phi6_ref_05.star phi6_run1.star
For the -bootstrap option in bpartsel, use the corresponding -bootstrap option in breconstruct.
2. Multiple orientations for each particle
For each reference map, a full orientation-finding run is done with all the particles. It is important that all parameters used (such as resolution limits and annulli) should be the same for the different runs to ensure that the FOM's are comparable. This generates a parameter file for each reference map with the corresponding origins, orientations and FOM's.3. Merging and selecting particles
The multiple output files resulting from orientation-finding runs must be merged and a selection of orientation parameters done for each particle:
bpartmulti -v 7 -all -select fom -fom 0.4 -out phi6_selected.star phi6_*_out.star
4. Multiple map reconstruction
In the parameter file generated during merging and selection, the selection column contains the assignment of the particle to a reference map. The corresponding new reconstruction is calculated using this selection:
breconstruct -v 3 -resol 12 -select 3 -rescale 0,1 -sym I -recon phi6_recon_03.map -out phi6_recon_03.star phi6_selected.star
5. Selecting maps for merging and rejection
Classes with small numbers of particles (e.g., < 5%) are typically rejected and not further considered.
The reconstructions from the remaining classes are then compared pair-wise to estimate the similarities and differences:
bmapdist -v 1 -cut 0.3,70 -resol 10 -out phi6_mapdist.star -matrix phi6_mapdist.dat phi6_recon*.pif
This outputs several tables:
- Correlation coefficients: Real space correlation
- -log(Correlation coefficients): Correlation converted to a distance form
- R factors: Difference between maps
- Resolution based on Fourier shell correlation:
- Resolution based on differential phase residuals:
The last two tables give an indication to what resolution each pair of maps are similar.
The matrix of similarities generated can be further used for cluster analysis and to generate an image showing the matrix:
bmatrix -verb 7 -pref 0.5 -out phi6_mapdist.png phi6_mapdist.dat<
bcolour -v 7 -color 0,255 phi6_mapdist.png phi6_mapdist_col.png
Maps that are considered to be very similar can be added to each other.
To generate a larger number of new reference maps for the next round of orientation-finding, intermediary maps can be produced by merging pairs of maps.
6. Convergence
Convergence of the iterative refinement of particle orientations and classification is achieved when:
- The number of particles in each class remains approximately constant
- The pair-wise comparisons of the maps remain about the same.