Oligonucleotide Selective Detection by Levitated Optomechanics
Timothy Wilson, Owen J. L. Rackham, Hendrik Ulbricht

TL;DR
This paper explores using levitated optomechanics to detect specific signals from oligonucleotides attached to silica nanoparticles.
Contribution
A novel optomechanical method is introduced for detecting oligonucleotide-specific signals using functionalized silica nanoparticles in optical traps.
Findings
Functionalized and nonfunctionalized nanoparticles showed distinct oscillation frequency and amplitude differences.
Lorentzian curve fitting and random forest modeling confirmed significant differences between particle types.
Transmission electron microscopy failed to visually distinguish between the particle types.
Abstract
This study examines the detection of oligonucleotide-specific signals in sensitive optomechanical experiments. Silica nanoparticles were functionalized using ZnCl2 and 25-mers of single-stranded deoxyadenosine and deoxythymidine monophosphate which were optically trapped by a 1550 nm wavelength laser in vacuum. In the optical trap, silica nanoparticles behave as harmonic oscillators, and their oscillation frequency and amplitude can be precisely detected by optical interferometry. The data was compared across particle types, revealing differences in frequency, width, and amplitude of peaks with respect to motion of the silica nanoparticles which can be explained by a theoretical model. Data obtained from this platform was analyzed by fitting Lorentzian curves to the spectra. Dimensionality reduction detected differences between the functionalized and nonfunctionalized silica…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 12
Figure 13- —HORIZON EUROPE European Innovation Council10.13039/100018703
- —Engineering and Physical Sciences Research Council10.13039/501100000266
- —Engineering and Physical Sciences Research Council10.13039/501100000266
- —Engineering and Physical Sciences Research Council10.13039/501100000266
- —Leverhulme Trust10.13039/501100000275
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMechanical and Optical Resonators · Microfluidic and Bio-sensing Technologies · Experimental and Theoretical Physics Studies
Detecting and differentiating DNA strands has applications in the fields of medicine, data storage and evolutionary biology. ?−? ? It is therefore of interest to develop methods to study DNA with greater speed and accuracy. The Sanger sequencing method was published in 1977? with parallelization and high-throughput now standard in modern techniques.? This research presents an alternative method based upon the optical properties of DNA nucleotides.
Optical trapping was originally observed by Arthur Ashkin in 1970 to contain micron-sized particles.? This technique, known as optical tweezing, has since found applications in biosensing and live cell imaging in a solution. ?,? Optical trapping in vacuum is commonly referred to as levitated optomechanics? and is the approach used in this work. By levitated optomechanics, it is possible to measure tiny forces of trapped particles on the order of 10^–20^ N,? leading to the notion that a trapped particle functionalized with DNA might be distinguished from those particles that do not have surface modifications.
Silica is routinely used in optical trapping due to its greater refraction than water at near-infrared wavelengths, a property required for stable optical trapping in a water medium.? Silica nanoparticles were used early in the development of optical trapping by Ashkin and Dziedzic where they demonstrated levitation of 20 μm diameter silica nanoparticles at a pressure of 1 mbar. ?−? ? Factors important to consider in the choice of material include high polarizability and low absorption at the wavelength of the source. Silica meets these criteria at the 1550 nm wavelength.? The properties of silica nanoparticles, and their ability to be functionalized with DNA, are the reasons why they were used in this study.
The process of DNA adsorption onto the surface of nanoparticles remains rarely studied.? Metal ions are vital in living organisms, the regulation of biological processes? and as cofactors of DNAzymes.? One paper examined methods to functionalize silica nanoparticles with short fluorescein-tagged DNA strands using different metal ions as binding agents.? They found that Zn^2+^ ions from a ZnCl_2_ solution at a concentration of 1 mM was one of the best adsorption metal ions to bind fluorescein tagged 25-mer deoxyadenosine monophosphate oligonucleotides to silica nanoparticles. This study did not carry out optical trapping of silica nanoparticles, however it did provide a foundation for the method of functionalization used in this research.
The research presented here uses the method of particle preparation for release into the optical trap in a vacuum as shown in Figure. The sonicator is used to reduce particle aggregation when released into the optical trap from the nebulizer.
Here we present the technique of optical trapping under vacuum to explore the detection of oligonucleotide functionalization differences between groups of silica nanoparticles. Silica nanoparticles with no surface modifications (standard silica nanoparticles) were tested as a reference, then 25-mer deoxyadenosine monophosphate (25A) and 25-mer deoxythymidine monophosphate (25T) oligonucleotides were added with ZnCl_2_ as a binding agent to the silica nanoparticles with concentrations of ZnCl_2_ changed for the 25T variant. The raw data for each particle type was compared. Uniform Manifold Approximation and Projection (UMAP) dimensionality reduction and random forest analysis were used to examine how well these groups of particles could be classified. Transmission Electron Microscopy (TEM) imaging was also used to attempt to visualize differences. Collectively, this provides evidence for the selective detection of these silica nanoparticles using levitated optomechanics.
The nanoparticles were released into a vacuum chamber, and when trapped, the chamber was pumped down to a consistent 3.5 mbar within the range of error of the pressure gauge. The Power Spectral Density (PSD) waveforms were recorded on an oscilloscope. After the data were collected, the particles were filtered to remove outliers. The process of calculating numerical columns and outlier removal was carried out by fitting Lorentzian curves to the f 1, f 2 and f 3 frequency peaks, which correspond to the z, x and y degrees of motion, respectively, using the tool, Optoanalysis.? The PSD, S _ xx _(ω), as seen in Figure has units of V^2^/Hz and can be written as
where γ is the conversion factor, k B is the Boltzmann constant, T 0 is the temperature of the environment, m is mass of the particle, Γ_0_ is the damping rate, ω_0_ is the natural angular frequency, and ω is the angular frequency at which the PSD is calculated. Equation can be simplified to the experimental data, S _ xx _ ^exp^, as below:
where , B = ω_0_ and C = Γ_0_ are free fit parameters. The conversion factor γ converts the PSD to units of m^2^/Hz and can be calculated as follows in eq:
The nanoparticle is assumed to be in a thermal equilibrium where T 0 = 300 K. The radius is derived from the fit parameter of the Lorentzian curve from the PSD in eq as in the literature:?
where r is the particle radius, η_air_ is the viscosity of air, d is the diameter of the atmospheric particles, P gas is the pressure measured from the pressure sensor and ρ is the particle material density. Assuming the particle to be spherical, the mass of the particle can be calculated using .
The resulting parameters were filtered and outliers removed. An outlier is defined as being greater than 1.5 interquartile ranges from the median for any numerical column. The complete raw data set is available in the . There are considerable variances in the size of silica nanoparticles, which is the primary reason for this filtering. The data analysis presented here are with outliers removed.
The raw PSD data were compared for each silica nanoparticle type. One candidate was selected from each group, and the f 1, f 2 and f 3 frequency peaks were displayed in sections for comparison. Both panels in Figure highlight differences observed between the particles. In Figurea, there is distinct separation in the PSD regarding width, amplitude and frequency of all peaks. Figureb shows differences between most frequency peaks for the 25T functionalized silica nanoparticles at different ZnCl_2_ concentrations, the exception being the similarity between the 25T silica nanoparticles at 500 μM and 750 μM ZnCl_2_ concentrations. These PSDs are very similar in frequency, amplitude and width at the f 2 and f 3 peaks.
A physics explanation for the observed trap frequency shift for the oligonucleotide base-coated silica nanoparticle in the levitated optomechanical trap is described in the . This model, which ignores the ZnCl_2_ salt layer surrounding the surface of the silica nanoparticles, finds that the polarizability-to-mass ratio (α/m) changes depending on the oligonucleotide. It is this change in α/m, not mass, which is responsible for the frequency shift. The measure of α/m is routinely used in metrology to complement mass spectrometry and sort fullerenes and polypeptides. ?,? The estimates show that there is about a 1 kHz frequency shift per monolayer of DNA base molecule. There is a difference between adenine and thymine, and it is estimated that there is a frequency shift of 30 mHz corresponding to a single adenine molecule when compared to a single thymine base.
Dimensionality reduction is often used in the field of machine learning to compress a data set with many features, or columns, into a manageable number of components.? This technique decreases the complexity of these data and improves the accuracy of classification.? UMAP is a dimensionality reduction technique that can be run in a supervised learning mode to maximize the space between known classes in low-dimensional space that have features which are nonlinearly correlated. ?,? This method was selected for its flexibility in the analysis of any type of high-dimensional data.? Applying UMAP to the data collected allows examination of the differences between groups in a two-dimensional graphical representation.
The random forest technique is an ensemble machine learning algorithm that is effective as a generalized classification and regression model.? Ensemble techniques have a greater accuracy than other machine learning methods, such as Support Vector Machines and K-Nearest Neighbors, because a group of classifiers tends to perform more accurately than an individual. ?,? A random forest is a group of decision trees, where each tree provides its own classification, and these are considered collectively through a vote to reach a classification consensus. The overall random forest algorithm considered the classification with the greatest number of votes from all the trees in the forest.? Usefully, the random forest can return a measure of feature importance.? Furthermore, the random forest model also produces an accuracy score, giving an indication of its performance. This is calculated using the Out of Bag Error which assesses the mean misclassification ratio of samples not used for training.?
To assess the performance of the random forest classifier in this experiment, a cross-validation technique was used. Accuracy scores from a single random forest run, particularly on a data set of small sample size, are challenging to interpret and often do not give a complete picture. Monte Carlo cross-validation (MCCV) is a method which is suitable for small sample sizes. It functions by randomizing the samples in each training and test data set and can be run for as many iterations as desired for robustness.? MCCV can be applied to a random forest classification problem to validate the accuracy of the model, or optimize the features for use in future data.? A recent study compared resampling methods and found that no technique was consistently better than the others.?
The PSD comparison of one particle from each class in Figure suggests that there is a difference in optical properties between the silica nanoparticle groups. UMAP dimensionality reduction was used to plot the features of the particles from the Optoanalysis? fitting in two dimensions. The key objective of this analysis was to determine if it is possible to detect a difference between standard and oligonucleotide-functionalized silica nanoparticle. The clustering depicted in Figurea for 25A and 25T silica nanoparticles at the same 1000 μM concentration of ZnCl_2_ suggests there is variation in the data that can be explained by silica nanoparticle status. The dimensionality reduction has considered all available features generated by the Lorentzian curve fitting to the raw data. There is a separation between the three particle types, with no overlap between these groups. Next, the UMAP method was applied to investigate if changing the concentration of ZnCl_2_ has a detectable effect on 25T functionalization to the surface of silica nanoparticles. The separation observed in Figureb between all concentrations of ZnCl_2_ shows agreement with data from the literature? in that different concentrations of the binding agent can cause measurable changes in the quantity of DNA on the surface of silica nanoparticles. Both 2-dimensional UMAP clustering figures demonstrate the effectiveness of this method in clustering individual groups and separating the particle type classes.
A random forest model was trained on the Optoanalysis tool parameters with outliers removed. The data sets were split into 80% for training and 20% for testing. Training parameters and features were iterated through, and the optimal combination was selected to the deliver the best model accuracy. Due to the limitation in size of the data sets, where the 25A and 25T silica nanoparticles at 1000 μM ZnCl_2_ and standard silica nanoparticles data has 64 entries, and the 25T silica nanoparticles at different ZnCl_2_ concentrations data have 66 entries, the MCCV method was utilized to give a more complete understanding of random forest performance. Figurea shows that the random forest model performs well over the 300 iterations, at best there is perfect accuracy, at worst it is 62% accurate, the mean accuracy is 87%. This shows agreement with the UMAP plot in Figurea with distinct clustering between particle types suggesting that classification not random. In Figureb the model has a good mean accuracy at 73%, with a minimum of 50% and maximum of 93%. An explanation for the weaker mean accuracy in Figureb compared to Figurea could be the similarity between the DNA binding for 25T 500 μM and 750 μM ZnCl_2_ silica nanoparticles.? The ranking of mean feature importance in Figurec is in the same order for both models; however, the magnitude of importance varies. The f 2 and f 3 A parameters are most important, with the f 1 and f 2 radii coming in last. This suggests a consistent importance of these features to classify the particle groups. The f 3 radius was not used in any iteration and is therefore not shown.
Transmission electron microscopy (TEM) visualization of DNA molecules is challenging.? One study was able visualize DNA duplex features such as the major groove, minor groove and helix pitch using high-resolution TEM at 70 keV; however, further information about base sequence was challenging to infer.? There are heavy metal staining methods to visualize DNA with TEM including uranyl acetate, but this is difficult due to its radioactivity.? TEM imaging of silica nanoparticles is straightforward as the particles provide good contrast for visualization.?
All particle types that were tested in the optical trap were also imaged using TEM. The same sample preparation method was used as shown in Figure. The images presented in Figure show the variance in particle size and shape. This is the reason why outliers were removed in the data analysis since considerable differences could be due to physical rather than optical properties. Despite sonicating the particle solutions, there are still clusters of particles present during imaging. The speckled white surface texture is representative of the surface roughness of the particles; they are not perfect spheres. There were no clear visual differences between the nanoparticle groups.
The analysis performed in this study found detectable differences in optical properties between the types of silica nanoparticles. Taking one example from each particle group and plotting the raw data demonstrates that the Lorentzian curve fitting to produce the UMAP plots and random forest models is based upon foundational differences in their inputs. The UMAP result indicates that there are differences in the features between the silica nanoparticle classes that cause observable separation. The random forest modeling and MCCV iterations reveal that these classes can be identified with high mean accuracies of 0.87 and 0.73. Finally, the TEM imaging suggests that there is not an observable difference between the groups of silica nanoparticles.
To overcome the indistinguishable differences between silica nanoparticle groups using TEM, future work could include the use of Scanning Electron Microscopy and Energy Dispersive X-ray Spectroscopy (SEM-EDS). This technique can detect the presence of elements in a sample of atomic number greater than 11,? so would detect the addition of zinc and phosphorus on the DNA-functionalized silica nanoparticles when compared to standard silica nanoparticles.
There is a limitation in data quantity as can be seen in the UMAP plots in Figure. To extend this analysis, a larger data set would need to be generated. However, the MCCV result regarding iterating through different variations of training and testing data does give confidence that there is a real difference in the data that are classified by the random forest model.
Having a classifier to distinguish between DNA strands could have applications in diagnostic testing, particularly where current sequencing technologies are challenged by GC-rich or repetitive regions. ?,? 25A and 25T oligonucleotides were selected in this study because they have been shown to bind well with Zn^2+^ to silica nanoparticles,? however, they are also as different from each other as possible. Changing the bases to 24T with one A, swapping around the position of the A nucleotide in the strand, or changing the strand length and sequence entirely would be necessary to demonstrate this approach for further applications. Another question not investigated in this study was the effect of duplex DNA. Experimenting with a 25A–25T duplex would also indicate if there was a uniqueness to the DNA structure. Perhaps the additional binding offered by another phosphodiester backbone or the greater mass would affect how the data is clustered.
In summary, this study could lead down several directions. The frequency shift of 30 mHz in the proposed model can be feasibly detected in future experiments and will need to be investigated. If this is proved experimentally, then the ability to detect individual mass changes due to differences in base sequence could have uses in determining mutational or methylation modifications to DNA strands. ?,? Further development requires the correct foresight into how this technology could be applied. Work and thought are needed to explore why these results are present. Exploration into the underlying cause of different optical properties may shed light on the reasoning behind the UMAP clustering and random forest classification. Bioinformatics is becoming an ever more essential component of modern medicine; perhaps this optical trap classifier will be another method in the toolkit.
Supplementary Material
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Campolongo M. J.Tan S. J.Xu J.Luo D.DNA nanomedicine: Engineering DNA as a polymer for therapeutic and diagnostic applications Adv. Drug Delivery Rev.20106260661610.1016/j.addr.2010.03.004PMC 712582720338202 · doi ↗ · pubmed ↗
- 2Doricchi A.Platnich C. M.Gimpel A.Horn F.Earle M.Lanzavecchia G.Cortajarena A. L.Liz-Marzán L. M.Liu N.Heckel R.Emerging approaches to DNA data storage: challenges and prospects ACS Nano 202216175521757110.1021/acsnano.2c 0674836256971 PMC 9706676 · doi ↗ · pubmed ↗
- 3Nesse R. M.Bergstrom C. T.Ellison P. T.Flier J. S.Gluckman P.Govindaraju D. R.Niethammer D.Omenn G. S.Perlman R. L.Schwartz M. D.Making evolutionary biology a basic science for medicine Proc. Natl. Acad. Sci. U. S. A.20101071800180710.1073/pnas.090622410619918069 PMC 2868284 · doi ↗ · pubmed ↗
- 4Sanger F.Nicklen S.Coulson A. R.DNA sequencing with chain-terminating inhibitors Proc. Natl. Acad. Sci. U. S. A.1977745463546710.1073/pnas.74.12.5463271968 PMC 431765 · doi ↗ · pubmed ↗
- 5Dewey F. E.Pan S.Wheeler M. T.Quake S. R.Ashley E. A.DNA sequencing: clinical applications of new DNA sequencing technologies Circulation 201212593194410.1161/CIRCULATIONAHA.110.97282822354974 PMC 3364518 · doi ↗ · pubmed ↗
- 6Ashkin A.Acceleration and trapping of particles by radiation pressure Phys. Rev. Lett.19702415610.1103/Phys Rev Lett.24.156 · doi ↗
- 7Rodríguez-Sevilla P.Labrador-Páez L.Jaque D.Haro-González P.Optical trapping for biosensing: materials and applications J. Mater. Chem. B 201759085910110.1039/C 7TB 01921 A 32264590 · doi ↗ · pubmed ↗
- 8Tam J. M.Castro C. E.Heath R. J.Mansour M. K.Cardenas M. L.Xavier R. J.Lang M. J.Vyas J. M.Use of an optical trap for study of host-pathogen interactions for dynamic live cell imaging Journal of Visualized Experiments: Jo VE 2011312310.3791/3123-v 21841755 PMC 3197446 · doi ↗ · pubmed ↗
