![130387-Thumbnail Image.png](https://d1rbsgppyrdqq4.cloudfront.net/s3fs-public/styles/width_400/public/2021-04/130387-Thumbnail%20Image.png?versionId=6gpdIy6xLnotgLFkaLQxeegAVW1UYvRA&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIASBVQ3ZQ42ZLA5CUJ/20240603/us-west-2/s3/aws4_request&X-Amz-Date=20240603T221938Z&X-Amz-SignedHeaders=host&X-Amz-Expires=120&X-Amz-Signature=47500b6ff4932a266b697137663943647750fcc98d71d0eb3672dc6bad2584d7&itok=J9u_7aLu)
X-ray free electron lasers are used in measuring diffraction patterns from nanocrystals in the 'diffract-before-destroy' mode by outrunning radiation damage. The finite-sized nanocrystals provide an opportunity to recover intensity between Bragg spots by removing the modulating function that depends on crystal shape, i.e. the transform of the crystal shape. This shape-transform dividing-out scheme for solving the phase problem has been tested using simulated examples with cubic crystals. It provides a phasing method which does not require atomic resolution data, chemical modification to the sample, or modelling based on the protein databases. It is common to find multiple structural units (e.g. molecules, in symmetry-related positions) within a single unit cell, therefore incomplete unit cells (e.g. one additional molecule) can be observed at surface layers of crystals. In this work, the effects of such incomplete unit cells on the 'dividing-out' phasing algorithm are investigated using 2D crystals within the projection approximation. It is found that the incomplete unit cells do not hinder the recovery of the scattering pattern from a single unit cell (after dividing out the shape transforms from data merged from many nanocrystals of different sizes), assuming that certain unit-cell types are preferred. The results also suggest that the dynamic range of the data is a critical issue to be resolved in order to apply the shape transform method practically.
![130320-Thumbnail Image.png](https://d1rbsgppyrdqq4.cloudfront.net/s3fs-public/styles/width_400/public/2021-04/130320-Thumbnail%20Image.png?versionId=88ytKIF82b6vJW83BmdHpse0aB9xIggf&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIASBVQ3ZQ42ZLA5CUJ/20240530/us-west-2/s3/aws4_request&X-Amz-Date=20240530T153726Z&X-Amz-SignedHeaders=host&X-Amz-Expires=120&X-Amz-Signature=10923590cf158ddcd9f0faae9fa7e11cac4be04751b6cec78b64f3ec57e10086&itok=-BhZSDXs)
X-ray free-electron lasers provide novel opportunities to conduct single particle analysis on nanoscale particles. Coherent diffractive imaging experiments were performed at the Linac Coherent Light Source (LCLS), SLAC National Laboratory, exposing single inorganic core-shell nanoparticles to femtosecond hard-X-ray pulses. Each facetted nanoparticle consisted of a crystalline gold core and a differently shaped palladium shell. Scattered intensities were observed up to about 7 nm resolution. Analysis of the scattering patterns revealed the size distribution of the samples, which is consistent with that obtained from direct real-space imaging by electron microscopy. Scattering patterns resulting from single particles were selected and compiled into a dataset which can be valuable for algorithm developments in single particle scattering research.
![130322-Thumbnail Image.png](https://d1rbsgppyrdqq4.cloudfront.net/s3fs-public/styles/width_400/public/2021-04/130322-Thumbnail%20Image.png?versionId=YCHSKaVcygXd3HHenvL2xzS.G.kixC32&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIASBVQ3ZQ42ZLA5CUJ/20240530/us-west-2/s3/aws4_request&X-Amz-Date=20240530T153726Z&X-Amz-SignedHeaders=host&X-Amz-Expires=120&X-Amz-Signature=84121a565f274f48d6dad85b0da158b4de106a8b80e67f52cd88a28592f99586&itok=I8O649eW)
Single particle diffractive imaging data from Rice Dwarf Virus (RDV) were recorded using the Coherent X-ray Imaging (CXI) instrument at the Linac Coherent Light Source (LCLS). RDV was chosen as it is a well-characterized model system, useful for proof-of-principle experiments, system optimization and algorithm development. RDV, an icosahedral virus of about 70 nm in diameter, was aerosolized and injected into the approximately 0.1 μm diameter focused hard X-ray beam at the CXI instrument of LCLS. Diffraction patterns from RDV with signal to 5.9 Ångström were recorded. The diffraction data are available through the Coherent X-ray Imaging Data Bank (CXIDB) as a resource for algorithm development, the contents of which are described here.
![130343-Thumbnail Image.png](https://d1rbsgppyrdqq4.cloudfront.net/s3fs-public/styles/width_400/public/2021-04/130343-Thumbnail%20Image.png?versionId=XdPzPhFLDH1MZxGV7AWG1jyZPJ4l6sFp&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIASBVQ3ZQ42ZLA5CUJ/20240605/us-west-2/s3/aws4_request&X-Amz-Date=20240605T101839Z&X-Amz-SignedHeaders=host&X-Amz-Expires=120&X-Amz-Signature=5724689a6bcb97b95394e37e9f86a84124d81511fcdf7bd633ba3b9583b838a6&itok=nz_jWjjK)
![130350-Thumbnail Image.png](https://d1rbsgppyrdqq4.cloudfront.net/s3fs-public/styles/width_400/public/2021-04/130350-Thumbnail%20Image.png?versionId=w3dG6iWiJ7Jasvugcwrgw.8FSiNaZnFt&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIASBVQ3ZQ42ZLA5CUJ/20240605/us-west-2/s3/aws4_request&X-Amz-Date=20240605T170724Z&X-Amz-SignedHeaders=host&X-Amz-Expires=120&X-Amz-Signature=5f6a567148781deda2b2127cea49a69bf03bb9d1363d553f03ee72983f413f09&itok=D7ID6Po8)
The membrane proximal region (MPR, residues 649–683) and transmembrane domain (TMD, residues 684–705) of the gp41 subunit of HIV-1’s envelope protein are highly conserved and are important in viral mucosal transmission, virus attachment and membrane fusion with target cells. Several structures of the trimeric membrane proximal external region (residues 662–683) of MPR have been reported at the atomic level; however, the atomic structure of the TMD still remains unknown. To elucidate the structure of both MPR and TMD, we expressed the region spanning both domains, MPR-TM (residues 649–705), in Escherichia coli as a fusion protein with maltose binding protein (MBP). MPR-TM was initially fused to the C-terminus of MBP via a 42 aa-long linker containing a TEV protease recognition site (MBP-linker-MPR-TM).
Biophysical characterization indicated that the purified MBP-linker-MPR-TM protein was a monodisperse and stable candidate for crystallization. However, crystals of the MBP-linker-MPR-TM protein could not be obtained in extensive crystallization screens. It is possible that the 42 residue-long linker between MBP and MPR-TM was interfering with crystal formation. To test this hypothesis, the 42 residue-long linker was replaced with three alanine residues. The fusion protein, MBP-AAA-MPR-TM, was similarly purified and characterized. Significantly, both the MBP-linker-MPR-TM and MBP-AAA-MPR-TM proteins strongly interacted with broadly neutralizing monoclonal antibodies 2F5 and 4E10. With epitopes accessible to the broadly neutralizing antibodies, these MBP/MPR-TM recombinant proteins may be in immunologically relevant conformations that mimic a pre-hairpin intermediate of gp41.
![130313-Thumbnail Image.png](https://d1rbsgppyrdqq4.cloudfront.net/s3fs-public/styles/width_400/public/2021-04/130313-Thumbnail%20Image.png?versionId=kqD.YDB1VX5uQO57RpUMtuE923xLWhlm&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIASBVQ3ZQ42ZLA5CUJ/20240530/us-west-2/s3/aws4_request&X-Amz-Date=20240530T153726Z&X-Amz-SignedHeaders=host&X-Amz-Expires=120&X-Amz-Signature=05ec79073f27c2e53cb226bb4cd4865425dd3116805c5d2eeea0420e1d49b6f1&itok=zdlOjHlD)
![154121-Thumbnail Image.png](https://d1rbsgppyrdqq4.cloudfront.net/s3fs-public/styles/width_400/public/2021-08/154121-Thumbnail%20Image.png?versionId=Fxsip8Z3HeKZkUxNZjuR78268l5nYth0&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIASBVQ3ZQ42ZLA5CUJ/20240614/us-west-2/s3/aws4_request&X-Amz-Date=20240614T155534Z&X-Amz-SignedHeaders=host&X-Amz-Expires=120&X-Amz-Signature=9289f5f3670e6e7d932ed28341ba2aacb3266181ea48ee89d0042471f7743304&itok=2-2ehCc5)
photosynthesis involves the harvesting of light energy from the sun by the antenna (made
of pigments) of the PSII trans-membrane complex. The harvested excitation energy is
transferred from the antenna complex to the reaction center of the PSII, which leads to a
light-driven charge separation event, from water to plastoquinone. This phenomenal
process has been producing the oxygen that maintains the oxygenic environment of our
planet for the past 2.5 billion years.
The oxygen molecule formation involves the light-driven extraction of 4 electrons
and protons from two water molecules through a multistep reaction, in which the Oxygen
Evolving Center (OEC) of PSII cycles through 5 different oxidation states, S0 to S4.
Unraveling the water-splitting mechanism remains as a grant challenge in the field of
photosynthesis research. This requires the development of an entirely new capability, the
ability to produce molecular movies. This dissertation advances a novel technique, Serial
Femtosecond X-ray crystallography (SFX), into a new realm whereby such time-resolved
molecular movies may be attained. The ultimate goal is to make a “molecular movie” that
reveals the dynamics of the water splitting mechanism using time-resolved SFX (TRSFX)
experiments and the uniquely enabling features of X-ray Free-Electron Laser
(XFEL) for the study of biological processes.
This thesis presents the development of SFX techniques, including development of
new methods to analyze millions of diffraction patterns (~100 terabytes of data per XFEL
experiment) with the goal of solving the X-ray structures in different transition states.
ii
The research comprises significant advancements to XFEL software packages (e.g.,
Cheetah and CrystFEL). Initially these programs could evaluate only 8-10% of all the
data acquired successfully. This research demonstrates that with manual optimizations,
the evaluation success rate was enhanced to 40-50%. These improvements have enabled
TR-SFX, for the first time, to examine the double excited state (S3) of PSII at 5.5-Å. This
breakthrough demonstrated the first indication of conformational changes between the
ground (S1) and the double-excited (S3) states, a result fully consistent with theoretical
predictions.
The power of the TR-SFX technique was further demonstrated with proof-of principle
experiments on Photoactive Yellow Protein (PYP) micro-crystals that high
temporal (10-ns) and spatial (1.5-Å) resolution structures could be achieved.
In summary, this dissertation research heralds the development of the TR-SFX
technique, protocols, and associated data analysis methods that will usher into practice a
new era in structural biology for the recording of ‘molecular movies’ of any biomolecular
process.
![155026-Thumbnail Image.png](https://d1rbsgppyrdqq4.cloudfront.net/s3fs-public/styles/width_400/public/2021-09/155026-Thumbnail%20Image.png?versionId=KmO0ZaoqOFkTN0He56Vj3.vPE6ljTB41&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIASBVQ3ZQ42ZLA5CUJ/20240612/us-west-2/s3/aws4_request&X-Amz-Date=20240612T112051Z&X-Amz-SignedHeaders=host&X-Amz-Expires=120&X-Amz-Signature=abeafad3189710f3c18adf56baabbe791d177382aeab00d0a65eed6c15d754f9&itok=DxzNs1t3)
![157795-Thumbnail Image.png](https://d1rbsgppyrdqq4.cloudfront.net/s3fs-public/styles/width_400/public/2021-09/157795-Thumbnail%20Image.png?versionId=PE_yueOjTd0TeKlqGvnt4qcCTlRppGbp&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIASBVQ3ZQ42ZLA5CUJ/20240614/us-west-2/s3/aws4_request&X-Amz-Date=20240614T080346Z&X-Amz-SignedHeaders=host&X-Amz-Expires=120&X-Amz-Signature=e63c1cc69008d4642372a5db137ec9ad4bbd474e1cecfed67b371cd0221df7ca&itok=XZAnjthb)
Many photosystem II (PSII) dataset have been collected at XFELs, several of which are time-resolved (containing both dark and laser illuminated frames). Comparison of light and dark datasets requires understanding systematic errors that can be introduced during data analysis. This dissertation describes data analysis of PSII datasets with a focus on the effect of parameters on later results. The influence of the subset of data used in the analysis is also examined and several criteria are screened for their utility in creating better subsets of data. Subsets are compared with Bragg data analysis and continuous diffuse scattering data analysis.
A new tool, DatView aids in the creation of subsets and visualization of statistics. DatView was developed to improve the loading speed to visualize statistics of large SFX datasets and simplify the creation of subsets based on the statistics. It combines the functionality of several existing visualization tools into a single interface, improving the exploratory power of the tool. In addition, it has comparison features that allow a pattern-by-pattern analysis of the effect of processing parameters. \emph{DatView} improves the efficiency of SFX data analysis by reducing loading time and providing novel visualization tools.
We present results from experiments at the Linac Coherent Light Source (LCLS) demonstrating that serial femtosecond crystallography (SFX) can be performed to high resolution (~2.5 Å) using protein microcrystals deposited on an ultra-thin silicon nitride membrane and embedded in a preservation medium at room temperature. Data can be acquired at a high acquisition rate using x-ray free electron laser sources to overcome radiation damage, while sample consumption is dramatically reduced compared to flowing jet methods. We achieved a peak data acquisition rate of 10 Hz with a hit rate of ~38%, indicating that a complete data set could be acquired in about one 12-hour LCLS shift using the setup described here, or in even less time using hardware optimized for fixed target SFX. This demonstration opens the door to ultra low sample consumption SFX using the technique of diffraction-before-destruction on proteins that exist in only small quantities and/or do not produce the copious quantities of microcrystals required for flowing jet methods.