HIV-1 proteins shown in ribbon and capsid form.

The pointed end of HIV’s outer viral shell, its capsid, helps it squeeze into host cells’ nuclear pore. A team from Pitt has simulated the virus in PSC’s Bridges-2 to show how a twist in a critical protein may help it shoehorn in. Based on a figure from Yang DT et al. Illuminating an Invisible State of the HIV-1 Capsid Protein CTD Dimer using 19F NMR and Weighted Ensemble Simulations. PNAS — Feb. 25, 2025 — vol. 122 — no. 8.

Simulations on Bridges-2 Help Pitt Team Visualize Rare, Transient Shape Change in Capsid Protein

The fight to eliminate AIDS continues. An important step in infection by HIV is insertion of the viral capsid — the inner protein coat containing its genetic material — through the host cell’s nuclear pore. How the virus does this is a puzzle of squeezing a large structure though a small entrance. It’s also a potential target for better therapies. Simulations on PSC’s flagship Bridges-2 system by a University of Pittsburgh team revealed how changes in the shape of the HIV-1 capsid protein may help the capsid be more flexible. The method may also be useful for studying flexibility in other important proteins.

WHY IT’S IMPORTANT

2024 was a mixed bag in the fight against AIDS. Modern antiviral therapies have turned HIV into a chronic, but survivable, disease. Death rates are at a 20-year low. On the other hand, doctors are unlikely to meet their goal of eliminating AIDS as a health threat worldwide by 2030. In some populations, HIV infection is actually increasing. And we still don’t have a vaccine. That’s why scientists studying AIDS continue to search for weak spots in the virus’s infection cycle.

HIV inserts its genes into the host cell’s nucleus in an unusual way. It doesn’t import its RNA genetic material piecemeal. Instead, it stuffs its whole wedge-shaped capsid — basically the whole virus minus its outer membrane — through the nuclear pore in one piece.

The HIV-1 capsid protein C-terminal domain (CA-CTD) dimer. The capsid is made up of repeating units of the self-assembling CA. Dimerization of CA through the CTD holds the capsid together and connects adjacent hexameric (gray) and pentameric (magenta) subunits. The CA-CTD dimer is shown in gray ribbon representation and the CA-NTD is in white ribbon representation. The fully assembled capsid(15) consists of a hexameric lattice, with exactly 12 pentameric subunits which allow for closure of the ovoid

HIV-1 capsid protein in the dimer form (left), in pentamer and hexamer (center), and in the assembled capsid (right). From Yang DT et al. Illuminating an Invisible State of the HIV-1 Capsid Protein CTD Dimer using 19F NMR and Weighted Ensemble Simulations. PNAS — Feb. 25, 2025 — vol. 122 — no. 8.

The HIV-1 capsid protein makes up most of the protein mesh that forms the capsid. It does this by making connections between separate capsid proteins at different scales. First, either six or five copies of the protein link together via their N-terminal domains to form six-sided hexamers or five-sided pentamers. Then, the opposite end of the protein, the C-terminal domain (CTD), links with CTDs of neighboring hexamers or pentamers to connect them and form a mesh that surrounds the genetic material. The wedge of the capsid is kind of like a soccer ball, which also needs hexagons and pentagons to make a curved shape.

The CTD connections between two proteins — called dimers — in neighboring hexamers or pentamers can adopt different shapes that change the angles between those shapes. Before assembling into the capsid, about 85 percent of the CTDs connect in the D1 shape; the rest, the D2 shape.

“The capsid has these different subunits on it … and they connect to each other and form [a sort of] mesh. Those connection points are this protein/protein dimer … But we also know from [nuclear magnetic resonance experiments] … that that dimer has multiple conformations. And we can understand what the major state looks like, the major dimer conformation. It’s about 85 percent in this state. But then there’s this minor conformation, 10 to 15 percent of it’s in this other state. And it’s this very transiently occupied state, right? So we can’t really get a good structure of it.”

— Darian T. Yang, Pitt

Scientists suspected that the pointy end of the capsid’s wedge might help it squeeze through the nuclear pore. What they didn’t know was whether the ability of the CTD to shift angles between the hexamers and pentamers might play an additional role in making the capsid more flexible, and better able to deform to push through. One problem was that the D1 to D2 conversion is so fast, and the D2 shape is so outnumbered by D1. Because of this, D2 doesn’t show up well in imaging and is hard to simulate. D2 was basically invisible to both methods.

Darian T. Yang, now a postdoctoral researcher at the University of Copenhagen, wanted to know what D2 looks like, and whether the capsid protein’s ability to change shape might give the capsid extra flexibility to squeeze through. Then working as a graduate student with both Professor of Chemistry Lillian T. Chong and UPMC Rosalind Franklin Chair of Structural Biology Angela M. Gronenborn at Pitt, he carried out an exhaustive series of simulations of the capsid protein with PSC’s National Science Foundation-funded Bridges-2 supercomputer. He got computing time on Bridges-2 via an allocation from ACCESS, the NSF’s high-performance computing network, in which PSC is a leading member.

HOW PSC HELPED

The simulations would require powerful graphics processing units (GPUs), and plenty of them. Yang’s simulations would require many repetitions to work out the different ways in which the proteins can behave. Bridges-2, with 34 GPU nodes containing a total of 280 late-model GPUs, fit the bill nicely. For comparison, a high-end graphic design laptop typically has two GPUs.

“It was really nice when Bridges-2 came out … It was a big shift in the amount of speed that we could get with our simulations. The advantage of having access to [multiple] GPU computing is [that] our particular software package (WESTPA) — it’s this method called weighted ensemble path sampling — [is that it’s] very efficiently parallelizable across multiple GPUs.”

— Darian T. Yang, Pitt

The team compared its simulation results with laboratory experiments using an imaging technology called nuclear magnetic resonance, or NMR, which can track the shape of the protein via a fluorine atom that scientists attached to the natural protein. By going back and forth between real-life results and the behavior of the virtual proteins in the computer, they could be more confident the simulations were capturing reality.

The simulations, which were the first of their kind, produced switching rates and populations of the D1 and D2 shapes that matched the behavior of the capsid in the lab experiments. The team announced their findings in the prestigious Proceedings of the National Academy of Sciences U.S.A. in February 2025.

The results are encouraging because they suggest that simulations can pair with experiments to find new targets for HIV drugs or vaccines. The method should also be useful for scientists studying other biologically and medically important systems.