Exciting change is on the way! Please join us at nsf.gov for the latest news on NSF-funded research. While the NSF Science360 page and daily newsletter have now been retired, there’s much happening at nsf.gov. You’ll find current research news on the homepage and much more to explore throughout the site. Best of all, we’ve begun to build a brand-new website that will bring together news, social media, multimedia and more in a way that offers visitors a rich, rewarding, user-friendly experience.

Want to continue to receive email updates on the latest NSF research news and multimedia content? On September 23rd we’ll begin sending those updates via GovDelivery. If you’d prefer not to receive them, please unsubscribe now from Science360 News and your email address will not be moved into the new system.

Thanks so much for being part of the NSF Science360 News Service community. We hope you’ll stay with us during this transition so that we can continue to share the many ways NSF-funded research is advancing knowledge that transforms our future.

For additional information, please contact us at NewsTravels@nsf.gov

Top Story

Genetic testing has a data problem. New software can help

In recent years, the market for direct-to-consumer genetic testing has exploded. The number of people who used at-home DNA tests more than doubled in 2017, most of them in the U.S. As the tests become more popular, companies are grappling with how to store the accumulating data and how to process results quickly. A new tool called TeraPCA is now available to help. The research is funded by NSF's Directorate for Computer and Information Science and Engineering, supporting new insights in Big Data analytics. Despite our many physical differences, any two humans are about 99% the same genetically. The most common genetic variations, which contribute to the 1% that makes us different, are called single nucleotide polymorphisms, or SNPs (pronounced "snips"). There are 4 to 5 million SNPs in every person's genome. Those are a lot of data to keep track of on even one person; doing the same for thousands or millions of people is quite a feat. For the largest genetic testing companies, storing and processing data is not only expensive and technologically challenging, but comes with privacy concerns. These companies have a responsibility to protect personal health data. Storing it on their hard drives could make them attractive targets for hackers. TeraPCA was designed with these challenges in mind: processing data too large to fit on a computer's main memory at one time. It makes sense of large datasets by reading small chunks at a time. In the future, TeraPCA may be useful not only to learn about ancestry, but it could also make new genetics research possible, shedding light on disease risks and possible cures.

Visit Website | Image credit: NHGRI/NIH