<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://3demmethods.i2pc.es/index.php?action=history&amp;feed=atom&amp;title=2025Costa_PERC</id>
	<title>2025Costa PERC - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://3demmethods.i2pc.es/index.php?action=history&amp;feed=atom&amp;title=2025Costa_PERC"/>
	<link rel="alternate" type="text/html" href="https://3demmethods.i2pc.es/index.php?title=2025Costa_PERC&amp;action=history"/>
	<updated>2026-05-01T10:43:34Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.44.2</generator>
	<entry>
		<id>https://3demmethods.i2pc.es/index.php?title=2025Costa_PERC&amp;diff=5201&amp;oldid=prev</id>
		<title>WikiSysop: Created page with &quot;== Citation ==  Costa-Gomes, B., Greer, J., Juraschko, N., Parkhurst, J., Mirecka, J., Famili, M., Rangel-Smith, C., Strickson, O., Lowe, A., Basham, M. and others 2025. PERC: a suite of software tools for the curation of cryoEM data with application to simulation, modeling and machine learning. Structural Biology and Crystallization Communications. 81, 10 (2025).  == Abstract ==  Ease of access to data, tools and models expedites scientific research. In structural biolo...&quot;</title>
		<link rel="alternate" type="text/html" href="https://3demmethods.i2pc.es/index.php?title=2025Costa_PERC&amp;diff=5201&amp;oldid=prev"/>
		<updated>2026-04-30T06:51:09Z</updated>

		<summary type="html">&lt;p&gt;Created page with &amp;quot;== Citation ==  Costa-Gomes, B., Greer, J., Juraschko, N., Parkhurst, J., Mirecka, J., Famili, M., Rangel-Smith, C., Strickson, O., Lowe, A., Basham, M. and others 2025. PERC: a suite of software tools for the curation of cryoEM data with application to simulation, modeling and machine learning. Structural Biology and Crystallization Communications. 81, 10 (2025).  == Abstract ==  Ease of access to data, tools and models expedites scientific research. In structural biolo...&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;== Citation ==&lt;br /&gt;
&lt;br /&gt;
Costa-Gomes, B., Greer, J., Juraschko, N., Parkhurst, J., Mirecka, J., Famili, M., Rangel-Smith, C., Strickson, O., Lowe, A., Basham, M. and others 2025. PERC: a suite of software tools for the curation of cryoEM data with application to simulation, modeling and machine learning. Structural Biology and Crystallization Communications. 81, 10 (2025).&lt;br /&gt;
&lt;br /&gt;
== Abstract ==&lt;br /&gt;
&lt;br /&gt;
Ease of access to data, tools and models expedites scientific research. In&lt;br /&gt;
structural biology there are now numerous open repositories of experimental&lt;br /&gt;
and simulated data sets. Being able to easily access and utilize these is crucial&lt;br /&gt;
to allow researchers to make optimal use of their research effort. The tools&lt;br /&gt;
presented here are useful for collating existing public cryoEM data sets and/or&lt;br /&gt;
creating new synthetic cryoEM data sets to aid the development of novel data&lt;br /&gt;
processing and interpretation algorithms. In recent years, structural biology has&lt;br /&gt;
seen the development of a multitude of machine-learning-based algorithms to&lt;br /&gt;
aid numerous steps in the processing and reconstruction of experimental data&lt;br /&gt;
sets and the use of these approaches has become widespread. Developing such&lt;br /&gt;
techniques in structural biology requires access to large data sets, which can be&lt;br /&gt;
cumbersome to curate and unwieldy to make use of. In this paper, we present a&lt;br /&gt;
suite of Python software packages, which we collectively refer to as PERC&lt;br /&gt;
(profet, EMPIARreader and CAKED). These are designed to reduce the burden&lt;br /&gt;
which data curation places upon structural biology research. The protein&lt;br /&gt;
structure fetcher (profet) package allows users to conveniently download and&lt;br /&gt;
cleave sequences or structures from the Protein Data Bank or AlphaFold&lt;br /&gt;
databases. EMPIARreader allows lazy loading of Electron Microscopy Public&lt;br /&gt;
Image Archive data sets in a machine-learning-compatible structure. The Class&lt;br /&gt;
Aggregator for Key Electron-microscopy Data (CAKED) package is designed&lt;br /&gt;
to seamlessly facilitate the training of machine-learning models on electron&lt;br /&gt;
microscopy data, including electron-cryo-microscopy-specific data augmentation&lt;br /&gt;
and labeling. These packages may be utilized independently or as building&lt;br /&gt;
blocks in workflows. All are available in open-source repositories and designed&lt;br /&gt;
to be easily extensible to facilitate more advanced workflows if required.&lt;br /&gt;
&lt;br /&gt;
== Keywords ==&lt;br /&gt;
&lt;br /&gt;
== Links ==&lt;br /&gt;
&lt;br /&gt;
https://journals.iucr.org/f/issues/2025/10/00/ir5046/index.html&lt;br /&gt;
&lt;br /&gt;
== Related software ==&lt;br /&gt;
&lt;br /&gt;
== Related methods ==&lt;br /&gt;
&lt;br /&gt;
== Comments ==&lt;/div&gt;</summary>
		<author><name>WikiSysop</name></author>
	</entry>
</feed>