Agrypnia genome assembly submitted to NCBI. Contamination was filtered out using BlobTools.
Source: https://figshare.com/articles/dataset/agrypnia_filtered_fasta/13383092/1
1000 Genomes gVCF mapped to hs37d5 for NA18504. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA18504/7944293/1
1000 Genomes gVCF mapped to hs37d5 for HG01624. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01624/7895774/1
Academic Data and Datasets
Dec 19, 2021, by Andrew D. Richardson; David Y. Hollinger; Julie Shoemaker; Holly Hughes; Kathleen Savage; Eric A. Davidson
Carbon dioxide (CO2), methane (CH4), and nitrous oxide (N2O) are the greenhouse gases largely responsible for anthropogenic climate change. Natural plant and microbial metabolic processes play a major role in the global atmospheric budget of each. We have been studying ecosystem-atmosphere trace gas exchange at a sub-boreal forest in the northeastern United States for over two decades. Historically our emphasis was on turbulent fluxes of CO2 and water vapor. In 2012 we embarked on an...
Source: https://figshare.com/articles/dataset/Tower-_and_chamber-based_greenhouse_gas_flux_measurements_from_Howland_Forest_Maine_2012-2018_/7445657/1
1000 Genomes gVCF mapped to hs37d5 for HG03452. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03452/7901339/1
Dec 18, 2021, by Xiaomei Shu; David P. Livingston III; Charles P. Woloshuk; Gary A. Payne
Developing maize seeds were inoculated with Aspergillus flavus and collected at 4 hours post inoculation
Source: https://figshare.com/articles/dataset/Af_4hpi_rep1/5603281/1
1000 Genomes gVCF mapped to hs37d5 for NA20786. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA20786/7944296/1
Each directory stores 12 trials, with the number of zebrafish given in the name of the directory (1 AB is for 12 trials of 1 zebrafish AB). For the directory 20AB, the data show only the positions of the individuals without their identities. The first column is the time; the following columns come in pairs (x position, then y position). For the directories 2AB, 3AB, 5AB, 7AB and 10AB, the data are sorted into gap and no gap. Gap means there are NaN values in the files when the tracker cannot identify the...
Source: https://figshare.com/articles/dataset/seguret_dryad_2017_rar/5151247/1
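The column layout described for the zebrafish trajectory files (time first, then x/y pairs, with NaN where the tracker left a gap) can be read with a few lines of Python. This is only a sketch: the whitespace delimiter, the sample row, and the function names `parse_row` and `has_gap` are assumptions for illustration and are not taken from the dataset.

```python
import math


def parse_row(line):
    """Split one row into (time, [(x, y), ...]).

    Assumes whitespace-separated numeric columns (an assumption; the
    actual delimiter in the files may differ).
    """
    values = [float(token) for token in line.split()]
    time, coords = values[0], values[1:]
    if len(coords) % 2 != 0:
        raise ValueError("expected an even number of coordinate columns")
    # Columns go by two: x position, then y position, per individual.
    positions = list(zip(coords[0::2], coords[1::2]))
    return time, positions


def has_gap(positions):
    """Return True if any individual has a NaN coordinate in this row."""
    return any(math.isnan(x) or math.isnan(y) for x, y in positions)


# Hypothetical row with two individuals, the second one untracked.
t, pos = parse_row("0.04 10.5 20.25 nan nan")
```

A row from a "no gap" file would make `has_gap` return `False` for every row under this reading of the description.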
MODIS SST 15-year time series in greater Kotzebue Sound
Source: https://figshare.com/articles/dataset/sst_modis_day_nc/7423247/1
1000 Genomes gVCF mapped to hs37d5 for HG03874. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03874/7928795/1
1000 Genomes gVCF mapped to hs37d5 for NA19185. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA19185/7927103/1
English - code parallel corpus from StackOverflow
Source: https://figshare.com/articles/dataset/SOParallelData_zip/7673927/1
Supplementary data for the metaseq manuscript. This dataset is intended to be downloaded and unpacked automatically, along with the rest of the supplemental data files, using the download-metaseq-supplemental.py script found at http://figshare.com/download/file/1577683 .
Source: https://figshare.com/articles/dataset/metaseq_supplemental_data_ChIP_seq_BAMs_BG3_Shep_/1092594/2
Single computed slice through a tomographic reconstruction of a protoplasmic astrocyte in a 0.5 um thick section from the hippocampus of a 1 month old male mouse, imaged with intermediate voltage electron microscopy. This reconstruction is the 14th in a series of 26 serial reconstructions through the cell soma. The complete reconstruction can be viewed under MP7503.
Source: https://cellimagelibrary.figshare.com/articles/dataset/CCDB_6727_jpg/8179529/1
Computational methods that automatically extract knowledge from data are critical for enabling data-driven materials science. A reliable identification of lattice symmetry is a crucial first step for materials characterization and analytics. Current methods require a user-specified threshold, and are unable to detect "average symmetries" for defective structures. Here, we propose a new machine-learning-based approach to automatically classify structures by crystal symmetry. First, we...
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/ZDKBRF&version=2.0
Dec 16, 2021, by Ian Hinder; Kidder, Larry; Pfeiffer, Harald; Scheel, Mark; Boyle, Michael; Hemberger, Dan; Lovelace, Geoffrey; Szilagyi, Bela
Simulation of a black-hole binary system evolved by the SpEC code.
Source: https://zenodo.org/record/2625759
Dec 16, 2021, by Raphaël Nussbaumer; Lionel Benoit; Grégoire Mariethoz; Felix Liechti; Silke Bauer; Baptiste Schmid
This dataset contains the interpolated values of bird density and bird flight speed (N-S and E-W) resulting from the methodology presented in [reference]. The methodology is explained in less detail at rafnuss-postdoc.github.io/BMM. The resulting interpolation is a probability distribution (defining the probability of each value occurring). Only the median and the 10% and 90% quantiles are given in this file. The spatio-temporal grid has a resolution of 0.2° in latitude (43°-68°) and longitude...
Source: https://zenodo.org/record/3243466
Simulation of a black-hole binary system evolved by the SpEC code.
Source: https://zenodo.org/record/3307421
Log and boot files of game 436
Source: https://zenodo.org/record/3341279
Wbbyyr: FastText language models for Mandarin Chinese, trained on 14,440,000 Sina Weibo posts for each year in 2012-2018. The 14,440,000 posts from each year are split into 10 folds. Due to Zenodo size limit, this dataset contains only the first fold from each year. Each model is trained for 20 iterations. Each vector is 300 dimensions long.
Source: https://zenodo.org/record/3605209
Jan 13, 2022, by Manuel de Pedro; Miquel Riba; Santiago González-Martínez; Pedro Seoane; Rocío Bautista; M. Gonzalo Claros; Maria Mayol
This VCF file includes the filtered SNP dataset (168,733 SNPs) for 238 Leontodon longirostris samples and 20 samples of the outgroup sister species Leontodon saxatilis.
Source: https://figshare.com/articles/dataset/Leontodon_vcf/12903848/2
DNS data in forced turbulence for case 1-part a
Source: https://figshare.com/articles/dataset/entrainment3df_case1_tar_gzaa/5821086/1
1000 Genomes gVCF mapped to hs37d5 for HG02554. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG02554/7931255/1
The complete mitochondrial genome of Aesop slipper lobster Scyllarides haanii (De Haan, 1841) sequencing cleandata
Source: https://figshare.com/articles/dataset/The_complete_mitochondrial_genome_of_Aesop_slipper_lobster_Scyllarides_haanii_De_Haan_1841_sequencing_cleandata/12805703/1
Gene matrix for CESC
Source: https://figshare.com/articles/dataset/Gene_matrix_for_CESC/12751418/1
Dec 18, 2021, by Emerson M. A. Xavier; Francisco J. Ariza-López; Manuel A. Ureña-Cámara
This file is part of the MatchingLand testbed and contains the datasets of systematic disturbance group (only point data). The systematic disturbance group is formed by synthetic datasets generated from a set of affine transformations over the initial datasets. The datasets are in Shapefile format and coordinate reference system EPSG:32628.
Source: https://springernature.figshare.com/articles/dataset/MatchingLand_-_systematic_disturbance_point_data/4658770/1
Raw fastq reads (contaminated with yeast); Sanger 16S .ab1; RAST 16S fasta; RDP alignment; cleaned-up alignment; FastTree phylogenetic tree; FastTree rerooted phylogenetic tree
Source: https://figshare.com/articles/dataset/From_Swab_to_Publication_Sample_Data_Tatumella_/1064368/1
1000 Genomes gVCF mapped to hs37d5 for HG03558. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03558/7907651/1
Mass cytometry data of BM and HSPC cells.
Source: https://figshare.com/articles/dataset/Original_fcs_files_of_mass_cytometry/16528836/2
The data for a manuscript submitted to Geophysical Research Letters. CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/TZM7CR&version=1.0
The MOOD project (MOnitoring Outbreak events for Disease surveillance in a data science context. H2020) has geo-referenced the data Google has published as a series of PDF files presenting reports on national and subnational human mobility levels relative to a baseline data of late January 2020. The details and the PDF files can be found at https://www.google.com/covid19/mobility/ . More detail on these files can be found at https://www.moodspatialdata.com/humanmobilityforcovid19 The first set...
Source: https://figshare.com/articles/dataset/Maps_of_human_mobility_change_during_the_COVID-19_outbreak/12130980/65
Dec 17, 2021, by Jiapeng Qu; Fabian Ewald Fassnacht; Christopher Schiller; Teja Kattenborn; Xinquan Zhao
Landsat based peak NDVI image Year 1997 Tile 14
Source: https://springernature.figshare.com/articles/dataset/Landsat_based_peak_NDVI_image_Year_1997_Tile_14/7769753/1
The National Software Reference Library (NSRL) is designed to collect software from various sources and incorporate file profiles computed from this software into a Reference Data Set (RDS) of information. The RDS can be used by law enforcement, government, and industry organizations to review files on a computer by matching file profiles in the RDS. This will help alleviate much of the effort involved in determining which files are important as evidence on computers or file systems that have...
Simulation of a black-hole binary system evolved by the SpEC code.
Source: https://zenodo.org/record/2641306
Simulation of a black-hole binary system evolved by the SpEC code.
Source: https://zenodo.org/record/3303264
XAW
Source: https://figshare.com/articles/dataset/Cannabis_Microbiome_Raw_Sequence_Data_xaw/928621/1
Huntington referred to a ‘clash of civilizations’ revealing itself in international terrorism, particularly in the clash between the Islamic civilization and the West. The authors confront his hypotheses with ones derived from the strategic logic of international terrorism. They predict more terrorism against nationals from countries whose governments support the government of the terrorists’ home country. Like Huntington, they also predict excessive terrorism on Western targets, not...
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/HOGQGD&version=1.0
1000 Genomes gVCF mapped to hs37d5 for NA18864. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA18864/7890737/1
Fold 7 test set
Source: https://figshare.com/articles/dataset/test7_zip/13372976/1
sub-18
Source: https://figshare.com/articles/dataset/sub-18_rar/16564113/1
Zn-atz-oba gas sorption data
Source: https://figshare.com/articles/dataset/Zn-atz-oba_gas_sorption_data_Exp_Sim_xlsx/16571151/2
Trajectories of WW domain. Trajectories data.
Source: https://figshare.com/articles/dataset/GTT-1-protein-211_dcdWW_domain_trajectories/12163065/2
1000 Genomes gVCF mapped to hs37d5 for HG00097.
Source: https://figshare.com/articles/dataset/gVCF_HG00097/7841411/1
This replication archive contains all data and code to replicate the results in "Measuring Political Positions from Legislative Speech" by Benjamin E. Lauderdale and Alexander Herzog. Article abstract: Existing approaches to measuring political disagreement from text data perform poorly except when applied to narrowly selected texts discussing the same issues and written in the same style. We demonstrate the first viable approach for estimating legislator-specific scores from the...
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/RQMIV3&version=1.0
tar file with numerical Euler solution and processed data used in plotting the pressure fields
Source: https://rs.figshare.com/articles/dataset/Euler_solution_at_t_0_0031_from_A_fluid_mechanic_s_analysis_of_the_teacup_singularity/12739523/1
- Consumer demand for 3 dishes: Spaghetti Bolognese, meatballs with rice and peas, buns with sausage
- Individually adapted choice based conjoint
- Structure of data
  + 1/3 raw data (wide data)
  + 2/3 long data (formatted to long)
  + 3/3 analysed_data (labelled and modelled data)
  + 3/3 labelled data (=analysed data with label of each Variable in 2nd row)
- We also provide the original survey: survey.pdf
- and the coding in Stata 17:
  + 1/3 how raw data was created
  + 2/3 how data was transformed from wide to...
Source: https://data.goettingen-research-online.de/dataset.xhtml?persistentId=doi:10.25625/MZQGOO&version=1.0
1000 Genomes gVCF mapped to hs37d5 for HG01950. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01950/7911317/1
Trajectories of WW domain. Trajectories data.
Source: https://figshare.com/articles/dataset/GTT-1-protein-275_dcdWW_domain_trajectories/12163527/2
NCSU_43360 Swedish Malaise trap project 18-Aug-2003 Sweden Smaland Almhults kommun Stenbrohult, Djaknabygds bokback, Heath with old beeches 56.60910034 14.19309998
Source: https://figshare.com/articles/dataset/Male_genitalia_of_Conostigmus_geniculatus_dorsal_view_PSUCIM_4336_/100988/2
James et al. (2020) - Mitigating systematic error in topographic models for geomorphic change detection: Accuracy, precision and considerations beyond off-nadir imagery UAV-collected image dataset and associated image coordinates of GCP observations for surveys of la Borgne d'Arolla. See paper for details. 60m_10degr_3 : UAV image data File formats for image observations: .xml file contains GCP image observations (and other data) exported from Photoscan v1.4.2. Can be imported back into...
Source: https://figshare.com/articles/dataset/James_et_al_2020_BdA_60m_10degr_3/11786754/1
Dec 18, 2021, by Maria Izabel Cavassim; Sara Moeskjaer; Bryden Fields; Asger Bachmann; Bjarni Vilhjálmsson; Mikkel H Schierup; J. Peter W. Young; Stig Uggerhøj Andersen
Supplementary material (Tables and Figures) for the article: Cavassim et al. 2020: Symbiosis genes show a unique pattern of introgression and selection within a Rhizobium leguminosarum species complex. Data.zip comprises gene alignments and SNP matrices of a Rhizobium complex. Further detail is found in the article https://doi.org/10.1099/mgen.0.000351
Source: https://figshare.com/articles/dataset/Gene_alignments_and_SNP_matrices_of_a_Rhizobium_complex/11568894/5
100-dimensional word2vec CBOW negative sampling word embeddings for the Cyrillic Uzbek language. Trained using the webcrawl corpus v1.
Source: https://figshare.com/articles/dataset/uzb-cyrl-webcrawl-v1-word2vec-cbow-ns-100d/12991472/2
Human brain tissues were obtained from the Wuhan brain bank in accordance with the brain bank protocol. Ethical agreements were obtained from the donors or their relatives by written informed consent. Total RNA was isolated from the frozen prefrontal cortex tissue using the Trizol (Invitrogen, USA) protocol with no modifications. Low molecular weight RNA was isolated, ligated to the adapters, amplified, and sequenced following the Small RNA preparation protocol (Illumina, USA) with no...
Source: https://figshare.com/articles/dataset/human_brain_smRNA_seq_bz2/6893138/1
This is the second part of a data set from September 2004 of low temperature STM data.
Source: https://figshare.com/articles/dataset/Part_2_of_LTSTM_data_set_from_Sept_2004/1284306/1
This Landsat scene is an example used in the sagebrush-ecosystem-modeling GitHub workflow for NEON's Onaqui Mountains (ONAQ) site. Landsat-8 image courtesy of the U.S. Geological Survey
Source: https://figshare.com/articles/dataset/Landsat-L1-038_032-201710-ONAQ-scene/12525548/2
Is there more violence in the middle? Over 100 studies have analyzed whether violent outcomes such as civil war, terrorism, and repression are more common in regimes that are neither full autocracies nor full democracies, yet findings are inconclusive. While this hypothesis is ultimately about functional form, existing work uses models in which a particular functional form is assumed. Existing work also uses arbitrary operationalizations of “the middle”. This paper aims to resolve the...
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/LNUYXZ&version=1.1
Scholars, practitioners, and pundits often leave their assessments of uncertainty vague when debating foreign policy, arguing that clearer probability estimates would provide arbitrary detail instead of useful insight. We provide the first systematic test of this claim using a data set containing 888,328 geopolitical forecasts. We find that coarsening numeric probability assessments in a manner consistent with common qualitative expressions—including expressions currently recommended for use...
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/D9FAZL&version=1.0
This is a mirror of the Mapping Police Violence spreadsheet, as downloaded from https://mappingpoliceviolence.org/ on 2020-06-01. See that site for links to reporting, to donate, and for more context in general. A mirror of the website is also preserved in wayback: http://web.archive.org/web/20200602001333/https://mappingpoliceviolence.org/
Subject 1 Session Trait from Appelgren and Bengtsson, Feedback on Trait or Action Impacts on Caudate and Paracingulum Activity, Plos one 2015
Source: https://figshare.com/articles/dataset/S1_2/1422089/2
1000 Genomes gVCF mapped to hs37d5 for HG01797. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01797/7901240/1
For inquiries about code and data, please contact Lu Shen (lshen@fas.harvard.edu) CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/MHN3NY&version=1.0
There exists no consensus as to what indicates state satisfaction with the systemic status quo even though it has been a widely used concept in the empirical literature on conflict. This is surprising because satisfaction is not a new concept in International Relations and has been accorded a central role in many theories of war. In this article, we present a measure of satisfaction based on the cost of money for sovereign borrowers and compare that measure to several leading indicators of...
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/9YEXKP&version=2.0
Dec 16, 2021, by Jiapeng Qu; Fabian Ewald Fassnacht; Christopher Schiller; Teja Kattenborn; Xinquan Zhao
Landsat based peak NDVI image Year 2006 Tile 12
Source: https://springernature.figshare.com/articles/dataset/Landsat_based_peak_NDVI_image_Year_2006_Tile_12/7769609/1
Dec 19, 2021, by Jiapeng Qu; Fabian Ewald Fassnacht; Christopher Schiller; Teja Kattenborn; Xinquan Zhao
Landsat based peak NDVI image Year 2003 Tile 10
Source: https://springernature.figshare.com/articles/dataset/Landsat_based_peak_NDVI_image_Year_2003_Tile_10/7769846/1
Simulation of a black-hole binary system evolved by the SpEC code.
Source: https://zenodo.org/record/2642028
Log and boot files of game 67
Source: https://zenodo.org/record/1325331
This is model output from GEPIC for wheat as part of AgMIP's Global Gridded Crop Model Intercomparison (GGCMI) phase 1 output data set. The data have been generated following the modeling protocol of Elliott et al. (2015) and have been used to evaluate the models (Müller et al., 2017). A data description paper has been published in Scientific Data (Müller et al. 2019). References: Elliott J, Müller C, Deryng D, Chryssanthacopoulos J, Boote KJ, Büchner M, Foster I, Glotter M, Heinke J, Iizumi...
Source: https://zenodo.org/record/1408571
Log and boot files of game 466
Source: https://zenodo.org/record/3341358
Dec 16, 2021, by Gary Tse; Christien Ka Hou Li; Sharen Lee; Rachel Wing Chuen Lai; Chengye Yin; Keith Leung; Andrew Li
Automated analysis of ECGs taken from patients with heart failure of different aetiologies
Source: https://zenodo.org/record/3471486
Log and boot files of game 263
Source: https://zenodo.org/record/3340812
This dataset contains protostome peptide sequences that have been used to test the validity of deuterostome specific orthologous groups. BLAST was used to find potential homologous protostome sequences that could invalidate the specificity of the deuterostome orthogroups in question.
Source: https://zenodo.org/record/2650166
Jan 13, 2022, by Martin Dorber; Koen Kuipers; Francesca Verones
LUCM map generated in "Dorber, M.; Kuipers, K.; Verones, F., Global characterization factors for terrestrial biodiversity impacts of future land inundation in Life Cycle Assessment. Science of The Total Environment 2019. (https://doi.org/10.1016/j.scitotenv.2019.134582)" to calculate Area (Ai,j) and area shares (pi,j). Values for land use types: 1 = Natural habitat; 2 = Managed forest; 3 = Pasture; 4 = Agriculture; 5 = Urban; 210 = Water
Source: https://figshare.com/articles/dataset/LUCM_Map_Ratser_File/11106575/1
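The land use codes listed for the LUCM map can be turned into class shares with a small lookup table. The sketch below is illustrative only: it assumes every raster cell covers the same area (which may not hold for the actual map projection), works on a plain list of cell values rather than the raster file itself, and uses the hypothetical names `LAND_USE_CLASSES` and `class_shares`. It is not the procedure used in the cited article.

```python
from collections import Counter

# Code-to-label mapping as given in the dataset description.
LAND_USE_CLASSES = {
    1: "Natural habitat",
    2: "Managed forest",
    3: "Pasture",
    4: "Agriculture",
    5: "Urban",
    210: "Water",
}


def class_shares(cell_values):
    """Return the fraction of cells per land use label.

    Assumes equal-area cells (an assumption). Codes missing from the
    mapping are reported under an "Unknown (<code>)" label.
    """
    if not cell_values:
        return {}
    counts = Counter(
        LAND_USE_CLASSES.get(value, f"Unknown ({value})") for value in cell_values
    )
    total = len(cell_values)
    return {label: count / total for label, count in counts.items()}


# Hypothetical cell values, not taken from the raster.
shares = class_shares([1, 1, 3, 210])
```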
Trajectories of WW domain. Trajectories data.
Source: https://figshare.com/articles/dataset/GTT-1-protein-249_dcdWW_domain_trajectories/12163404/2
This repository contains:
1. Starting peptide data, R scripts to process them, and the resulting quantitative protein data
2. R scripts to process the genetic data for the Collaborative Cross and Diversity Outbred mice for further analysis (e.g., QTL, mediation, heritability)
3. R scripts to run all analyses reported in manuscript
4. Results data
5. R scripts to reproduce all figures in manuscript
Source: https://figshare.com/articles/dataset/Collaborative_Cross_and_Diversity_Outbred_liver_proteomics_manuscript/12818717/1
Feb 19, 2022, by Kamil Khanipov; George Golovko; Yuriy Fofanov
Multidimensional Patterns in Oral Microbiome Data
Source: https://figshare.com/articles/dataset/Multidimensional_Patterns_in_Oral_Microbiome_Data/15075162/2