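I have downloaded data from ARCHS4 RNA-seq: the human HDF5 file (28 GB). I want to access the expression data and the group information. I am using the code below: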
h5_exprs <- h5read("archs4_gene_human_v2.1.2.h5", "data/expression")
It throws:
Error (scratch_11.R#9): Error in h5checktype(). H5Identifier not valid.
What additional steps should I take to troubleshoot this?
When I run h5ls("archs4_gene_human_v2.1.2.h5"), the output looks like this:
group name otype dclass dim
0 / data H5I_GROUP
1 /data expression H5I_DATASET INTEGER 620825 x 62548
2 / meta H5I_GROUP
3 /meta genes H5I_GROUP
4 /meta/genes gene_symbol H5I_DATASET STRING 62548
5 /meta samples H5I_GROUP
6 /meta/samples aligned_reads H5I_DATASET INTEGER 620825
7 /meta/samples channel_count H5I_DATASET STRING 620825
8 /meta/samples characteristics_ch1 H5I_DATASET STRING 620825
9 /meta/samples contact_address H5I_DATASET STRING 620825
10 /meta/samples contact_city H5I_DATASET STRING 620825
11 /meta/samples contact_country H5I_DATASET STRING 620825
12 /meta/samples contact_institute H5I_DATASET STRING 620825
13 /meta/samples contact_name H5I_DATASET STRING 620825
14 /meta/samples contact_zip H5I_DATASET STRING 620825
15 /meta/samples data_processing H5I_DATASET STRING 620825
16 /meta/samples extract_protocol_ch1 H5I_DATASET STRING 620825
17 /meta/samples geo_accession H5I_DATASET STRING 620825
18 /meta/samples instrument_model H5I_DATASET STRING 620825
19 /meta/samples last_update_date H5I_DATASET STRING 620825
20 /meta/samples library_selection H5I_DATASET STRING 620825
21 /meta/samples library_source H5I_DATASET STRING 620825
22 /meta/samples library_strategy H5I_DATASET STRING 620825
23 /meta/samples molecule_ch1 H5I_DATASET STRING 620825
24 /meta/samples organism_ch1 H5I_DATASET STRING 620825
25 /meta/samples platform_id H5I_DATASET STRING 620825
26 /meta/samples relation H5I_DATASET STRING 620825
27 /meta/samples series_id H5I_DATASET STRING 620825
28 /meta/samples singlecellprobability H5I_DATASET FLOAT 620825
29 /meta/samples source_name_ch1 H5I_DATASET STRING 620825
30 /meta/samples sra_id H5I_DATASET STRING 620825
31 /meta/samples status H5I_DATASET STRING 620825
32 /meta/samples submission_date H5I_DATASET STRING 620825
33 /meta/samples taxid_ch1 H5I_DATASET STRING 620825
34 /meta/samples title H5I_DATASET STRING 620825
35 /meta/samples type H5I_DATASET STRING 620825
1 answer
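I'm not sure what causes this error. I haven't downloaded the whole 28 GB file, but I was able to read a subset of the /data/expression dataset directly from S3 storage. Some thoughts:
Are you sure the h5read() command is indeed the command on line 9 of scratch_11.R?
Try running h5errorHandling(type = "verbose") before h5read(); this will give a larger HDF5 error stack and may help narrow down the problem.
If memory were the problem, I would expect an "unable to allocate vector of size ..." error instead.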
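The suggestions above can be combined into a short diagnostic sketch. It is not a confirmed fix, only one way to narrow things down. It assumes the file sits in the working directory under the name used in the question, takes the dimensions from the h5ls() output, and picks the block sizes (100 rows, 10 columns) arbitrarily for illustration. Going by the lengths in that output, rows of /data/expression appear to line up with the sample metadata and columns with the gene symbols.

```r
# Size of the full matrix if read in one go: R stores integers in 4 bytes.
# Dimensions copied from the h5ls() output in the question.
n_rows <- 620825
n_cols <- 62548
full_size_gib <- n_rows * n_cols * 4 / 1024^3
message(sprintf("Reading all of /data/expression needs about %.0f GiB of RAM",
                full_size_gib))

h5_file <- "archs4_gene_human_v2.1.2.h5"  # assumed to be in the working directory

if (requireNamespace("rhdf5", quietly = TRUE) && file.exists(h5_file)) {
  library(rhdf5)

  # Print the full HDF5 error stack instead of the short message.
  h5errorHandling(type = "verbose")

  # Read only a small block. The index list has one entry per dimension,
  # in the same order as the dims reported by h5ls().
  exprs_subset <- h5read(h5_file, "data/expression",
                         index = list(1:100, 1:10))

  # Matching metadata for the same rows and columns.
  sample_ids   <- h5read(h5_file, "meta/samples/geo_accession",
                         index = list(1:100))
  gene_symbols <- h5read(h5_file, "meta/genes/gene_symbol",
                         index = list(1:10))

  print(dim(exprs_subset))

  # Release any HDF5 handles that are still open.
  h5closeAll()
}
```

The full matrix works out to roughly 145 GiB, so reading it with a plain h5read() call is unlikely to succeed on most machines even once the identifier error is resolved. Reading by index, one block of samples at a time, keeps memory use bounded.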