我正在寻找aggregate()函数的并行版本,并且看起来和正是我正在寻找的。
作为测试,我创建了一个包含10m条记录的数据集
blockSize <- 5000
records <- blockSize * 2000
df <- data.frame(id=1:records, value=rnorm(records))
df$period <- round(df$id/blockSize)
# now I want to aggregate by period and return mean of every block:
x <- aggregate(val
我正在开发一个包,它建议R的并行包(所以我的描述有一个Suggests: parallel (>= 1.13.1)语句。它在OSX和Linux下编译得很好,但在windows (使用)构建时失败了。这里是install.log的终结,win-builder会发出这样的结论:
*** installing help indices
** building package indices
** installing vignettes
** testing if installed package can be loaded
*** arch - i386
Error: package or
我的测试源码是:
volatile int gl = 0;
void * InternalHandler( void * param )
{
for ( int i = 0; i < 100000; ++i ) { ++gl; }
return 0;
}
int main()
{
pthread_t ths[100] = { 0 };
for ( int i = 0; i < 100; ++i)
{
pthread_create( &ths[ i ], 0, InternalHandler, 0 );
}
for ( int i =