# 基于算法共轭梯度法的检点恢复方法CS Distributed, Parallel, and Cluster Computing

Reducing the frequency at which redundant information is stored lessens the runtime overhead. However, after the node failure, the solver must restart from the last iteration for which redundant information was stored, which increases recovery overhead. This formulation highlights the method's similarities to checkpoint-restart (CR). Thus, this method, which we call ESR with periodic storage (ESRP), can be considered a form of algorithm-based checkpoint-restart. The state is stored implicitly, by exploiting redundancy inherent to the algorithm, rather than explicitly as in CR. We also minimize the amount of data to be stored and retrieved compared to CR, but additional computation is required to reconstruct the solver's state. In this paper, we describe the necessary modifications to ESR to convert it into ESRP, and perform an experimental evaluation.

We compare ESRP experimentally with previously-existing ESR and application-level in-memory CR. Our results confirm that the overhead for ESR is reduced significantly, both in the failure-free case, and if node failures are introduced. In the former case, the overhead of ESRP is usually lower than that of CR. However, CR is faster if node failures happen. We claim that these differences can be alleviated by the implementation of more appropriate preconditioners.

0 条评论

• ### 学会在未知需求的情况下对车辆服务进行定价（CS GT）

车辆服务提供者根据用户对不同出发地和目的地的出行需求来确定服务价格，可能会有利可图。之前关于车辆服务空间定价的研究都是基于供应商知道用户需求的假设。在本文中，我...

• ### 多边形中最大的三角形（CS CG）

我们研究了如何寻找可以在平面中多边形内接的最大面积三角形的问题。 我们考虑了该问题的八个版本：我们使用凸多边形或简单多边形作为容器；我们要求三角形的一个角具有固...

• ### 分区着色问题的复杂性（CS CC）

给定一个简单的无定向图G=(V,E)，并将顶点集V分成p个部分，分区着色问题为如何从分区的每个部分中选择一个顶点，使p个被选择的顶点上诱导的子图的色数受k的约束...

• ### Knapsack problem algorithms for my real-life carry-on knapsack

I'm a nomad and live out of one carry-on bag. This means that the total weight o...

• ### 基于双眼视觉的高精度无人机目标定位系统（CS CV）

在工作过程中，无人驾驶车辆常常需要高精度地定位目标。在无人材料搬运车间中，无人车辆需要对工件进行高精度的姿态估计以准确地抓住工件。在此背景下，本文提出了一种基于...

• ### Write your own Excel in 100 lines of F#

I've been teaching F# for over seven years now, both in the public F# FastTrack ...

• ### 【量化精品】通过LSTM神经网络进行时序预测针对股票市场（附Python源码）

阅读原文 Neural Networks these days are the “go to” thing when talking about new fad...

• ### 卷积神经网络在艺术图像中的迁移学习分析(CS CV)

从巨大的自然图像数据集中转移学习，深度神经网络的微调和使用相应的预训练网络已经成为事实上的艺术分析应用的核心。然而，人们对迁移学习的影响仍然知之甚少。在本文中，...

• ### 学界 | 百度提出问答模型GNR：检索速度提高25倍

选自Baidu Research 作者：Jonathan Raiman & John Miller 机器之心编译 参与：刘晓坤、李泽南、蒋思源 近日，百度人工智...

• ### Pytorch分布式训练错误

subprocess.CalledProcessError: Command ‘[’/home/labpos/anaconda3/envs/idr/bin/py...