Flink1.4 如何使用状态

smartsi

发布于 2019-08-07 08:33:23

1.1K0

发布于 2019-08-07 08:33:23

文章被收录于专栏：SmartSi

1. Keyed State 与 Operator State

Flink有两种基本的状态：Keyed State和Operator State。

1.1 Keyed State

Keyed State总是与key相关，只能在与KeyedStream相关的函数和算子中使用。

KeyedStream 继承 DataStream，表示根据指定的key进行分组的数据流。使用DataStream提供的KeySelector根据key对其上的算子State进行分区。 DataStream支持的典型操作也可以在KeyedStream上进行，除了诸如shuffle，forward和keyBy之类的分区方法之外。 KeyedStream可以通过调用DataStream.keyBy()来获得。而在KeyedStream上进行任何transformation都将转变回DataStream。

你可以将 Keyed State 视为已经分区或分片的Operator State，每个 key 对应一个状态分区。每个Keyed State在逻辑上只对应一个 <并行算子实例，key>，并且由于每个 key “只属于” 一个Keyed Operator的一个并行实例，我们可以简单地认为成 <operator，key>。

Keyed State 被进一步组织成所谓的 Key Group。Key Group 是 Flink 可以分配 Keyed State 的最小原子单位；Key Group的数量与最大并行度一样多。在执行期间，Keyed Operator的每个并行实例都与一个或多个Key Group的key一起工作。

1.2 Operator State

使用Operator State (或非Keyed State)，每个算子状态都绑定到一个并行算子实例。Kafka Connector 是在Flink中使用算子状态的一个很好的例子。Kafka消费者的每个并行实例都要维护一个topic分区和偏移量的map作为其Operator State。

在并行度发生变化时，Operator State接口支持在并行算子实例之间进行重新分配状态。可以有不同的方案来处理这个重新分配。

2. Raw State 与 Managed State

Keyed State和Operator State以两种形式存在：托管状态Managed State和原生状态Raw State。

Managed State由Flink RunTime控制的数据结构表示，如内部哈希表或RocksDB。例如ValueState，ListState等。Flink RunTime对状态进行编码并将它们写入检查点。

Raw State是指算子保留在它们自己数据结构中的状态。当 Checkpoint 时，他们只写入一个字节序列到检查点中。Flink对状态的数据结构一无所知，只能看到原始字节。

所有数据流函数都可以使用Managed State，但Raw State接口只能在实现算子时使用。建议使用Managed State（而不是Raw State），因为在Managed State下，Flink可以在并行度发生变化时自动重新分配状态，并且还可以更好地进行内存管理。

如果你的Managed State需要自定义序列化逻辑，请参阅相应的指南以确保将来的兼容性。Flink的默认序列化器不需要特殊处理。

3. 使用Managed Keyed State

Managed Keyed State接口提供了对不同类型状态的访问，这些状态的作用域为当前输入元素的key。这意味着这种类型的状态只能用于KeyedStream，可以通过stream.keyBy（...）创建。

现在，我们先看看可用状态的不同类型，然后我们会看到如何在程序中使用。可用状态有：

ValueState <T>：保存了一个可以更新和检索的值（如上所述，作用域为输入元素的key，所以每个key可能对应一个值）。该值可以使用update（T）来更新，使用T value（）来检索。
ListState <T>：保存了一个元素列表。可以追加元素并检索当前存储的所有元素的Iterable。使用add（T）添加元素，可以使用Iterable <T> get（）来检索Iterable。
ReducingState <T>：保存一个单一的值，表示添加到状态所有值的聚合。接口与ListState相同，但使用add（T）添加的元素时需要指定ReduceFunction。
AggregatingState <IN，OUT>：保存一个单一的值，表示添加到状态所有值的聚合。与ReducingState不同，聚合后的类型可能与添加到状态的元素类型不同。接口与ListState相同，但使用add（IN）添加到状态的元素使用指定的AggregateFunction进行聚合。
FoldingState <T，ACC>：保存一个单一的值，表示添加到状态所有值的聚合。与ReducingState不同，聚合后类型可能与添加到状态的元素类型不同。接口与ListState相同，但使用add（T）添加到状态的元素使用指定FoldFunction。
MapState ：保存了一个映射列表。可以将键值对放入状态，并检索当前存储的所有映射的Iterable。使用put（UK，UV）或putAll（Map <UK，UV>）添加映射。与用户key相关的值可以使用get（UK）来检索。映射，键和值的迭代视图可分别使用entries（），keys（）和values（）来检索。

所有类型的状态都有一个clear（）方法，它清除了当前活跃key的状态，即输入元素的key。

FoldingState和FoldingStateDescriptor已经在Flink 1.4中被弃用，将来会被彻底删除。请改用AggregatingState和AggregatingStateDescriptor。

请记住，这些状态对象仅能用于状态接口。状态没有必要一定存储在内存中，也可以保存在磁盘或其他地方。第二件要记住的是，你从状态获取的值取决于输入元素的key。因此，如果所使用的key不同，那你在一次用户函数调用中获得的值可能与另一次调用的不同。

为了得到一个状态句柄，你必须创建一个StateDescriptor。它包含了状态的名字（我们将在后面看到，你可以创建多个状态，必须有唯一的名称，以便引用它们），状态值的类型，以及用户自定义函数，如ReduceFunction。根据要检索的状态类型，你可以创建一个ValueStateDescriptor，ListStateDescriptor，ReducingStateDescriptor，FoldingStateDescriptor或MapStateDescriptor。

使用RuntimeContext来访问状态，所以只能在Rich函数中使用。请参阅这里了解有关信息，我们会很快看到一个例子。在RichFunction中可用的RuntimeContext具有下面访问状态的方法：

ValueState getState(ValueStateDescriptor)
ReducingState getReducingState(ReducingStateDescriptor)
ListState getListState(ListStateDescriptor)
AggregatingState getAggregatingState(AggregatingState)
FoldingState getFoldingState(FoldingStateDescriptor)
MapState getMapState(MapStateDescriptor)

下面是FlatMapFunction的一个例子：

Java版本:

public class CountWindowAverage extends RichFlatMapFunction<Tuple2<Long, Long>, Tuple2<Long, Long>> {

    /**
     * The ValueState handle. The first field is the count, the second field a running sum.
     */
    private transient ValueState<Tuple2<Long, Long>> sum;

    @Override
    public void flatMap(Tuple2<Long, Long> input, Collector<Tuple2<Long, Long>> out) throws Exception {

        // access the state value
        Tuple2<Long, Long> currentSum = sum.value();

        // update the count 个数
        currentSum.f0 += 1;

        // add the second field of the input value 总和
        currentSum.f1 += input.f1;

        // update the state
        sum.update(currentSum);

        // if the count reaches 2, emit the average and clear the state
        if (currentSum.f0 >= 2) {
            out.collect(new Tuple2<>(input.f0, currentSum.f1 / currentSum.f0));
            sum.clear();
        }
    }

    @Override
    public void open(Configuration config) {
        ValueStateDescriptor<Tuple2<Long, Long>> descriptor =
                new ValueStateDescriptor<>(
                        "average", // the state name
                        TypeInformation.of(new TypeHint<Tuple2<Long, Long>>() {}), // type information
                        Tuple2.of(0L, 0L)); // default value of the state, if nothing was set
        sum = getRuntimeContext().getState(descriptor);
    }
}

// this can be used in a streaming program like this (assuming we have a StreamExecutionEnvironment env)
env.fromElements(Tuple2.of(1L, 3L), Tuple2.of(1L, 5L), Tuple2.of(1L, 7L), Tuple2.of(1L, 4L), Tuple2.of(1L, 2L))
        .keyBy(0)
        .flatMap(new CountWindowAverage())
        .print();

// the printed output will be (1,4) and (1,5)

Scala版本:

class CountWindowAverage extends RichFlatMapFunction[(Long, Long), (Long, Long)] {

  private var sum: ValueState[(Long, Long)] = _

  override def flatMap(input: (Long, Long), out: Collector[(Long, Long)]): Unit = {

    // access the state value
    val tmpCurrentSum = sum.value

    // If it hasn't been used before, it will be null
    val currentSum = if (tmpCurrentSum != null) {
      tmpCurrentSum
    } else {
      (0L, 0L)
    }

    // update the count
    val newSum = (currentSum._1 + 1, currentSum._2 + input._2)

    // update the state
    sum.update(newSum)

    // if the count reaches 2, emit the average and clear the state
    if (newSum._1 >= 2) {
      out.collect((input._1, newSum._2 / newSum._1))
      sum.clear()
    }
  }

  override def open(parameters: Configuration): Unit = {
    sum = getRuntimeContext.getState(
      new ValueStateDescriptor[(Long, Long)]("average", createTypeInformation[(Long, Long)])
    )
  }
}


object ExampleCountWindowAverage extends App {
  val env = StreamExecutionEnvironment.getExecutionEnvironment

  env.fromCollection(List(
    (1L, 3L),
    (1L, 5L),
    (1L, 7L),
    (1L, 4L),
    (1L, 2L)
  )).keyBy(_._1)
    .flatMap(new CountWindowAverage())
    .print()
  // the printed output will be (1,4) and (1,5)

  env.execute("ExampleManagedState")
}

这个例子实现了一个穷人的计数窗口。我们通过第一个字段键入元组（在这个例子中都有相同的key为1）。该函数将计数和总和存储在ValueState中。一旦计数达到2，就输出平均值并清除状态，以便我们从0开始。注意，如果我们元组第一个字段具有不同值，那将为每个不同的输入key保持不同的状态值。

3.1 Scala DataStream API中的状态

除了上面介绍的接口之外，Scala API还具有在KeyedStream上使用单个ValueState的有状态map（）或flatMap（）函数的快捷方式。用户函数可以在Option获取ValueState的当前值，并且必须返回将用于更新状态的更新值。

val stream: DataStream[(String, Int)] = ...

val counts: DataStream[(String, Int)] = stream
  .keyBy(_._1)
  .mapWithState((in: (String, Int), count: Option[Int]) =>
    count match {
      case Some(c) => ( (in._1, c), Some(c + in._2) )
      case None => ( (in._1, 0), Some(in._2) )
    })

4. 使用Managed Operator State

要使用Managed Operator State，有状态函数可以实现更通用的CheckpointedFunction接口或ListCheckpointed <T extends Serializable>接口。

4.1 CheckpointedFunction

CheckpointedFunction接口提供了对有不同的重分配方案的非Keyed State的访问。它需要实现一下两种方法：

void snapshotState(FunctionSnapshotContext context) throws Exception;

void initializeState(FunctionInitializationContext context) throws Exception;

每当执行 Checkpoint 时，会调用snapshotState（）方法。每当用户自定义函数被初始化时，或当函数第一次初始化时，或者当函数从之前的检查点恢复时，initializeState（）方法被调用。鉴于此，initializeState（）不仅是初始化不同类型的状态的地方，而且还包括状态恢复逻辑的地方。

目前支持列表式的Managed Operator State。状态应该是一个可序列化的对象列表，相互间彼此独立，因此可以在扩展时重新分配。换句话说，这些对象可以在非Keyed State中重新分配比较细的粒度。根据状态访问方法，定义了以下重新分配方案：

Even-split redistribution: 每个算子都返回一个状态元素的列表。状态是逻辑上所有列表的连接。在恢复/重新分配时，列表被均分成与并行算子一样多的子列表。每个算子都可以得到一个可能为空或者包含一个或多个元素的子列表。例如，如果并行度为1，一个算子的检查点状态包含元素element1和element2，将并行度增加到2时，element1在算子实例0上运行，而element2将转至算子实例1。
Union redistribution: 每个算子都返回一个状态元素列表。状态是逻辑上所有列表的连接。在恢复/重新分配时，每个算子都可以获得全部的状态元素列表。

下面是一个有状态的SinkFunction的例子，它使用CheckpointedFunction在将元素输出到外部之前进行缓冲元素。它演示了基本的均分再分配列表状态：

Java版本:

public class BufferingSink
        implements SinkFunction<Tuple2<String, Integer>>,
                   CheckpointedFunction,
                   CheckpointedRestoring<ArrayList<Tuple2<String, Integer>>> {

    private final int threshold;

    private transient ListState<Tuple2<String, Integer>> checkpointedState;

    private List<Tuple2<String, Integer>> bufferedElements;

    public BufferingSink(int threshold) {
        this.threshold = threshold;
        this.bufferedElements = new ArrayList<>();
    }

    @Override
    public void invoke(Tuple2<String, Integer> value) throws Exception {
        bufferedElements.add(value);
        if (bufferedElements.size() == threshold) {
            for (Tuple2<String, Integer> element: bufferedElements) {
                // send it to the sink
            }
            bufferedElements.clear();
        }
    }

    @Override
    public void snapshotState(FunctionSnapshotContext context) throws Exception {
        checkpointedState.clear();
        for (Tuple2<String, Integer> element : bufferedElements) {
            checkpointedState.add(element);
        }
    }

    @Override
    public void initializeState(FunctionInitializationContext context) throws Exception {
        ListStateDescriptor<Tuple2<String, Integer>> descriptor =
            new ListStateDescriptor<>(
                "buffered-elements",
                TypeInformation.of(new TypeHint<Tuple2<Long, Long>>() {}));

        checkpointedState = context.getOperatorStateStore().getListState(descriptor);

        if (context.isRestored()) {
            for (Tuple2<String, Integer> element : checkpointedState.get()) {
                bufferedElements.add(element);
            }
        }
    }

    @Override
    public void restoreState(ArrayList<Tuple2<String, Integer>> state) throws Exception {
        // this is from the CheckpointedRestoring interface.
        this.bufferedElements.addAll(state);
    }
}

Scala版本:

class BufferingSink(threshold: Int = 0)
  extends SinkFunction[(String, Int)]
    with CheckpointedFunction
    with CheckpointedRestoring[List[(String, Int)]] {

  @transient
  private var checkpointedState: ListState[(String, Int)] = _

  private val bufferedElements = ListBuffer[(String, Int)]()

  override def invoke(value: (String, Int)): Unit = {
    bufferedElements += value
    if (bufferedElements.size == threshold) {
      for (element <- bufferedElements) {
        // send it to the sink
      }
      bufferedElements.clear()
    }
  }

  override def snapshotState(context: FunctionSnapshotContext): Unit = {
    checkpointedState.clear()
    for (element <- bufferedElements) {
      checkpointedState.add(element)
    }
  }

  override def initializeState(context: FunctionInitializationContext): Unit = {
    val descriptor = new ListStateDescriptor[(String, Int)](
      "buffered-elements",
      TypeInformation.of(new TypeHint[(String, Int)]() {})
    )

    checkpointedState = context.getOperatorStateStore.getListState(descriptor)

    if(context.isRestored) {
      for(element <- checkpointedState.get()) {
        bufferedElements += element
      }
    }
  }

  override def restoreState(state: List[(String, Int)]): Unit = {
    bufferedElements ++= state
  }
}

initializeState方法有一个FunctionInitializationContext参数。这用来初始化非keyed state“容器”。这是一个ListState类型的容器，非keyed state对象将在检查点时存储。

注意一下状态是如何被初始化，类似于keyed state状态，使用包含状态名称和状态值类型相关信息的StateDescriptor：

Java版本:

ListStateDescriptor<Tuple2<String, Integer>> descriptor =
    new ListStateDescriptor<>(
        "buffered-elements",
        TypeInformation.of(new TypeHint<Tuple2<Long, Long>>() {}));

checkpointedState = context.getOperatorStateStore().getListState(descriptor);

Scala版本:

val descriptor = new ListStateDescriptor[(String, Long)](
    "buffered-elements",
    TypeInformation.of(new TypeHint[(String, Long)]() {})
)

checkpointedState = context.getOperatorStateStore.getListState(descriptor)

状态访问方法的命名约定包含其重新分配模式及其状态结构。例如，要使用带有联合重新分配方案的列表状态进行恢复，请使用getUnionListState（descriptor）访问状态。如果方法名称不包含重新分配模式，例如 getListState（descriptor），这表示使用基本的均分重分配方案。

在初始化容器之后，我们使用上下文的isRestored（）方法来检查失败后是否正在恢复。如果是，即我们正在恢复，将会应用恢复逻辑。

如修改后的BufferingSink的代码所示，在状态初始化期间恢复的这个ListState被保存在类变量中，以备将来在snapshotState（）中使用。在那里ListState清除了前一个检查点包含的所有对象，然后用我们想要进行检查点的新对象填充。

Keyed State也可以在initializeState（）方法中初始化。这可以使用提供的FunctionInitializationContext完成。

4.2 ListCheckpointed

ListCheckpointed接口是CheckpointedFunction进行限制的一种变体，它只支持在恢复时使用均分再分配方案的列表样式状态。还需要实现以下两种方法：

List<T> snapshotState(long checkpointId, long timestamp) throws Exception;

void restoreState(List<T> state) throws Exception;

snapshotState()方法应该返回一个对象列表来进行checkpoint，而restoreState()方法在恢复时必须处理这样一个列表。如果状态是不可重分区的，则可以在snapshotState()中返回一个Collections.singletonList(MY_STATE)。

4.2.1 Stateful Source Functions

与其他算子相比，有状态的数据源需要得到更多的关注。为了能更新状态以及输出集合的原子性（在失败/恢复时需要一次性语义），用户需要从数据源的上下文中获取锁。

Java版本:

public static class CounterSource
        extends RichParallelSourceFunction<Long>
        implements ListCheckpointed<Long> {

    /**  current offset for exactly once semantics */
    private Long offset;

    /** flag for job cancellation */
    private volatile boolean isRunning = true;

    @Override
    public void run(SourceContext<Long> ctx) {
        final Object lock = ctx.getCheckpointLock();

        while (isRunning) {
            // output and state update are atomic
            synchronized (lock) {
                ctx.collect(offset);
                offset += 1;
            }
        }
    }

    @Override
    public void cancel() {
        isRunning = false;
    }

    @Override
    public List<Long> snapshotState(long checkpointId, long checkpointTimestamp) {
        return Collections.singletonList(offset);
    }

    @Override
    public void restoreState(List<Long> state) {
        for (Long s : state)
            offset = s;
    }
}

Scala版本:

class CounterSource
       extends RichParallelSourceFunction[Long]
       with ListCheckpointed[Long] {

  @volatile
  private var isRunning = true

  private var offset = 0L

  override def run(ctx: SourceFunction.SourceContext[Long]): Unit = {
    val lock = ctx.getCheckpointLock

    while (isRunning) {
      // output and state update are atomic
      lock.synchronized({
        ctx.collect(offset)

        offset += 1
      })
    }
  }

  override def cancel(): Unit = isRunning = false

  override def restoreState(state: util.List[Long]): Unit =
    for (s <- state) {
      offset = s
    }

  override def snapshotState(checkpointId: Long, timestamp: Long): util.List[Long] =
    Collections.singletonList(offset)

}

备注:

Flink版本:1.4

本文参与腾讯云自媒体同步曝光计划，分享自作者个人站点/博客。

原始发表：2018-01-16，如有侵权请联系 cloudcommunity@tencent.com 删除

大数据

scala

本文分享自作者个人站点/博客前往查看

如有侵权，请联系 cloudcommunity@tencent.com 删除。

本文参与腾讯云自媒体同步曝光计划，欢迎热爱写作的你一起参与！

大数据

scala