c# 【.Net实用方法总结】整理并总结.NET 中的 System.IO.Pipelines（管道）

作者简介：博主是一位.Net开发者，同时也是RPA和低代码平台的践行者。 个人主页：会敲键盘的肘子 系列专栏：.Net实用方法总结呂专栏简介：博主针对.Net开发和C站问答过程中遇到的问题进行总结，形成本专栏，希望可以帮助到您解决问题。 座右铭：总有一天你所坚持的会反过来拥抱你。

写在前面：

System.IO.Pipelines 是一个库，旨在使在 .NET 中执行高性能 I/O 更加容易。该库的目标为适用于所有 .NET 实现的 .NET Standard。

System.IO.Pipelines 已构建为：

具有高性能的流数据分析功能。减少代码复杂性。

本文关键字：System.IO.Pipelines、文件系统、方法示例、管道

文章目录

1️⃣ 概述2️⃣ System.IO.Pipelines 解决什么问题♈ 引入问题

3️⃣ 管道♈ 基本介绍♊ 基本用法

4️⃣ 反压和流量控制5️⃣ PipeScheduler6️⃣ 管道重置♈ 场景需求♊ PipeReader⭐ 读取流数据方案⭐ 读取单条消息⭐ 读取多条消息⭐ 取消⭐ PipeReader 常见问题⭐ 有问题的代码❌数据丢失❌ 无限循环❌ 应用程序无响应❌ 内存不足 (OOM)❌ 内存损坏

♋ PipeWrite⭐ 取消⭐ PipeWriter 常见问题

♎ PipeWrite和PipeReader的建议

7️⃣ IDuplexPipe♈ 基本介绍♊ 流⭐ 示例

1️⃣ 概述

文件和流 I/O（输入/输出）是指在存储媒介中传入或传出数据。在 .NET 中，System.IO 命名空间包含允许以异步方式和同步方式对数据流和文件进行读取和写入操作的类型。这些命名空间还包含对文件执行压缩和解压缩的类型，以及通过管道和串行端口启用通信的类型。

文件是一个由字节组成的有序的命名集合，它具有永久存储。在处理文件时，你将处理目录路径、磁盘存储、文件和目录名称。相反，流是一个字节序列，可用于对后备存储进行读取和写入操作，后备存储可以是多个存储媒介之一（例如，磁盘或内存）。正如存在除磁盘之外的多种后备存储一样，也存在除文件流之外的多种流（如网络、内存和管道流）。

文件和目录

File - 提供用于创建、复制、删除、移动和打开文件的静态方法，并可帮助创建 FileStream 对象。FileInfo - 提供用于创建、复制、删除、移动和打开文件的实例方法，并可帮助创建 FileStream 对象。Directory - 提供用于创建、移动和枚举目录和子目录的静态方法。DirectoryInfo - 提供用于创建、移动和枚举目录和子目录的实例方法。Path - 提供用于以跨平台的方式处理目录字符串的方法和属性。流

FileStream - 用于对文件进行读取和写入操作。MemoryStream - 用于作为后备存储对内存进行读取和写入操作。BufferedStream - 用于改进读取和写入操作的性能。读取器和编写器

BinaryReader 和 BinaryWriter - 用于将基元数据类型作为二进制值进行读取和写入。StreamReader 和 StreamWriter - 用于通过使用编码值在字符和字节之间来回转换来读取和写入字符。StringReader 和 StringWriter - 用于从字符串读取字符以及将字符写入字符串中。TextReader 和 TextWriter - 用作其他读取器和编写器（读取和写入字符和字符串，而不是二进制数据）的抽象基类。

2️⃣ System.IO.Pipelines 解决什么问题

♈ 引入问题

下面的代码是典型的 TCP 服务器，它从客户机接收行分隔的消息（由 '\n' 分隔）：

async Task ProcessLinesAsync(NetworkStream stream)

{

var buffer = new byte[1024];

await stream.ReadAsync(buffer, 0, buffer.Length);

// Process a single line from the buffer

ProcessLine(buffer);

}

前面的代码有几个问题：

单次调用 ReadAsync 可能无法接收整条消息（行尾）。忽略了 stream.ReadAsync 的结果。 stream.ReadAsync 返回读取的数据量。它不能处理在单个 ReadAsync 调用中读取多行的情况。它为每次读取分配一个 byte 数组。

要解决上述问题，需要进行以下更改：

缓冲传入的数据，直到找到新行。分析缓冲区中返回的所有行。该行可能大于 1KB（1024 字节）。此代码需要调整输入缓冲区的大小，直到找到分隔符后，才能在缓冲区内容纳完整行。

如果调整缓冲区的大小，当输入中出现较长的行时，将生成更多缓冲区副本。压缩用于读取行的缓冲区，以减少空余。请考虑使用缓冲池来避免重复分配内存。下面的代码解决了其中一些问题： async Task ProcessLinesAsync(NetworkStream stream)

{

byte[] buffer = ArrayPool.Shared.Rent(1024);

var bytesBuffered = 0;

var bytesConsumed = 0;

while (true)

{

// Calculate the amount of bytes remaining in the buffer.

var bytesRemaining = buffer.Length - bytesBuffered;

if (bytesRemaining == 0)

{

// Double the buffer size and copy the previously buffered data into the new buffer.

var newBuffer = ArrayPool.Shared.Rent(buffer.Length * 2);

Buffer.BlockCopy(buffer, 0, newBuffer, 0, buffer.Length);

// Return the old buffer to the pool.

ArrayPool.Shared.Return(buffer);

buffer = newBuffer;

bytesRemaining = buffer.Length - bytesBuffered;

}

var bytesRead = await stream.ReadAsync(buffer, bytesBuffered, bytesRemaining);

if (bytesRead == 0)

{

// EOF

break;

}

// Keep track of the amount of buffered bytes.

bytesBuffered += bytesRead;

var linePosition = -1;

{

// Look for a EOL in the buffered data.

linePosition = Array.IndexOf(buffer, (byte)'\n', bytesConsumed,

bytesBuffered - bytesConsumed);

if (linePosition >= 0)

{

// Calculate the length of the line based on the offset.

var lineLength = linePosition - bytesConsumed;

// Process the line.

ProcessLine(buffer, bytesConsumed, lineLength);

// Move the bytesConsumed to skip past the line consumed (including \n).

bytesConsumed += lineLength + 1;

}

while (linePosition >= 0);

}

前面的代码很复杂，不能解决所识别的所有问题。高性能网络通常意味着编写复杂的代码以使性能最大化。 System.IO.Pipelines 的设计目的是使编写此类代码更容易。

System.IO.Pipelines 已构建为：

具有高性能的流数据分析功能。减少代码复杂性。

3️⃣ 管道

♈ 基本介绍

Pipe 类可用于创建 PipeWriter/PipeReader 对。写入 PipeWriter 的所有数据都可用于 PipeReader：

var pipe = new Pipe();

PipeReader reader = pipe.Reader;

PipeWriter writer = pipe.Writer;

我们已在上篇文章中介绍了Pipe 类的用法，大家可以去查看本专栏之前的文章。

♊ 基本用法

示例

async Task ProcessLinesAsync(Socket socket)

{

var pipe = new Pipe();

Task writing = FillPipeAsync(socket, pipe.Writer);

Task reading = ReadPipeAsync(pipe.Reader);

await Task.WhenAll(reading, writing);

}

async Task FillPipeAsync(Socket socket, PipeWriter writer)

{

const int minimumBufferSize = 512;

while (true)

{

// Allocate at least 512 bytes from the PipeWriter.

Memory memory = writer.GetMemory(minimumBufferSize);

try

{

int bytesRead = await socket.ReceiveAsync(memory, SocketFlags.None);

if (bytesRead == 0)

{

break;

}

// Tell the PipeWriter how much was read from the Socket.

writer.Advance(bytesRead);

}

catch (Exception ex)

{

LogError(ex);

break;

}

// Make the data available to the PipeReader.

FlushResult result = await writer.FlushAsync();

if (result.IsCompleted)

{

break;

}

// By completing PipeWriter, tell the PipeReader that there's no more data coming.

await writer.CompleteAsync();

}

async Task ReadPipeAsync(PipeReader reader)

{

while (true)

{

ReadResult result = await reader.ReadAsync();

ReadOnlySequence buffer = result.Buffer;

while (TryReadLine(ref buffer, out ReadOnlySequence line))

{

// Process the line.

ProcessLine(line);

}

// Tell the PipeReader how much of the buffer has been consumed.

reader.AdvanceTo(buffer.Start, buffer.End);

// Stop reading if there's no more data coming.

if (result.IsCompleted)

{

break;

}

// Mark the PipeReader as complete.

await reader.CompleteAsync();

}

bool TryReadLine(ref ReadOnlySequence buffer, out ReadOnlySequence line)

{

// Look for a EOL in the buffer.

SequencePosition? position = buffer.PositionOf((byte)'\n');

if (position == null)

{

line = default;

return false;

}

// Skip the line + the \n.

line = buffer.Slice(0, position.Value);

buffer = buffer.Slice(buffer.GetPosition(1, position.Value));

return true;

}

上述代码有两个循环：

FillPipeAsync 从 Socket 读取并写入 PipeWriter。ReadPipeAsync 从 PipeReader 读取并分析传入的行。

没有分配显式缓冲区。所有缓冲区管理都委托给 PipeReader 和 PipeWriter 实现。委派缓冲区管理使使用代码更容易集中关注业务逻辑。

在第一个循环中：

调用 PipeWriter.GetMemory(Int32) 从基础编写器获取内存。调用 PipeWriter.Advance(Int32) 以告知 PipeWriter 有多少数据已写入缓冲区。调用 PipeWriter.FlushAsync 以使数据可用于 PipeReader。

在第二个循环中，PipeReader 使用由 PipeWriter 写入的缓冲区。缓冲区来自套接字。对 PipeReader.ReadAsync 的调用：

返回包含两条重要信息的 ReadResult：

以 ReadOnlySequence 形式读取的数据。布尔值 IsCompleted，指示是否已到达数据结尾 (EOF)。

找到行尾 (EOL) 分隔符并分析该行后：

该逻辑处理缓冲区以跳过已处理的内容。调用 PipeReader.AdvanceTo 以告知 PipeReader 已消耗和检查了多少数据。

读取器和编写器循环通过调用 Complete 结束。 Complete 使基础管道释放其分配的内存。

4️⃣ 反压和流量控制

理想情况下，读取和分析可协同工作：

读取线程使用来自网络的数据并将其放入缓冲区。分析线程负责构造适当的数据结构。

通常，分析所花费的时间比仅从网络复制数据块所用时间更长：

读取线程领先于分析线程。读取线程必须减缓或分配更多内存来存储用于分析线程的数据。

为了获得最佳性能，需要在频繁暂停和分配更多内存之间取得平衡。

为解决上述问题，Pipe 提供了两个设置来控制数据流：

PauseWriterThreshold：确定在调用 FlushAsync 暂停之前应缓冲多少数据。 ResumeWriterThreshold：确定在恢复对 PipeWriter.FlushAsync 的调用之前，读取器必须观察多少数据。

PipeWriter.FlushAsync:

当 Pipe 中的数据量超过 PauseWriterThreshold 时，返回不完整的 ValueTask。低于 ResumeWriterThreshold 时，返回完整的 ValueTask。

使用两个值可防止快速循环，如果只使用一个值，则可能发生这种循环。

示例

// The Pipe will start returning incomplete tasks from FlushAsync until

// the reader examines at least 5 bytes.

var options = new PipeOptions(pauseWriterThreshold: 10, resumeWriterThreshold: 5);

var pipe = new Pipe(options);

5️⃣ PipeScheduler

通常在使用 async 和 await 时，异步代码会在 TaskScheduler 或当前 SynchronizationContext 上恢复。

在执行 I/O 时，对执行 I/O 的位置进行细粒度控制非常重要。此控件允许高效利用 CPU 缓存。高效的缓存对于 Web 服务器等高性能应用至关重要。 PipeScheduler 提供对异步回调运行位置的控制。默认情况下：

使用当前的 SynchronizationContext。如果没有 SynchronizationContext，它将使用线程池运行回调。

示例

public static void Main(string[] args)

{

var writeScheduler = new SingleThreadPipeScheduler();

var readScheduler = new SingleThreadPipeScheduler();

// Tell the Pipe what schedulers to use and disable the SynchronizationContext.

var options = new PipeOptions(readerScheduler: readScheduler,

writerScheduler: writeScheduler,

useSynchronizationContext: false);

var pipe = new Pipe(options);

}

// This is a sample scheduler that async callbacks on a single dedicated thread.

public class SingleThreadPipeScheduler : PipeScheduler

{

private readonly BlockingCollection<(Action

夸智网

c# 【.Net实用方法总结】整理并总结.NET 中的 System.IO.Pipelines（管道）

linux Ubuntu22.04中安装Golang1.20和Goland，并创建一个新项目

柚子快报邀请码778899分享：开发语言 C#——委托

发表评论取消回复

夸智网

c# 【.Net实用方法总结】 整理并总结.NET 中的 System.IO.Pipelines（管道）

linux Ubuntu22.04中安装Golang1.20和Goland，并创建一个新项目

柚子快报邀请码778899分享：开发语言 C#——委托

相关文章

发表评论取消回复

c# 【.Net实用方法总结】整理并总结.NET 中的 System.IO.Pipelines（管道）