Google uses AI technology to translate content into your preferred language. AI translations can contain errors.

使用机器学习套件检测姿势 (iOS)

机器学习套件提供了两个经过优化的姿势检测 SDK。

SDK 名称	PoseDetection	PoseDetectionAccurate
实现	基本检测器的资源在构建时会静态链接到您的应用。	准确检测器的资源在构建时会静态链接到您的应用。
应用大小	最高 29.6MB	最高 33.2MB
性能	iPhone X：约 45FPS	iPhone X：约 29FPS

试试看

试用示例应用，了解此 API 的使用示例。

准备工作

在 Podfile 中添加以下机器学习套件 pod：

# If you want to use the base implementation:
pod 'GoogleMLKit/PoseDetection', '8.0.0'

# If you want to use the accurate implementation:
pod 'GoogleMLKit/PoseDetectionAccurate', '8.0.0'

安装或更新项目的 pod 后，请使用 Xcode 项目的 xcworkspace 打开项目。Xcode 13.2.1 版或更高版本支持机器学习套件。

1. 创建 `PoseDetector` 实例

如需检测图片中的姿势，请先创建一个 PoseDetector 实例，并视需要指定检测器设置。

`PoseDetector` 选项

检测模式

PoseDetector 在两种检测模式下运行。请务必选择与您的使用场景相符的模式。

stream（默认）: 姿势检测器会先检测图片中最突出的人，然后运行姿势检测。在后续帧中，除非该人被遮挡或不再以高置信度检测到，否则不会执行人检测步骤。姿势检测器会尝试跟踪最突出的人，并在每次推理中返回其姿势。这样可以减少延迟并使检测更加顺畅。如果您想在视频流中检测姿势，请使用此模式。
singleImage: 姿势检测器会检测一个人，然后运行姿势检测。人检测步骤将针对每张图片运行，因此延迟会更高，并且没有人跟踪。如果您想对静态图片使用姿势检测，或者不希望进行跟踪，请使用此模式。

指定姿势检测器选项：

Swift

// Base pose detector with streaming, when depending on the PoseDetection SDK
let options = PoseDetectorOptions()
options.detectorMode = .stream

// Accurate pose detector on static images, when depending on the
// PoseDetectionAccurate SDK
let options = AccuratePoseDetectorOptions()
options.detectorMode = .singleImage

Objective-C

// Base pose detector with streaming, when depending on the PoseDetection SDK
MLKPoseDetectorOptions *options = [[MLKPoseDetectorOptions alloc] init];
options.detectorMode = MLKPoseDetectorModeStream;

// Accurate pose detector on static images, when depending on the
// PoseDetectionAccurate SDK
MLKAccuratePoseDetectorOptions *options =
    [[MLKAccuratePoseDetectorOptions alloc] init];
options.detectorMode = MLKPoseDetectorModeSingleImage;

最后，获取 PoseDetector 的实例。传递您指定的选项：

Swift

let poseDetector = PoseDetector.poseDetector(options: options)

Objective-C

MLKPoseDetector *poseDetector =
    [MLKPoseDetector poseDetectorWithOptions:options];

2. 准备输入图片

如需检测姿势，请对每个图片或视频帧执行以下操作。如果您启用了流模式，则必须基于 CMSampleBuffer 创建 VisionImage 对象。

使用 UIImage 或 CMSampleBuffer 创建 VisionImage 对象。

如果您使用 UIImage，请按以下步骤操作：

使用 UIImage 创建 VisionImage 对象。请务必指定正确的 .orientation。

Swift

let image = VisionImage(image: UIImage)
visionImage.orientation = image.imageOrientation

Objective-C

MLKVisionImage *visionImage = [[MLKVisionImage alloc] initWithImage:image];
visionImage.orientation = image.imageOrientation;

如果您使用 CMSampleBuffer，请按以下步骤操作：

指定中所含图片数据的方向。CMSampleBuffer

如需获取图片方向，请运行以下命令：

Swift

func imageOrientation(
  deviceOrientation: UIDeviceOrientation,
  cameraPosition: AVCaptureDevice.Position
) -> UIImage.Orientation {
  switch deviceOrientation {
  case .portrait:
    return cameraPosition == .front ? .leftMirrored : .right
  case .landscapeLeft:
    return cameraPosition == .front ? .downMirrored : .up
  case .portraitUpsideDown:
    return cameraPosition == .front ? .rightMirrored : .left
  case .landscapeRight:
    return cameraPosition == .front ? .upMirrored : .down
  case .faceDown, .faceUp, .unknown:
    return .up
  }
}

Objective-C

- (UIImageOrientation)
  imageOrientationFromDeviceOrientation:(UIDeviceOrientation)deviceOrientation
                         cameraPosition:(AVCaptureDevicePosition)cameraPosition {
  switch (deviceOrientation) {
    case UIDeviceOrientationPortrait:
      return cameraPosition == AVCaptureDevicePositionFront ? UIImageOrientationLeftMirrored
                                                            : UIImageOrientationRight;

    case UIDeviceOrientationLandscapeLeft:
      return cameraPosition == AVCaptureDevicePositionFront ? UIImageOrientationDownMirrored
                                                            : UIImageOrientationUp;
    case UIDeviceOrientationPortraitUpsideDown:
      return cameraPosition == AVCaptureDevicePositionFront ? UIImageOrientationRightMirrored
                                                            : UIImageOrientationLeft;
    case UIDeviceOrientationLandscapeRight:
      return cameraPosition == AVCaptureDevicePositionFront ? UIImageOrientationUpMirrored
                                                            : UIImageOrientationDown;
    case UIDeviceOrientationUnknown:
    case UIDeviceOrientationFaceUp:
    case UIDeviceOrientationFaceDown:
      return UIImageOrientationUp;
  }
}

使用 CMSampleBuffer 对象和方向创建一个 VisionImage 对象：

Swift

let image = VisionImage(buffer: sampleBuffer)
image.orientation = imageOrientation(
  deviceOrientation: UIDevice.current.orientation,
  cameraPosition: cameraPosition)

Objective-C

 MLKVisionImage *image = [[MLKVisionImage alloc] initWithBuffer:sampleBuffer];
 image.orientation =
   [self imageOrientationFromDeviceOrientation:UIDevice.currentDevice.orientation
                                cameraPosition:cameraPosition];

3. 处理图片

将 VisionImage 传递给姿势检测器的图片处理方法之一。您可以使用异步的 process(image:) 方法或同步的 results() 方法。

要同步检测对象，请运行以下代码：

Swift

var results: [Pose]
do {
  results = try poseDetector.results(in: image)
} catch let error {
  print("Failed to detect pose with error: \(error.localizedDescription).")
  return
}
guard let detectedPoses = results, !detectedPoses.isEmpty else {
  print("Pose detector returned no results.")
  return
}

// Success. Get pose landmarks here.

Objective-C

NSError *error;
NSArray *poses = [poseDetector resultsInImage:image error:&error];
if (error != nil) {
  // Error.
  return;
}
if (poses.count == 0) {
  // No pose detected.
  return;
}

// Success. Get pose landmarks here.

要异步检测对象，请运行以下代码：

Swift

poseDetector.process(image) { detectedPoses, error in
  guard error == nil else {
    // Error.
    return
  }
  guard !detectedPoses.isEmpty else {
    // No pose detected.
    return
  }

  // Success. Get pose landmarks here.
}

Objective-C

[poseDetector processImage:image
                completion:^(NSArray * _Nullable poses,
                             NSError * _Nullable error) {
                    if (error != nil) {
                      // Error.
                      return;
                    }
                    if (poses.count == 0) {
                      // No pose detected.
                      return;
                    }

                    // Success. Get pose landmarks here.
                  }];

4. 获取检测到的姿势的相关信息

如果图片中检测到人，姿势检测 API 会将 Pose 对象数组传递给完成处理程序或返回该数组，具体取决于您调用的是异步方法还是同步方法。

如果该人未完全位于图片内，模型会将缺失的地标坐标分配到框架之外，并为其提供较低的 InFrameConfidence 值。

如果未检测到人，则数组为空。

Swift

for pose in detectedPoses {
  let leftAnkleLandmark = pose.landmark(ofType: .leftAnkle)
  if leftAnkleLandmark.inFrameLikelihood > 0.5 {
    let position = leftAnkleLandmark.position
  }
}

Objective-C

for (MLKPose *pose in detectedPoses) {
  MLKPoseLandmark *leftAnkleLandmark =
      [pose landmarkOfType:MLKPoseLandmarkTypeLeftAnkle];
  if (leftAnkleLandmark.inFrameLikelihood > 0.5) {
    MLKVision3DPoint *position = leftAnkleLandmark.position;
  }
}

提升性能的提示

结果的质量取决于输入图片的质量：

为了使机器学习套件准确检测姿势，图片中的人应由足够大的像素数据表示；为了获得最佳性能，正文应至少为 256x256 像素。
如果您在实时应用中检测姿势，可能还需要考虑输入图片的整体尺寸。较小图片的处理速度相对较快，因此，为了减少延迟时间，请以较低的分辨率捕获图片，但请牢记上述分辨率要求，并确保正文在图片中占据尽可能大的画面。
图片聚焦不良也会影响准确性。如果您无法获得满意的结果，请让用户重新捕获图片。

如果要在实时应用中使用姿势检测，请遵循以下准则以实现最佳帧速率：

使用基本 PoseDetection SDK 和 stream 检测模式。
建议以较低分辨率捕获图片，但请注意此 API 的图片尺寸要求。
如需处理视频帧，请使用检测器的 results(in:) 同步 API。通过 AVCaptureVideoDataOutputSampleBufferDelegate's captureOutput(_, didOutput:from:) 函数调用此方法，以从给定的视频帧同步获取结果。将 AVCaptureVideoDataOutput 的 alwaysDiscardsLateVideoFrames 始终设为 true，以限制检测器的调用次数。如果在检测器运行时有新的视频帧可用，系统会丢弃该帧。
如果要将检测器的输出作为图形叠加在输入图片上，请先从机器学习套件获取结果，然后在一个步骤中完成图片的呈现和叠加。采用这一方法，每个经过处理的输入帧只需在显示表面呈现一次。如需查看示例，请参阅示例应用中的 previewOverlayView 和 MLKDetectionOverlayView 类。

后续步骤

如需了解如何使用姿势地标对姿势进行分类，请参阅姿势分类提示。
如需了解此 API 的实际使用示例，请查看 GitHub 上的机器学习套件快速入门示例。

使用机器学习套件检测姿势 (iOS) 使用集合让一切井井有条 根据您的偏好保存内容并对其进行分类。

试试看

准备工作

1. 创建 PoseDetector 实例

PoseDetector 选项

检测模式

Swift

Objective-C

Swift

Objective-C

2. 准备输入图片

Swift

Objective-C

Swift

Objective-C

Swift

Objective-C

3. 处理图片

Swift

Objective-C

Swift

Objective-C

4. 获取检测到的姿势的相关信息

Swift

Objective-C

提升性能的提示

后续步骤

使用机器学习套件检测姿势 (iOS)

1. 创建 `PoseDetector` 实例

`PoseDetector` 选项