Observability support  #64

@JamesWrigley

Things that we could do:

  • Keep counters that track communication statistics between workers, like the number of RPC calls and the amount of data sent/received.
  • Expose a hook API so packages can register functions that will be called on sending/receiving data. Not sure about the performance implications of this.
  • Provide a read-only API to check the status of a worker, the time of last contact, and the state of worker-to-worker connections.
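
To make the three ideas above concrete, here is a rough sketch of how they might fit together. It is written in Python purely for illustration and is not tied to the package's actual internals: `Observer`, `PeerStats`, `record`, `register_hook`, and `status` are all hypothetical names, and the byte counts are assumed to be supplied by whatever layer serializes messages.

```python
import dataclasses
import time
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class PeerStats:
    """Hypothetical per-peer counters (names are illustrative only)."""
    rpc_calls: int = 0
    bytes_sent: int = 0
    bytes_received: int = 0
    last_contact: Optional[float] = None  # time.monotonic() of last traffic


# A hook receives (peer_id, direction, nbytes).
Hook = Callable[[int, str, int], None]


class Observer:
    """Keeps counters, runs registered hooks, and exposes read-only status."""

    def __init__(self) -> None:
        self._stats: Dict[int, PeerStats] = {}
        self._hooks: List[Hook] = []

    def register_hook(self, fn: Hook) -> None:
        # Hooks run on every send/receive, so they should be cheap.
        self._hooks.append(fn)

    def record(self, peer: int, direction: str, nbytes: int,
               is_rpc: bool = False) -> None:
        if direction not in ("send", "recv"):
            raise ValueError(f"unknown direction: {direction!r}")
        stats = self._stats.setdefault(peer, PeerStats())
        if direction == "send":
            stats.bytes_sent += nbytes
        else:
            stats.bytes_received += nbytes
        if is_rpc:
            stats.rpc_calls += 1
        stats.last_contact = time.monotonic()
        # With no hooks registered this loop does nothing.
        for hook in self._hooks:
            hook(peer, direction, nbytes)

    def status(self, peer: int) -> PeerStats:
        # Return a copy so callers cannot mutate the internal counters.
        return dataclasses.replace(self._stats.get(peer, PeerStats()))


if __name__ == "__main__":
    obs = Observer()
    obs.register_hook(lambda peer, d, n: print(f"peer {peer}: {d} {n} bytes"))
    obs.record(2, "send", 100, is_rpc=True)
    obs.record(2, "recv", 40)
    print(obs.status(2))
```

In this sketch the counters are plain integer updates and the hook loop is empty unless something registers, which is one way the performance cost could be kept small when nobody is observing. Whether that holds for the real message path would need measuring.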

Use cases:

  • Tracking network usage between workers on HPC clusters
  • Better support for profiling tools like Extrae (Roadmap #1 (comment))

Labels: enhancement