目录监控输入插件
此插件监视单个目录(遍历子目录),并处理放入目录中的每个文件。该插件将在配置的间隔内收集目录中的所有文件,并解析尚未被读取的文件。
文件不应被其他进程使用,否则插件可能会失败。此外,文件不应在实时写入被监视的目录。如果您必须直接写入文件,则必须确保在 directory_duration_threshold 之前写入完成。
引入于: Telegraf v1.18.0 标签: system 操作系统支持: all
全局配置选项
插件支持其他全局和插件配置设置,用于修改指标、标签和字段,创建别名以及配置插件顺序等任务。更多详情请参阅 CONFIGURATION.md。
配置
# Ingests files in a directory and then moves them to a target directory.
[[inputs.directory_monitor]]
## The directory to monitor and read files from (including sub-directories if "recursive" is true).
directory = ""
#
## The directory to move finished files to (maintaining directory hierarchy from source).
finished_directory = ""
#
## Setting recursive to true will make the plugin recursively walk the directory and process all sub-directories.
# recursive = false
#
## The directory to move files to upon file error.
## If not provided, erroring files will stay in the monitored directory.
# error_directory = ""
#
## The amount of time a file is allowed to sit in the directory before it is picked up.
## This time can generally be low but if you choose to have a very large file written to the directory and it's potentially slow,
## set this higher so that the plugin will wait until the file is fully copied to the directory.
# directory_duration_threshold = "50ms"
#
## A list of the only file names to monitor, if necessary. Supports regex. If left blank, all files are ingested.
# files_to_monitor = ["^.*\\.csv"]
#
## A list of files to ignore, if necessary. Supports regex.
# files_to_ignore = [".DS_Store"]
#
## Maximum lines of the file to process that have not yet be written by the
## output. For best throughput set to the size of the output's metric_buffer_limit.
## Warning: setting this number higher than the output's metric_buffer_limit can cause dropped metrics.
# max_buffered_metrics = 10000
#
## The maximum amount of file paths to queue up for processing at once, before waiting until files are processed to find more files.
## Lowering this value will result in *slightly* less memory use, with a potential sacrifice in speed efficiency, if absolutely necessary.
# file_queue_size = 100000
#
## Name a tag containing the name of the file the data was parsed from. Leave empty
## to disable. Cautious when file name variation is high, this can increase the cardinality
## significantly. Read more about cardinality here:
## https://docs.influxdb.org.cn/influxdb/cloud/reference/glossary/#series-cardinality
# file_tag = ""
#
## Specify if the file can be read completely at once or if it needs to be read line by line (default).
## Possible values: "line-by-line", "at-once"
# parse_method = "line-by-line"
#
## The dataformat to be read from the files.
## Each data format has its own unique set of configuration options, read
## more about them here:
## https://github.com/influxdata/telegraf/blob/master/docs/DATA_FORMATS_INPUT.md
data_format = "influx"Metrics
此插件生成的指标的格式取决于文件的内容和数据格式。
当启用 internal 输入时
- internal_directory_monitor
- 字段 (fields)
- files_processed - 已处理的文件数量 (计数器)
- files_dropped - 已丢弃的文件数量 (计数器)
- 字段 (fields)
- internal_directory_monitor
- 标签 (tags)
- directory - 被监控的目录
- 字段 (fields)
- files_processed_per_dir - 每个目录已处理的文件数量 (计数器)
- files_dropped_per_dir - 每个目录已丢弃的文件数量 (计数器)
- files_queue_per_dir - 每个目录待处理的文件数量 (gauge)
- 标签 (tags)
示例输出
此插件生成的指标取决于文件的内容和数据格式。
此页面是否有帮助?
感谢您的反馈!
支持和反馈
感谢您成为我们社区的一员!我们欢迎并鼓励您对 Telegraf 和本文档提出反馈和 bug 报告。要获取支持,请使用以下资源
具有年度合同或支持合同的客户可以 联系 InfluxData 支持。