NIFI-15681 - Enhance PutElasticsearchJson to support NDJSON, JSON Arr… by agturley · Pull Request #10981 · apache/nifi

agturley · 2026-03-08T05:17:27Z

…ay, and Single JSON input formats with size-based batching

Summary

NIFI-15681

Tracking

Please complete the following tracking steps prior to pull request creation.

Issue Tracking

Apache NiFi Jira issue created

Pull Request Tracking

Pull Request title starts with Apache NiFi Jira issue number, such as NIFI-00000
Pull Request commit message starts with Apache NiFi Jira issue number, as such NIFI-00000
Pull request contains commits signed with a registered key indicating Verified status

Pull Request Formatting

Pull Request based on current revision of the main branch
Pull Request refers to a feature branch with one commit containing changes

Verification

Please indicate the verification steps performed prior to pull request creation.

Build

Build completed using ./mvnw clean install -P contrib-check
- JDK 21
- JDK 25

Licensing

New dependencies are compatible with the Apache License 2.0 according to the License Policy
New dependencies are documented in applicable LICENSE and NOTICE files

Documentation

Documentation formatting appears as expected in rendered files

pvillard31

Few comments after having a quick look through the changes.

agturley · 2026-03-10T01:48:04Z

Finished round1 of your suggestions, please let me know your thoughts on the error handling shenanigans I'm trying. I'll be doing high volumes testing tomorrow and report back.

agturley

pushed with changes

agturley · 2026-04-05T18:59:47Z

Extended NDJSON and JSON Array modes to support field-based Elasticsearch document IDs, bringing them to parity with Single JSON mode.

Single JSON — set Identifier Attribute to the name of the FlowFile attribute that holds the document ID.
NDJSON / JSON Array — set Identifier Field to the name of the JSON field within each document to use as the document ID.
In both cases, if no ID is resolved, Elasticsearch auto-generates one.

…ay, and Single JSON input formats with size-based batching

pvillard31

Thanks for the updates @agturley - Left some comments and there are some tests failures to address.

pvillard31 · 2026-04-06T16:48:59Z

+                return;
            } catch (final Exception ex) {
                getLogger().error("Could not index documents.", ex);
-                transferFlowFilesOnException(ex, REL_FAILURE, session, false, originals.toArray(new FlowFile[0]));
+                final Set<FlowFile> inFlight = new LinkedHashSet<>(operationFlowFiles);
+                transferFlowFilesOnException(ex, REL_FAILURE, session, false, inFlight.toArray(new FlowFile[0]));
+                final Set<FlowFile> alreadyIndexed = new LinkedHashSet<>(allProcessedFlowFiles);
+                alreadyIndexed.removeAll(inFlight);
+                if (!alreadyIndexed.isEmpty()) {
+                    handleFinalResponse(context, session, errorFlowFiles, alreadyIndexed, pendingErrorRecordIndices, inputFormat);
+                }
                context.yield();
+                return;


It may be OK but the old code had a separate catch (JsonProcessingException) that routed to REL_ERRORS. That's now removed and any JsonProcessingException from flushChunk now falls through to this handler and routes to REL_FAILURE instead. Existing flows relying on REL_ERRORS for this edge case would stop receiving those FlowFiles.

I think REL_ERRORS is better for document-level failures where Elasticsearch rejected a record. If we can't parse the response at all, like what I believe is what would happen to trigger this, we don't know what happened to the documents, making REL_FAILURE the more appropriate destination. I'm okay with reverting this back if you think that's better, Ii'll leave it for you to think about.

…ay, and Single JSON input formats with size-based batching

agturley · 2026-04-07T03:50:52Z

fixed test failures

pvillard31

Latest LGTM, thanks @agturley

agturley force-pushed the NIFI-15681 branch from c00847c to 6c8505e Compare March 9, 2026 13:58

pvillard31 requested changes Mar 9, 2026

View reviewed changes

agturley requested a review from pvillard31 March 10, 2026 01:48

agturley commented Mar 22, 2026

View reviewed changes

agturley force-pushed the NIFI-15681 branch from e29f75a to b762aa3 Compare March 22, 2026 22:49

pvillard31 reviewed Mar 29, 2026

View reviewed changes

Comment thread ...-processors/src/main/java/org/apache/nifi/processors/elasticsearch/PutElasticsearchJson.java Outdated

agturley requested a review from pvillard31 April 5, 2026 05:36

NIFI-15681 - Enhance PutElasticsearchJson to support NDJSON, JSON Arr…

cf239bf

…ay, and Single JSON input formats with size-based batching

agturley force-pushed the NIFI-15681 branch from 094bf6c to cf239bf Compare April 6, 2026 04:02

pvillard31 requested changes Apr 6, 2026

View reviewed changes

NIFI-15681 - Enhance PutElasticsearchJson to support NDJSON, JSON Arr…

f4b1ccd

…ay, and Single JSON input formats with size-based batching

agturley requested a review from pvillard31 April 7, 2026 03:51

pvillard31 approved these changes Apr 7, 2026

View reviewed changes

pvillard31 merged commit 5fc2e6a into apache:main Apr 7, 2026
9 checks passed

Conversation

agturley commented Mar 8, 2026

Summary

Tracking

Issue Tracking

Pull Request Tracking

Pull Request Formatting

Verification

Build

Licensing

Documentation

Uh oh!

pvillard31 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

agturley commented Mar 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

agturley left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

agturley commented Apr 5, 2026

Uh oh!

pvillard31 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

pvillard31 Apr 6, 2026

Choose a reason for hiding this comment

Uh oh!

agturley Apr 7, 2026

Choose a reason for hiding this comment

Uh oh!

agturley commented Apr 7, 2026

Uh oh!

pvillard31 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

agturley commented Mar 10, 2026 •

edited

Loading