Additional storage backends #638

yurishkuro · 2018-01-08T21:18:55Z

nbettiol · 2018-01-09T15:14:35Z

Did you remove the flags for elasticsearch in jaeger-collector? Because I'm doing a test using the image docker, which version is:

{"gitCommit":"dbd5db721fc59431b1e64874cc7d6265d89ec917","GitVersion":"v1.1.0","BuildDate":"2018-01-08T21:56:21Z"}

and I cannot see the elasticsearch flags.

black-adder · 2018-01-09T15:32:47Z

It looks like you're using latest instead of 1.1. We recently moved around some of the flags so that we can support plugins better #625. Using latest, you have to instead use env variable SPAN_STORAGE=elasticsearch to use the elasticsearch flags. I'd recommend that you use 1.1 since this change will be apart of 1.2 and will be documented at that time.

nbettiol · 2018-01-09T15:37:25Z

Thanks for the reply, yes I was using the latest version. I will use the 1.1

fzakaria · 2018-01-16T17:26:26Z

I would love to see a SQL option (whatever ANSI SQL that will be least vendor lock-in).
Setting up Cassandra / ElasticSearch might be too ambitious for projects that want distributed tracing but honestly don't have the TPS to warrant a distributed datastore.

ringerc · 2018-02-03T07:18:44Z

Since I work with PostgreSQL, I sure wouldn't complain. But honestly I'm not sure a SQL db is an optimal store for largely free-form metrics of this nature. PostgreSQL at least offers the jsonb type for indexable free-form data. If you're trying to do this in a vendor neutral way you'll land up with your own json blobs, or doing EAV, and both of those are terrible. ANSI SQL is a poor fit for variable-structured or key/value form data and you'll need some vendor extensions to get usable performance.

But you inevitably land up with someone putting an ORM on top to "abstract" the DB. Then the ORM performs terribly, gobbles memory and everyone says "the SQL backend is slow, use instead".

pavolloffay · 2018-02-05T09:38:45Z

Related issue to this one is #551. Upvote if you are interested in it.

SwarnimRaj · 2018-06-29T04:48:23Z

New related issue-
Files - #894

wy100101 · 2018-08-01T16:46:36Z

We are looking at using BigQuery as a storage layer. Presumably this could work with a SQL storage option. SQL can be a generic way to deal with columnar data stores in a generic way. I would complain about a BigQuery specific solution, but I think there is a place for generic SQL interface beyond RDBs.

yurishkuro · 2018-08-01T17:23:09Z

I assume that even if some database can be treated as SQL and accessed via standard database/sql API, we still need to statically import the actual driver. Granted, this may be less maintenance than a dedicated SpanStorage implementation. However, now that the protobuf model has been merged, nothing is blocking us from moving on the storage plugin dev, eg using something like harshicorp grpc plugin framework.

isaachier · 2018-08-01T17:48:09Z

~~Our model is sufficiently simple to warrant looking into using an ORM to support a large number of backends. I'll take a look at what's available.~~ Reread above and understand what @yurishkuro means.

bruth · 2018-08-06T23:52:12Z

Giving my two cents.. an ANSI SQL could work for small workloads, so may be useful for lower-throughput applications that still want to benefit from this tool.

I will also throw out there that Timescale (a Postgres extension) may be a good fit for the required high write throughput.

mcarbonneaux · 2019-05-27T18:08:38Z

Clickhouse are SQL high performance storage very efficient for log and trace storage and whold be perfect storage alternative to cassandra original one... they are a true column db... distributed...compressed...

they are near to the CQL (sql like query language)... they use an SQL like language to...

https://clickhouse.yandex/

chvck · 2019-05-31T16:35:33Z

I just thought that I'd drop something here to say that there is also support for using Couchbase as a storage backend (via the grpc plugin), currently at https://github.com/chvck/couchbase-jaeger-storage-plugin. Will likely move to the couchbase-labs organisation in time.

omerlh · 2019-07-18T06:56:28Z

Has someone started to work on Azure CosmosDB integration? It has support for Cassandra API, but I couldn't manage to make it work...

rleiwang · 2020-10-02T23:46:23Z

I just created an issue proposing Chronowave as storage backend. #2534

DjinNO · 2020-10-15T12:02:49Z

What about ClickHouse? Clickhouse is very cool

jkowall · 2021-04-20T13:03:58Z

Cool! Nice job @muhammadn I'm curious how the performance is with a search using S3 in that manner. Are you planning on deploying or using this setup?

yurishkuro · 2021-04-20T16:17:15Z

@muhammadn I re-opened #2633 and linked your repo at the top. Suggest moving discussions/updates there.

em135 · 2021-05-17T13:06:56Z

@Xitric and I are working in on a storage backend for Humio using the grpc plugin. The repository can currently be found at https://github.com/em135/humio-jaeger-plugin. I have opened an issue for this: #3005

galan · 2021-06-01T16:05:44Z

If S3 is supported, I would suggest to support GCS as well, which is also an objects-storage for all Google Cloud users. For our use-case that would be tremendous helpful!

muhammadn · 2021-06-04T11:08:06Z

@galan it does actually, despite the name is jaeger-s3, i had already added support for GCS (and Azure Storage) for quite some time but i have not tested it.

Maybe you can go to https://github.com/muhammadn/jaeger-s3 to try it out.

Related code to GCS:
https://github.com/muhammadn/jaeger-s3/blob/main/config/config.go#L26
https://github.com/muhammadn/jaeger-s3/blob/main/s3store/store.go#L50

All you need is to modify the configuration for jaeger-s3 to this:

https://grafana.com/docs/loki/latest/operations/storage/boltdb-shipper/#example-configuration

which has the GCS config.

I will update the documentation to jaeger-s3 include GCS and Azure as well. (and probably change the project name entirely)

Do tell me if you need help. But i think we can move this discussion to #2633

@galan Update: I have updated the documentation - https://github.com/muhammadn/jaeger-s3/blob/main/README.md

Also just a question from the community, should i rename this project as a more generic name rather than jaeger-s3 since this plugins will support GCS and Azure as well?

qiansheng91 · 2021-08-16T02:19:06Z

@jpkrohling The Alibaba cloud log service has supported the jaeger, and here are the gifs of the plugin. Link: https://github.com/qiansheng91/jaeger-sls#quick-start

acceptMyPR · 2021-12-09T14:41:37Z

Can i use other jaeger collector as backend for my jaeger collector?

pavolloffay · 2021-12-09T14:43:44Z

Jaeger collector cannot send data (e.g. over gRPC) to other jaeger collector. However this capability is supported with OTEL collector.

nitinsaprumaersk · 2022-03-17T11:50:16Z

@yurishkuro Would highly recommend to add Azure Table Storage as a backend storage option as well.

arajkumar · 2022-10-21T08:11:23Z

@yurishkuro Could you please add PostgreSQL with Promscale into the list.? Now Promscale is Jaeger storage complaint too :) Thanks.

nicolastakashi · 2022-11-01T11:57:16Z

@yurishkuro could you please add RediSearch to the list? I've worked on a GRPC Plugin to Store Traces on Redis Search and this is close to the first release, I'm just adding a few performance tests.

https://github.com/nicolastakashi/jaeger-redisearch

coverthesea · 2022-12-11T05:44:17Z

Zinc https://github.com/zinclabs/zinc

vemula-anu · 2023-03-03T07:22:29Z

i use jaeger with timescaledb in kubernetes
{"level":"info","ts":1677823818.9793034,"caller":"querysvc/query_service.go:137","msg":"Archive storage not created","reason":"archive storage not supported"}
i use this configurations
spec:
containers:
- env:
- name: GRPC_STORAGE_SERVER
value: promscale:9202
- name: SPAN_STORAGE_TYPE
value: grpc-plugin
image: jaegertracing/jaeger-query:1.30
name: jaeger
ports:
- containerPort: 16685
- containerPort: 16686
restartPolicy: Always
the problem getting can anyone give solution for this

yurishkuro · 2023-03-03T08:04:09Z

@vemula-anu please do not post support questions to this issue, create a new question in Discussions.

diondew · 2023-04-19T15:50:32Z

Could you please add Yugabyte to the list? #4354

jkowall · 2023-06-27T20:49:29Z

Could you please add Yugabyte to the list? #4354

Added, sorry for the delay.

paulgrav · 2023-10-31T16:21:48Z

The creator of Tempo @joe-elliott is a maintainer here at Jaeger and is also an engineer with Grafana Labs. We had this discussion, but right now Tempo cannot support the types of queries the Jaeger UI does today. I know that may change, at which point Tempo would be a good Jaeger backend, but today it cannot do everything necessary.

I know Tempo today is a lot more capable in terms of its search compared to back in 2021. Is it now a potentially good Jaeger backend or are there still gaps?

joe-elliott · 2023-11-15T21:13:51Z

Tempo has a broader set of search capabilities (via TraceQL) than Jaeger search. So, generally, Tempo could be used to back Jaeger. There are two gaps I'm aware of.

First, Jaeger search currently returns the entire trace to the frontend which then renders the search results pane you see in the UI. Tempo, however, only returns metadata to the frontend. I don't believe this metadata is enough to render every element of the Jaeger search results. For instance, I don't think you could list the services in the trace.

Second, we currently only retrieve auto complete tags from recent traces. So if you were searching a time range from yesterday the auto complete would still be based on the traces received in the last half hour or so. We are working to address this.

muhammadn · 2024-02-25T01:04:38Z

@paulgrav

Just what @joe-elliott explained but i want to add that it had been already done. I had completely overhauled jaeger-objectstorage to use tempo as a backend to store traces to AWS S3/GCS/AzureBlob since i believe tempo is already mature to support multiple cloud storage providers. Back in the early days of tempo there wasn't support for GCS and AzureBlob so we used loki as the interface to store trace data.

I've posted in the forums on how it would look like so you can take a look.

The codes are already published (both the tempo fork and jaeger-objectstorage) but the documentation needs more polishing.

jiajiayang · 2024-06-04T12:00:46Z

Is it possible to use loki as a back-end store, which can be a good correlation between logging and tracing?

jkowall · 2024-06-08T20:38:25Z

Is it possible to use loki as a back-end store, which can be a good correlation between logging and tracing?

Unfortunately, no @jiajiayang as loki is a logging system, however Tempo is the tracing backend and doesn't support full text search which is how Jaeger does the querying you see in the search dialog box.

Even in Grafana stack you still run multiple backends.

joker-star-l · 2024-09-04T14:37:24Z

Hello community, I am going to implement a apache doris grpc storage.
I have done some work in open telemetry community. The open telemetry collector will be able to write traces data directly to apache doris.
So can I only implement the SpanReaderPlugin interface in the grpc service, or I still need to implement the SpanWriterPlugin interface?

Could you give me some advice? Thank you!

yurishkuro · 2024-09-04T15:32:27Z

@joker-star-l it's up to you. You can maintain it in your own repository and explain the existing limitations. If people find it useful they may ask to support the writer API.

rajeshksv · 2024-10-07T04:40:55Z

Any plans of supporting Pinot in future ?

yurishkuro · 2024-10-07T05:31:12Z

No

jpkrohling added enhancement area/storage labels Jun 29, 2018

olivierboucher mentioned this issue Nov 22, 2018

Bigtable Support #1208

Closed

justinclift mentioned this issue Dec 16, 2018

Is it possible to make yaeger write results (spans) to db and pick them up on launch? yurishkuro/opentracing-tutorial#45

Closed

yurishkuro mentioned this issue Mar 13, 2019

InfluxDB as trace storage backend #272

Closed

sboisson mentioned this issue Mar 21, 2019

ClickHouse as a storage backend #1438

Closed

yurishkuro mentioned this issue May 31, 2019

Couchbase as storage backend #1575

Closed

marius-stanescu-archive360 mentioned this issue Jul 18, 2019

Comsos DB as storage backend #1667

Closed

pavolloffay mentioned this issue Nov 14, 2019

Compatibility with Elasticsearch APM UI #1365

Closed

yurishkuro pinned this issue Nov 28, 2019

yyyogev mentioned this issue Dec 29, 2019

Add logzio storage plugin jaegertracing/documentation#344

Merged

pavolloffay unpinned this issue Feb 6, 2020

pavolloffay pinned this issue Feb 25, 2020

pavolloffay mentioned this issue Jun 16, 2020

can i write jaeger-ingester to an another backend like redis jaegertracing/jaeger-operator#1090

Closed

qiansheng91 mentioned this issue Aug 16, 2021

Support Alibaba Cloud Log Service as storage backend #3172

Closed

yurishkuro mentioned this issue Sep 8, 2021

Define minimum stale time for issues/PRs in SIGs open-telemetry/opentelemetry-specification#1897

Closed

pranay01 mentioned this issue Mar 30, 2022

feat: Enable S3 storage for Clickhouse SigNoz/signoz#812

Closed

evanxg852000 mentioned this issue Apr 4, 2023

Quickwit as a storage backend #4362

Closed

yurishkuro added the meta-issue An tracking issue that requires work in other repos label Feb 3, 2024

Additional storage backends #638

Additional storage backends #638

Comments

yurishkuro commented Jan 8, 2018 • edited Loading

nbettiol commented Jan 9, 2018

black-adder commented Jan 9, 2018

nbettiol commented Jan 9, 2018

fzakaria commented Jan 16, 2018

ringerc commented Feb 3, 2018 • edited Loading

pavolloffay commented Feb 5, 2018

SwarnimRaj commented Jun 29, 2018

wy100101 commented Aug 1, 2018

yurishkuro commented Aug 1, 2018

isaachier commented Aug 1, 2018 • edited Loading

bruth commented Aug 6, 2018

mcarbonneaux commented May 27, 2019 • edited Loading

chvck commented May 31, 2019

omerlh commented Jul 18, 2019

rleiwang commented Oct 2, 2020

DjinNO commented Oct 15, 2020

jkowall commented Apr 20, 2021

yurishkuro commented Apr 20, 2021

em135 commented May 17, 2021

galan commented Jun 1, 2021

muhammadn commented Jun 4, 2021 • edited Loading

qiansheng91 commented Aug 16, 2021

acceptMyPR commented Dec 9, 2021

pavolloffay commented Dec 9, 2021

nitinsaprumaersk commented Mar 17, 2022 • edited Loading

arajkumar commented Oct 21, 2022

nicolastakashi commented Nov 1, 2022

coverthesea commented Dec 11, 2022

vemula-anu commented Mar 3, 2023

yurishkuro commented Mar 3, 2023

diondew commented Apr 19, 2023

jkowall commented Jun 27, 2023

paulgrav commented Oct 31, 2023

joe-elliott commented Nov 15, 2023 • edited Loading

muhammadn commented Feb 25, 2024 • edited Loading

jiajiayang commented Jun 4, 2024

jkowall commented Jun 8, 2024

joker-star-l commented Sep 4, 2024 • edited Loading

yurishkuro commented Sep 4, 2024

rajeshksv commented Oct 7, 2024

yurishkuro commented Oct 7, 2024

yurishkuro commented Jan 8, 2018 •

edited

Loading

ringerc commented Feb 3, 2018 •

edited

Loading

isaachier commented Aug 1, 2018 •

edited

Loading

mcarbonneaux commented May 27, 2019 •

edited

Loading

muhammadn commented Jun 4, 2021 •

edited

Loading

nitinsaprumaersk commented Mar 17, 2022 •

edited

Loading

joe-elliott commented Nov 15, 2023 •

edited

Loading

muhammadn commented Feb 25, 2024 •

edited

Loading

joker-star-l commented Sep 4, 2024 •

edited

Loading