Search Engine Technologies ( E-Commerce Search, Natural Language Processing, Solr, Lucene, Elasticsearch, etc)
Search Engines, Architecture, Information Retrieval, Solr, Lucene, Elasticsearch, Natural Language Processing, Rank, Relevance, etc.
Scooped by Carlos Sponchiado (Sponch)

Tools for Monitoring Elasticsearch Performance

We invite you to take a moment of your day to acquaint yourself with the best monitoring tools for Elasticsearch performance.
Rescooped by Carlos Sponchiado (Sponch) from BigData NoSql and Data Stuff

ElasticSearch Query: Performance Optimisation

In one of my previous posts on Elasticsearch, I shared my understanding of Elasticsearch configurations and best practices. That was mostly from an indexing perspective. There are several tweaks one can use to optimise query performance as well. Improving query times can be even more challenging than improving indexing times. Let's see why querying is more of a challenge:

- Queries can go on while the index is getting updated
- Different queries would need different strategies for optimisation
- There are far more configurations that impact query performance:
  - Query syntax/clauses used
  - Index schema
  - Elasticsearch configurations
  - RAM, CPU, Network, IO

There are also times when you need to fire two or more queries in succession to get certain results back from ES. I recently had one such scenario, where I needed to fire 3 queries at ES and make sure that the response times were always less than a second. The 3 queries were related to each other in the sense that query 2 uses the output of query 1, and query 3 uses the output of query 2. For my use case, one of the queries was simple, while the other two were more complex, as they had aggregations, stats, filters, etc.
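The chained pattern described above (each query consuming the previous query's output, with a latency budget) can be sketched roughly as follows. This is a minimal illustration, not the author's code: `run_chain`, `within_budget`, and the three `query_*` stand-ins are hypothetical names, the stand-ins return canned data instead of calling Elasticsearch, and applying the one-second budget to the whole chain rather than to each query is an assumption.

```python
import time
from typing import Any, Callable, List, Tuple


def run_chain(steps: List[Callable[[Any], Any]], seed: Any = None) -> Tuple[Any, List[float]]:
    """Run dependent steps in order, feeding each result into the next.

    Returns the final result and the wall-clock latency of each step in seconds.
    """
    latencies: List[float] = []
    result = seed
    for step in steps:
        start = time.perf_counter()
        result = step(result)
        latencies.append(time.perf_counter() - start)
    return result, latencies


def within_budget(latencies: List[float], budget_seconds: float = 1.0) -> bool:
    """True when the summed latency of the chain fits the budget (assumed: total, not per query)."""
    return sum(latencies) <= budget_seconds


# Hypothetical stand-ins for the three Elasticsearch queries.
def query_1(_: Any) -> List[int]:
    return [1, 2, 3]                      # e.g. ids matched in index A


def query_2(ids: List[int]) -> List[int]:
    return [i * 10 for i in ids]          # e.g. values looked up in index B


def query_3(keys: List[int]) -> dict:
    return {"hits": len(keys), "keys": keys}  # e.g. final documents from index C


result, latencies = run_chain([query_1, query_2, query_3])
print(result, within_budget(latencies))
```

Recording latency per step, rather than only the total, makes it easier to see which of the dependent queries dominates the response time.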

As outlined above, several things can prevent an optimal response time. Also, to safely say that a desired response time has been achieved, one needs to test, and test properly. A poor testing method leads to misleading performance statistics. Below are details of my testing methodology and the tweaks that led to sub-second response times for the 3 queries.

ElasticSearch Cluster and Indexes

- 5 machines in the cluster
- 5 shards per index
- 250 GB EBS volume on each machine to hold indexes
- Indexes are stored compressed
- No indexing takes place while testing (my use case asks for indexing in batch once a day)
- 3 indexes
  - Index A: 24+ million records (used in 1st query)
    - All integer fields
    - 4 fields
  - Index B: 90+ million records (used in 2nd query)
    - All integers
    - 3 fields
  - Index C: 340K records (used in 3rd query)
    - String, Integer and Date fields
    - Only a few fields used in querying
- Different machine types:
  - To hold ES indexes: m3.large to c3.4xlarge
- RAM
  - Different sizes for tests, from 4 GB to 15 GB given to the ES instance
Via Alex Kantone

Global Languages Support at Netflix - Testing Search Queries

Globalization at Netflix. Having launched the Netflix service globally in January, we now support search in 190 countries. We currentl

Autocomplete Using Elasticsearch - DZone Big Data

We take a look at how to implement autocomplete using Elasticsearch and nGrams in this post. Read on for more information.
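To give a feel for the idea behind n-gram based autocomplete, here is a small pure-Python sketch. It uses edge n-grams (prefixes only), which is one common variant and not necessarily the exact configuration used in the linked post; `edge_ngrams`, `build_index`, and `suggest` are hypothetical helper names, and no Elasticsearch API is involved.

```python
from collections import defaultdict
from typing import Dict, List, Set


def edge_ngrams(term: str, min_gram: int = 1, max_gram: int = 10) -> List[str]:
    """Return the lowercase prefixes of `term` with length min_gram..max_gram."""
    term = term.lower()
    upper = min(max_gram, len(term))
    return [term[:n] for n in range(min_gram, upper + 1)]


def build_index(terms: List[str], min_gram: int = 1, max_gram: int = 10) -> Dict[str, Set[str]]:
    """Map every edge n-gram to the set of original terms that produce it."""
    index: Dict[str, Set[str]] = defaultdict(set)
    for term in terms:
        for gram in edge_ngrams(term, min_gram, max_gram):
            index[gram].add(term)
    return index


def suggest(index: Dict[str, Set[str]], typed: str) -> List[str]:
    """Look up what the user has typed so far as a single exact key."""
    return sorted(index.get(typed.lower(), set()))


index = build_index(["Elasticsearch", "Elastic", "Lucene", "Solr"])
print(suggest(index, "el"))
```

The work happens at index time: each term is expanded into its prefixes once, so a lookup is a single exact match on the typed text. In this sketch, input longer than `max_gram` matches nothing, which is the trade-off for a bounded index size.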

A Complete And Easy Guide To Check Elasticsearch - DZone Big Data

As Ops/DevOps, we are usually more concerned about cluster health and the data inside. This article is a simple step-by-step guide to check Elasticsearch, cluster, nodes, shards, indices, documents, and more.
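As a rough illustration of the kind of cluster check such a guide covers, the sketch below reads the response of Elasticsearch's `_cluster/health` endpoint and condenses it to one line. The base URL `http://localhost:9200` is an assumption, and `fetch_cluster_health`, `summarize_health`, and `needs_attention` are hypothetical helper names; only the `status`, `number_of_nodes`, and `unassigned_shards` fields of the response are used.

```python
import json
from urllib.request import urlopen


def fetch_cluster_health(base_url: str = "http://localhost:9200") -> dict:
    """GET <base_url>/_cluster/health and return the parsed JSON body.

    The default base URL is an assumption; point it at your own cluster.
    """
    with urlopen(f"{base_url}/_cluster/health") as response:
        return json.load(response)


def summarize_health(health: dict) -> str:
    """Condense a cluster health response into a single line."""
    status = health.get("status", "unknown")
    nodes = health.get("number_of_nodes", "?")
    unassigned = health.get("unassigned_shards", "?")
    return f"status={status} nodes={nodes} unassigned_shards={unassigned}"


def needs_attention(health: dict) -> bool:
    """Treat anything other than a green status as worth a closer look."""
    return health.get("status") != "green"


# Example with a hand-written response, so no running cluster is required.
sample = {"status": "yellow", "number_of_nodes": 5, "unassigned_shards": 2}
print(summarize_health(sample), needs_attention(sample))
```

Against a live cluster, `summarize_health(fetch_cluster_health())` would produce the same one-line summary.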

Shay Banon - ElasticSearch: Big Data, Search, and Analytics

In this session we will explore elasticsearch, specifically, how to handle huge amount of data with it, how to effectively search it, and last, use facet

Google Search Operators - Google Guide

search operators that work with each Google search service

Webinar: OpenNLP and Solr for Superior Relevance


NYC Apache Lucene/Solr Meetup

Search Accuracy Metrics & Predictive Analytics – A Big Data Use Case

Wednesday, Sep 14, 2016, 6:30 PM

With Big Data, it is now possible to harvest user event data, such as search logs and click logs, for the purpose of computing user-based search accuracy metrics. These metrics can determine how well your search engine is satisfying the needs of your user population. Citing examples from recent eCommerce and Recruiting industry engagements using So...
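The talk description does not say which metrics are computed, so the following is only an illustrative sketch of two common click-log metrics: click-through rate and mean reciprocal rank (MRR). The input format (one entry per search, holding the 1-based rank of the first clicked result, or `None` when nothing was clicked) and the function names are assumptions.

```python
from typing import List, Optional


def click_through_rate(first_click_ranks: List[Optional[int]]) -> float:
    """Fraction of searches that received at least one click."""
    if not first_click_ranks:
        return 0.0
    clicked = sum(1 for rank in first_click_ranks if rank is not None)
    return clicked / len(first_click_ranks)


def mean_reciprocal_rank(first_click_ranks: List[Optional[int]]) -> float:
    """Average of 1/rank of the first click; searches with no click count as 0."""
    if not first_click_ranks:
        return 0.0
    total = sum(1.0 / rank for rank in first_click_ranks if rank is not None)
    return total / len(first_click_ranks)


# Four searches: clicks at ranks 1, 2 and 4, and one search with no click.
log = [1, 2, None, 4]
print(click_through_rate(log), mean_reciprocal_rank(log))
```

A higher MRR means users tend to find something worth clicking nearer the top of the result list, which is one way to read search logs as an accuracy signal.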


Options to TF-IDF similarity - Apache Solr Search Patterns - Kumar, Jayant - RuTLib.com - Your Home Library


Merge Policy Internals in Solr - DZone Java

Last week, a colleague asked me a really simple question about segments merging in Solr. After discussing the answer for some minutes while playing aroun

Search & Big Data Analytics in 2017: 5 Hot Topics

Search and big data analytics have evolved significantly over the last few years, and organizations are increasingly using these technologies to meet their mission-critical needs. At the beginn

Daria Sorokina - Amazon Search: The Joy of Ranking Products - MLconf SF 2016

Amazon Search: The Joy of Ranking Products: Amazon is one of the world’s largest e-commerce sites and Amazon Search powers the majority of Amazon’s sales

Presentations Lucene / Solr Revolution 2016

Read and download presentations by Lucidworks

Q&A: Relevant Search with Elasticsearch and Solr

In their book "Relevant Search", Doug Turnbull and John Berryman focus on the challenge of providing search results by balancing the needs and intents of the user. Using Elasticsearch and Solr, relevance engineers can constantly tune the needs of the business vs. the needs of the user.

Elastic{ON} Videos | Elastic

The premier Elasticsearch, Logstash, and Kibana conference for the Elastic data platform.

Online linguistic search engine Ludwig helps get your English on

If you've ever tried to write in a language that isn't your own, you are probably well aware of at least two things: first, writing requires much more time..

Webinar: OpenNLP and Solr for Superior Relevance

Lucidworks Senior Software Engineer and Solr Committer Steve Rowe explains how to increase relevance using Solr with Apache OpenNLP.