Category: data science

Document Classification With Solr Streaming Expressions

Post author By aadel
Post date November 6, 2019

Classification is one of the most popular tasks in Natural Language Processing and Machine Learning. Solr ships with a subset of Streaming Expressions features that allow building and deploying statistical classification models out of the box. With adequate preprocessing and indexing tweaks, these features can classify documents quickly and with high accuracy. This post illustrates how to build a document classifier with Solr streaming expressions and Zeppelin notebooks.

Tags classification, natural language processing, solr, streaming expressions, text classification, zeppelin

data science novice

Zeppelin Notebooks and Solr

Post author By aadel
Post date October 22, 2019

The concept of data science notebooks has been around for a while. Notebooks are web interfaces for creating and sharing live code, equations, visualizations, and narrative text. They fit into data science workflows at several stages: data cleaning, transformation, numerical simulation, statistical modeling, data visualization, and even machine learning. In a Python environment, Jupyter is the prominent choice. In a Java or Scala environment, Apache Zeppelin fits seamlessly. Although Jupyter can be used with a Java kernel and Zeppelin with a Python interpreter, each natively belongs to its own stack.

Tags apache zeppelin, data science, notebook, solr, visualization

