Overview

Keenious helps students and researchers discover and understand academic literature. This page describes how its search works: how a query is matched against a curated index of ~188 million publications spanning all disciplines and more than 100 languages, and what determines the order of results. The same search engine runs whether you type a query in Search or the AI searches on your behalf in Chat.

How It Differs from a Traditional Search

A Keenious search is designed to give an overview as much as individual results. Instead of one long ranked list to sift through line by line, it returns a set of the best-matching publications and analyses that set as a whole. The results come grouped into research areas, so you see the strongest matches and a picture of the directions the literature takes at the same time. To dig deeper, open the research area that interests you rather than paging further down a list.

The matching is semantic: a query is understood by its meaning, whatever its phrasing. Exact requirements can be added on top with Boolean-style operators — quoted phrases, AND, OR, and NOT — and filters. The same search works as a loose description of a topic or as a tightly constrained query, depending on the task.

Search matches against titles and abstracts, not full text. A publication can discuss a topic in its body without it being mentioned in the title or abstract; such a publication will not match a query on that topic.

Matching

A query is matched in two ways at once. By meaning: every title and abstract in the index is stored as an embedding (a representation of its meaning), and the publications closest in meaning to the query are found, whatever words they use. For example, the query `teenage mental health and social media` surfaces publications about "adolescent well-being and problematic smartphone use". By keyword (BM25, short for "Best Matching 25", a standard keyword-ranking method): the same query's exact words count too, weighted toward distinctive terms, such as gene names, acronyms, and place names, that meaning alone can treat loosely.

Figure: Publications as points in a space of meaning. The query sits among its closest publications, which are retrieved regardless of their wording, while unrelated publications sit far away.
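To make matching by meaning concrete, here is a minimal sketch that ranks publications by cosine similarity between embeddings. The three-dimensional vectors, the titles, and the function names are invented for illustration; real embeddings have far more dimensions, and the similarity measure Keenious uses is an assumption here.

```python
import math

def cosine(a, b):
    """Cosine similarity: 1.0 means same direction, 0.0 means unrelated."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def nearest(query_vec, index, k=2):
    """Return the k titles whose embeddings are closest in meaning to the query."""
    ranked = sorted(index.items(), key=lambda item: cosine(query_vec, item[1]), reverse=True)
    return [title for title, _ in ranked[:k]]

# Toy embeddings (hypothetical values, not produced by any real model).
TOY_INDEX = {
    "adolescent well-being and smartphone use": [0.9, 0.8, 0.1],
    "screen time and depression in teenagers": [0.8, 0.9, 0.2],
    "deep-sea microplastic pollution": [0.1, 0.0, 0.9],
}
query_vec = [0.85, 0.85, 0.1]  # stands in for "teenage mental health and social media"

print(nearest(query_vec, TOY_INDEX))
```

The two publications about teenagers come back first even though neither shares the query's wording, while the unrelated one sits far away.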

A publication that scores well on both meaning and keywords ranks highest, with meaning carrying more weight (the merge is a rank fusion). Quoted phrases, operators, and filters sit on top of all of this as hard requirements — see Search Syntax.
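One common way to merge two rankings is weighted reciprocal rank fusion. The sketch below illustrates that general technique; the weights, the constant `k`, and the function name `fuse` are assumptions, not the values or formula Keenious uses.

```python
def fuse(semantic_ranked, keyword_ranked, w_semantic=0.7, w_keyword=0.3, k=60):
    """Weighted reciprocal rank fusion of two ranked lists of publication ids.

    Each list contributes weight / (k + rank) per publication. The weights and
    k are illustrative assumptions; meaning is simply given the larger weight.
    """
    scores = {}
    for weight, ranking in ((w_semantic, semantic_ranked), (w_keyword, keyword_ranked)):
        for rank, pub_id in enumerate(ranking, start=1):
            scores[pub_id] = scores.get(pub_id, 0.0) + weight / (k + rank)
    # Highest fused score first; ties broken by id for a stable order.
    return sorted(scores, key=lambda pub_id: (-scores[pub_id], pub_id))

semantic = ["A", "B", "C"]   # best match by meaning first
keyword = ["B", "D", "A"]    # best match by keywords first
print(fuse(semantic, keyword))  # ['A', 'B', 'C', 'D']
```

A and B appear in both lists and end up on top; C, found only by meaning, still outranks D, found only by keyword, because meaning carries more weight.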

Ranking

Relevance is how well a publication matches your query — its combined score from the meaning and keyword matching above. Results are ordered by relevance first. Scholarly signals then adjust the order — each a modest adjustment that reorders similarly relevant publications rather than overriding relevance. The signals apply independently and are listed in no particular order:

Citations — more-cited publications rank higher. Citation counts are computed within the curated index, so they can be lower than in databases that count against a broader corpus.
Recency — recent publications receive a small boost that fades with age.
Field-weighted citation impact (FWCI) — citation performance normalized by field, year, and document type, so publications from low-citation and high-citation fields are compared on the same scale.
Peer-review status — venues listed in the Norwegian Scientific Index receive a boost, Level 2 channels more than Level 1. Venues not listed receive no boost, but no penalty either.
Venue and publication type — journal and conference publications rank slightly above works without an identifiable venue; review and research articles rank slightly above other document types.
Language match — publications written in the query's language are boosted. English-language work dominates global citation counts, so without this signal, queries in other languages would return mostly English results.
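The idea of "modest adjustments" can be sketched as small multiplicative boosts on top of relevance. Only two of the signals are shown, and every weight, the logarithmic citation scaling, and the half-life decay are hypothetical choices made for illustration; the actual formula is not described here.

```python
import math
from dataclasses import dataclass

@dataclass
class Pub:
    title: str
    relevance: float   # combined meaning + keyword score, assumed to lie in 0..1
    citations: int
    age_years: float

def adjusted_score(pub, citation_weight=0.02, recency_weight=0.05, half_life=5.0):
    """Relevance nudged by small boosts (all weights are illustrative assumptions)."""
    citation_boost = citation_weight * math.log1p(pub.citations)
    recency_boost = recency_weight * 0.5 ** (pub.age_years / half_life)  # fades with age
    return pub.relevance * (1.0 + citation_boost + recency_boost)

pubs = [
    Pub("a", relevance=0.80, citations=500, age_years=10),
    Pub("b", relevance=0.82, citations=3, age_years=1),
    Pub("c", relevance=0.40, citations=50_000, age_years=1),
]
ranked = sorted(pubs, key=adjusted_score, reverse=True)
print([p.title for p in ranked])  # ['a', 'b', 'c']
```

Publication a overtakes the similarly relevant b thanks to its citations, but the very highly cited c stays last: the boosts reorder near-ties without overriding relevance.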

The Result Set

A traditional keyword database can report a total — "1,247 results" — because a record either matches or it doesn't. Semantic matching has no such boundary: relevance never objectively ends, and any automatic cutoff would be arbitrary. A search therefore returns a fixed-size result set — the 300 best-ranked publications by default, adjustable from 100 up to 10,000. The number of results is a setting, not a measurement of how much literature exists. (One exception: when quotes or filters leave fewer matches than the set size, the smaller number returned is a real count.)

Everything that follows operates on this set. Research areas are computed from it, and sorting reorders it — sort by citations and you get the most cited of those 300, not of the index. The fixed set is also what makes sorting meaningful: without a relevance boundary, the most-cited publications only loosely related to the query would dominate. The size trades focus for coverage — a smaller set stays on the core of a narrow topic, a larger one gives a broader overview and more research areas.
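A small sketch of why sorting happens inside the set: the index is cut to the best-ranked publications first, and only that slice is reordered. The synthetic data and the function names are made up for illustration; only the default of 300 and the bounds of 100 to 10,000 come from the text.

```python
def top_results(ranked_by_relevance, size=300):
    """Keep the `size` best-ranked publications (bounds as stated in the text)."""
    if not 100 <= size <= 10_000:
        raise ValueError("result size must be between 100 and 10,000")
    return ranked_by_relevance[:size]

def sort_most_cited(result_set):
    """Reorder the set by citation count; no publication is added or removed."""
    return sorted(result_set, key=lambda pub: pub["citations"], reverse=True)

# Synthetic index of 1,000 publications, already ordered by relevance (rank 0 is best).
index = [{"rank": i, "citations": (i * 7) % 1000} for i in range(1000)]

result_set = top_results(index)
most_cited_in_set = sort_most_cited(result_set)[0]
most_cited_in_index = max(index, key=lambda pub: pub["citations"])

print(most_cited_in_set["rank"], most_cited_in_index["rank"])  # 285 857
```

The most-cited publication in the whole index sits at relevance rank 857, outside the set, so sorting by citations never surfaces it.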

For tasks that depend on exhaustive, documented retrieval — a systematic review, for example — this matters: a search returns at most 10,000 publications and makes no claim of completeness. Keenious is a starting point for that kind of work, not the search of record.

Every search is saved with a permanent link, and opening that link always shows the same results. Typing the same query again later is a new search — and a new search can return different results, as the index is updated.

Sorting

Results are ordered by relevance by default — the ranking described above. The Sort control reorders the set:

Relevance — best semantic and keyword matches first (the default).
Most cited — highest citation count first.
Newest — most recently published first.

Sorting reorders the publications already in the result set; it does not bring in different ones (see The Result Set).

Research Areas

The results of a search come grouped into research areas — clusters of related publications within the result set. Publications whose meanings sit close together (by the same embeddings used for matching) form an area, and each publication belongs to exactly one. The number of areas scales with the size of the result set, up to 15.

Figure: Result publications shown as points, colored into three labeled clusters. Each cluster of nearby publications is a research area.
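The clustering method is not specified here, so the following sketch uses plain k-means as a stand-in to show the two properties the text does state: each publication lands in exactly one area, and the number of areas is capped at 15. The two-dimensional points and the function names are hypothetical.

```python
def squared_distance(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def cluster(points, k, iterations=10, max_areas=15):
    """Assign every point to exactly one cluster using plain k-means.

    k-means is an assumed stand-in, not necessarily the algorithm Keenious uses.
    """
    k = max(1, min(k, max_areas, len(points)))
    centroids = [list(p) for p in points[:k]]  # deterministic start: the first k points
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each point goes to its nearest centroid.
        labels = [min(range(k), key=lambda c: squared_distance(p, centroids[c])) for p in points]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels

# Toy 2-D "embeddings": two visibly separate groups.
points = [(0, 0), (10, 10), (0, 1), (10, 11), (1, 0), (11, 10)]
print(cluster(points, k=2))  # [0, 1, 0, 1, 0, 1]
```

Because the start is deterministic, the same points always produce the same grouping, in line with the statement that the same search always produces the same areas.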

Each area is named — by a language model — for what sets it apart from the rest of the results, not for the query: a search about social media and teenage mental health gets "Cyberbullying and Online Harassment" and "Screen-Time Interventions", not "Social Media Studies". The names are not a fixed taxonomy, and the same publication can sit under a differently named area in another search.

Selecting research areas filters the result list; it does not change the ranking within them. The areas are computed from the result set itself, so the same search always produces the same areas, and changing the search — the query, the filters, or the result size — recomputes them.

Search Syntax

Example	Effect
`gene editing therapy`	Matched by meaning and keywords (default)
`"CRISPR-Cas9"`	Must appear as an exact phrase in the title or abstract
`arctic ecosystem AND climate change`	Each side of `AND` must appear in the title or abstract, words in any order
`"CRISPR" OR "TALEN"`, `mouse OR mice`	At least one must appear
`NOT mice`, `-mice`, or `-"in vitro"`	Must not appear in the title or abstract
`2021` (a bare four-digit year)	Publications from that year are boosted; others still appear

Operators are recognized in capitals only: lowercase `and`, `or`, and `not` are ordinary words of the query.

Quoted phrases. A quoted phrase is a hard requirement: every result must contain it in its title or abstract. A multi-word phrase must appear as consecutive words in the given order — "gene therapy" matches "a gene therapy trial" but not a publication where gene and therapy only appear in separate sentences. Matching ignores capitalization, accents ("Zurich" matches "Zürich"), and punctuation ("CRISPR-Cas9" also matches "CRISPR Cas9"). It does not ignore word forms: quoted matching has no stemming, so "vaccine" does not match a publication that only writes "vaccines", and "mouse" does not match "mice". Unquoted text is unaffected by this — keyword matching on unquoted words handles word forms, and semantic matching is independent of wording altogether.

A quoted phrase also remains part of the query for semantic and keyword matching; the quotes add the requirement on top rather than replacing the term's role in matching.
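The quoted-phrase rules can be sketched with the standard library: fold case, strip accents, treat punctuation as spaces, and then require the words to appear consecutively with no stemming. The function names are hypothetical, and the exact normalization Keenious applies may differ in detail.

```python
import re
import unicodedata

def normalize(text):
    """Lowercase, strip accents, treat punctuation as spaces; return the word list."""
    decomposed = unicodedata.normalize("NFKD", text)
    no_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return re.sub(r"[^\w\s]|_", " ", no_accents.casefold()).split()

def contains_phrase(phrase, text):
    """True if the phrase's words occur consecutively, in order, in the text.

    Words are compared exactly after normalization, so there is no stemming.
    """
    words, tokens = normalize(phrase), normalize(text)
    n = len(words)
    return any(tokens[i:i + n] == words for i in range(len(tokens) - n + 1))

print(contains_phrase("gene therapy", "A gene therapy trial"))               # True
print(contains_phrase("gene therapy", "Gene delivery. Therapy outcomes."))   # False
print(contains_phrase("Zurich", "A cohort study in Zürich"))                 # True
print(contains_phrase("CRISPR-Cas9", "CRISPR Cas9 screens"))                 # True
print(contains_phrase("vaccine", "New vaccines for children"))               # False
```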

AND. `AND` splits the query into parts that must each appear in the title or abstract: `arctic ecosystem AND climate change` only returns publications whose title or abstract contains all four words. Unlike a quoted phrase, the words of each part can appear anywhere, in any order. The matched words follow the same rules as quoting: capitalization, accents, and punctuation are ignored; word forms are not. The operator itself takes no part in matching: apart from the requirement it adds, `arctic ecosystem AND climate change` and `arctic ecosystem climate change` are the same query.

OR groups. `OR` operates between the terms next to it, quoted or not: in `"CRISPR" OR "TALEN" "off-target"`, at least one of CRISPR or TALEN must appear, and off-target must appear; `mouse OR mice` requires at least one of the two words. Longer chains work the same way (`mouse OR mice OR murine`). Combined with `AND`, an OR group counts as one part: `arctic AND warming OR heating` requires arctic, and at least one of warming or heating.

Exclusions. `NOT term`, `-term`, and `-"phrase"` remove every publication whose title or abstract contains the term. `NOT` is an uppercase alias for the minus form (lowercase `not` is an ordinary word). They follow the same matching rules as quoting, including exact word forms, so `-mouse` does not remove publications that only write "mice". Excluded terms are stripped from the query before matching: they only remove results, they do not influence what the rest of the query matches.

Years. A bare four-digit year (from 1900 up to next year) boosts publications from that year; those from other years still appear, without the boost. Several years can be given (`2023 2024`), and the year also remains part of the ordinary query text. A hard cutoff is a job for the year filter, not the query.
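Detecting bare years can be sketched as follows. The function name and the `today` parameter are hypothetical, and treating a year inside quotes or attached to other characters as "not bare" is an assumption about how the rule is applied.

```python
import re
from datetime import date

def boosted_years(query, today=None):
    """Return the bare four-digit years in a query, from 1900 up to next year."""
    next_year = (today or date.today()).year + 1
    # A bare year is a whitespace-separated token made of exactly four digits.
    candidates = [int(tok) for tok in query.split() if re.fullmatch(r"[0-9]{4}", tok)]
    return [year for year in candidates if 1900 <= year <= next_year]

fixed_today = date(2025, 6, 1)  # fixed date so the example is reproducible
print(boosted_years("covid vaccines 2023 2024", today=fixed_today))  # [2023, 2024]
print(boosted_years("war of 1812", today=fixed_today))               # []
print(boosted_years("forecast 2026", today=fixed_today))             # [2026]
```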

Operators combine freely: `gene editing "CRISPR-Cas9" -mice` matches the concept gene editing semantically, requires CRISPR-Cas9 in the title or abstract, and removes publications mentioning mice. All quoted, required, and excluded terms are matched against titles and abstracts only: a quoted term excludes any publication that does not use it in those two fields, even if the term appears in the publication's full text.
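The operator rules can be summarized in a small parser sketch that turns a query into required groups (each a list of alternatives) and excluded terms. It is an illustration of the rules as described, not the actual parser; the function name, the returned structure, and the handling of edge cases are assumptions.

```python
import re

# A token is an optionally negated quoted phrase, or a run of non-space characters.
TOKEN = re.compile(r'-?"[^"]*"|\S+')

def parse(query):
    """Split a query into hard requirements and exclusions (illustrative only)."""
    tokens = TOKEN.findall(query)
    has_and = "AND" in tokens            # operators count in capitals only
    groups, excluded = [], []            # each group: {"terms": [...], "quoted": bool}
    join_next = negate_next = False
    for tok in tokens:
        if tok == "AND":
            continue
        if tok == "OR":
            join_next = bool(groups)     # attach the next term to the previous group
            continue
        if tok == "NOT":
            negate_next = True
            continue
        minus = tok.startswith("-") and len(tok) > 1
        body = tok[1:] if minus else tok
        quoted, term = body.startswith('"'), body.strip('"')
        if negate_next or minus:
            excluded.append(term)
        elif join_next:
            groups[-1]["terms"].append(term)
            groups[-1]["quoted"] = groups[-1]["quoted"] or quoted
        else:
            groups.append({"terms": [term], "quoted": quoted})
        join_next = negate_next = False
    # Quoted terms and OR groups are always required; with AND, every part is.
    required = [g["terms"] for g in groups if has_and or g["quoted"] or len(g["terms"]) > 1]
    return {"required": required, "excluded": excluded}

print(parse('arctic AND warming OR heating'))
print(parse('gene editing "CRISPR-Cas9" -mice'))
```

The first query yields two required groups, `['arctic']` and `['warming', 'heating']`; the second requires only the quoted term and excludes mice, leaving gene editing to the meaning and keyword match.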

Filters (publication year, document type, peer-review status, open access, and others) are hard constraints rather than ranking signals: a filtered-out publication is removed before ranking and will not appear regardless of how well it matches.
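The difference between a hard constraint and a ranking signal can be sketched in a few lines: filters run first and remove publications outright, and only the survivors are ranked. The field names, the example records, and the function name are hypothetical.

```python
def search(publications, filters, score, size=300):
    """Drop filtered-out publications first, then rank only the survivors."""
    candidates = [pub for pub in publications if all(keep(pub) for keep in filters)]
    return sorted(candidates, key=score, reverse=True)[:size]

pubs = [
    {"title": "A", "year": 2015, "open_access": True, "relevance": 0.95},
    {"title": "B", "year": 2022, "open_access": True, "relevance": 0.70},
    {"title": "C", "year": 2023, "open_access": False, "relevance": 0.90},
]
filters = [lambda p: p["year"] >= 2020, lambda p: p["open_access"]]

results = search(pubs, filters, score=lambda p: p["relevance"])
print([p["title"] for p in results])  # ['B']
```

A and C are the better matches, but neither passes both filters, so neither appears, however well it scores.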

Example Searches

Two ways to search, mixed freely in one query: describe a topic in plain language, or require exact terms with operators.

Describe a topic. Plain language, matched by meaning — the wording need not match the publications'.

Search	What it does
`how does sleep quality affect memory`	A question in plain words; finds publications on sleep and memory consolidation however they phrase it
`microplastic pollution in deep sea ecosystems`	A short topic description
`why bee populations are declining`	Surfaces publications on pollinator decline and colony collapse, though the query names neither

Require exact terms. Operators add hard requirements on top of the meaning match.

Search	What it does
`"CRISPR-Cas9" gene therapy`	Gene-therapy context by meaning, but CRISPR-Cas9 must appear exactly
`physical activity AND diabetes AND "older adults"`	All three required; the last as an exact phrase
`"CRISPR" OR "TALEN" OR "zinc finger"`	At least one of the three must appear
`jaguar speed -car`	The animal, not the vehicle
`coral reefs NOT aquarium`	Coral reefs, excluding publications about aquariums (`NOT` is the same as `-aquarium`)
`arctic AND warming OR heating`	arctic required, plus at least one of warming or heating

Meaning or exact words. The two retrieve differently. The query `the effects of social media on teenage mental health` is understood by meaning, surfacing publications that write "adolescent well-being" or "screen time and depression" without the query's words. `"social media" AND "mental health"` returns only publications whose title or abstract contains both phrases: more precise, but it misses those paraphrases. Matching is over titles and abstracts only, so exact requirements narrow the results faster here than in a full-text database; add them one at a time when meaning alone returns too much.

Why a Publication Does or Doesn't Appear

A result looks off-topic. Semantic matching retrieves by closeness of meaning, and the matched concept is normally visible in the result's title or abstract. Quoting a term that must appear, or excluding one that shouldn't, narrows the results.

An expected publication is missing. Common causes:

The wording only appears in the full text. Titles and abstracts are the searchable fields; a topic that is not visible there does not match.
A quoted or required term is not in the title or abstract. Quoted phrases and AND parts are hard requirements. Without them, the publication can still match semantically.
A filter excludes it. A year range or peer-review filter removes everything outside it.
It is not in the index. Book chapters, master's theses, meeting abstracts, retracted works, and several other source types are excluded when the index is built from the OpenAlex dataset.
It is outside the result set. A search returns a fixed number of best-ranked publications (see The Result Set). A publication can match without making the cut — a more specific query or a larger result size brings it in.
It is very new. The index synchronizes with OpenAlex regularly; a publication from the last few days may not be indexed yet.

Frequently Asked Questions

Does Keenious search the full text of publications? No — titles and abstracts only. A publication whose topic is visible only in its body will not match a query on that topic.

Does it matter whether I write `and` or `AND`? Yes. In capitals, `AND`, `OR`, and `NOT` are operators (require both sides, allow either, exclude); in lowercase they are ordinary words of the query. See Search Syntax.

Can the AI make up a publication? No. Every result is a publication from the index. AI is used to match queries by meaning and to name research areas — never to generate results.

Why did I get fewer results than the size I chose? Quoted terms or filters left fewer publications than the set size — in that case the number shown is a real count of matches. See The Result Set.

If I run the same search next month, will I get the same results? Not necessarily. Repeating a search within a short period gives the same results, although the research area names may differ. Repeating it weeks or months later can return different results, as the index is updated with new and corrected records. To return to exactly the same results, open the saved search's permanent link (see The Result Set).

How Search Works in Keenious