Lexile Analyzer® User Guide

This guide provides an overview for how to use all subscriptions to the Lexile Analyzer® including the Professional Lexile Analyzer® and the Lexile Analyzer® Editor Assistant™. The Lexile Analyzer® is used to evaluate the text complexity of books, articles, passages, and other texts to report its reading demand as a Lexile® text measure. The Lexile Analyzer® is a web-based tool that determines the Lexile® measure of professionally edited, complete, conventional prose text. The accuracy of the Lexile® measures produced from the Lexile Analyzer® depends on your following the text-preparation procedures and formatting conventions detailed in this guide.

In this Guide...

What Type of Texts Can Receive a Lexile Measure? top

Certain categories of text should not be measured as a Lexile measure. Because The Lexile® Framework for Reading was built upon the measurement of professionally edited, complete, conventional prose text, the Lexile Analyzer will return an inaccurate Lexile measure for other kinds of text. Follow these guidelines as you choose texts to measure:

    *Note
  • There are a few exceptions to this list. For example, narrative poetry that is conventionally punctuated may be measured with the Lexile Analyzer. Text written in rhyme does not necessarily qualify as poetry and may still be measured.
  • Texts that are appropriate for use with the Lexile Analyzer will still require text editing, as detailed in “Step 2: Prepare Your Text for the Lexile Analyzer" section.

Text Preparation Guidelines top

Please observe the following text-preparation guidelines before you submit your sample file to the Lexile Analyzer.

Here are some guidelines for removing non-prose text:

Text Preparation Considerations top

Additional considerations should be made when editing a text for measurement. Historical notes, introductions, “About the author" pieces, and previews of the next book in a series should typically be removed. Such text is often written separately from the main text and thus contains unique textual characteristics that can influence the Lexile measure. However, such decisions should be carefully considered while preparing your text for analysis. Some frontmatter and backmatter may be a legitimate part of the larger text and should be included. As a general guideline, if text appears to be written by the same author for the same audience, then it should be included in the Lexile analysis.

For Children's Books top

In the layout of children’s picture books, single sentences are sometimes distributed across multiple pages of a book. In the activity of reading, these page breaks function as sentence endings, so a pipe character (|) should be inserted at each page break in your file. The Lexile Analyzer interprets pipes as sentence-ending punctuation. In the example below from Ludwig Bemelmans’ Madeline (Puffin Books), a pipe would be placed in the plain text file after the words “crank" to emulate the effect that page breaks have on reading, and the comma would be removed.

Web Resources top

When using resources downloaded from websites, be sure to remove the non-prose and web page-specific elements, as indicated in the example below from a CNN.com article:

Only the main body of the article (G) and the complete-sentence figure caption (D) should be measured. The article title (A), date line (B), and image (C), as well as website-specific elements such as social sharing links (E) and margin advertisement text (F), should not be measured.

Also be careful to eliminate all HTML code and URLs from your sample file when measuring web resources.

Using the Lexile Analyzer

Step 1: Make Sure Words & Sentences Are Recognizable top

If you typed your text in the text editing box or if the words and sentences in your file are recognizable, proceed to step 2.

If the text you are trying to measure is in an image file or part of printed material, you can scan the pages, save them as a PDF file, and load them into an optical character recognition (OCR) program. There are several different OCR programs you can use to convert an image to text, including ones offered by Acrobat and ABBYY FineReader.

Occasionally OCR results can be inconsistent, particularly involving punctuation marks such as periods not being recognized at all (see below). These inconsistencies will impact the accuracy of the Lexile measure. Unfortunately, the repair of a poorly OCRed file can take as long as typing the text.

If a text is converted from hard copy to electronic format using an OCR application, some problems may occur in the conversion process. These tend to relate to the specific software used, and special care should be taken to ensure the accuracy of the electronic facsimile.

    Some examples of common OCR errors are as follows:
  • A letter “m" might convert as “rn."
  • A comma followed by a quotation mark (,") might be interpreted as a slash w/ an apostrophe (/’).
  • Verify that all the intended punctuation is in place—no periods missing, semicolons omitted, etc.
  • If a polysyllabic word is split between two lines with a hyphen, the hyphen should be removed and the word made whole.
  • Given the near limitless possibilities of language usage and layout, these examples should not be considered exhaustive. Rather, they should be seen as representative of the kinds of things that should be considered when preparing a text for measurement.

Step 2: Prepare Your Text for the Lexile Analyzer top

The Lexile Analyzer is designed to measure professionally-edited, complete, conventional prose text. It should not be used on non-prose, unpunctuated, or unconventional text. The Lexile Analyzer determines sentence length through recognition of sentence endings, so sentences must be conventionally punctuated to be recognized (refer to the FAQ How does the Lexile Analyzer work? for sentence ender information). Likewise, the Lexile Analyzer determines word frequency by recognizing correctly spelled, well-formed words. Otherwise, the Lexile Analyzer will not return a useful estimated Lexile measure.

    In preparing a file for measurement, your two basic objectives are to:
  1. Preserve the prose sentences and the words within them in your text.
  2. Remove non-prose content from your text before you analyze it.

You should keep in mind that the usefulness of an estimated Lexile measure depends on the proper preparation of a text for analysis. Seemingly minor errors can result in significant variation in Lexile measures.

Convert Your Text for Lexile Analysis top

The Lexile Analyzer requires a UTF-8 plain text document (*.txt file) for proper processing and Lexile measurement. A plain text file is one which uses only the basic UTF-8 character set and contains no special formatting. If you submit files of an incorrect format to the Lexile Analyzer, an incorrect Lexile measure will be returned.

Note: The Lexile Analyzer cannot measure Microsoft Word, PDF, HTML or scanned image files such as JPGs.

If the source text to be measured is in an electronic document format, such as a word processing document or a rich text document, the file usually can be converted into the plain text format using the settings in the application’s Save As… menu.

    Converting from Microsoft Word on Windows
  1. With your document open, select Save As… from the File menu.
  2. In the Save as type drop-down box, select Plain Text (*.txt).
  3. Click the Save button and a File Conversion window opens:
  4. Click the Other encoding radio button and select Unicode (UTF-8) from the list of formats to the right.
  5. Also check Allow character substitution.
  6. Click the OK button.
  7. You have saved your document in the plain text format for the Lexile Analyzer.
    Converting from Microsoft Word on Macintosh
  1. With your document open, select Save As… from the File menu.
  2. In the Save as type drop-down box, select Plain Text (*.txt).
  3. If this warning box appears, click Yes to save the file and disregard the remainder of this procedure:
  4. Otherwise, a File Conversion window opens:
  5. Click the Other encoding radio button and select Unicode (UTF-8) from the list of formats to the right.
  6. Also check Allow character substitution.
  7. Click the OK button.
  8. You have saved your document in the plain text format for the Lexile Analyzer.

It is advisable to open your plain text document to check it for conversion errors. Pay attention to special characters such as quotation marks, apostrophes, ellipses, or accented characters. Also, ensure that em-dashes and en-dashes have appropriate spacing on either side. They will be converted to hyphens, which can make the Lexile Analyzer interpret the two words on either side of an em-dash as a single hyphenated word unless there are spaces on either side of the hyphen.

Step 4: Analyze your file! top

    You have three options to submit text for analysis:
  1. Use the Inline Text Editing Box and type in the text you’d like measured
  2. Upload the .txt file you’d like measured
  3. Copy and paste text from either a Word or text file

Step 5: View & Interpret Your Results top

Select the “Analyze" button and your Lexile Analyzer results appear on the screen. The type of subscription to the Lexile Analyzer that you have will determine the data and features you have access to. All subscriptions provide access to the Lexile measure, Mean Sentence Length, Mean Log Word Frequency, and Word Count. For the descriptions of additional data and features displayed for the Professional Lexile Analyzer and Lexile Analyzer Editor Assistant, see the informational pop-up boxes on the results page.

The Professional Analyzer maintains a log history and the Lexile Analyzer Editor Assistant provides a document management system with features like category tagging and editing history. To learn more about the type of subscriptions and what features they offer, see the FAQ What type of subscriptions are available for the Lexile Analyzer?

    Lexile Analyzer Results Lexile Analyzer results are provided in four categories:
  • Lexile measure – This value indicates the reading demand of the text in terms of the semantic difficulty and syntactic complexity. The Lexile scale generally ranges from 0L for beginning reader texts to 1600L for advanced texts.
  • Word Count – This value reflects the total number of words in the text that was analyzed.
  • Mean Sentence Length – This value is the average length of a sentence in the text, based on the sentences that were analyzed.
  • Mean Log Word Frequency – This value is the logarithm of the number of times a word appears in in the MetaMetrics corpus of over 1.4 billion words. The value should be understood to mean how often a word occurs per 5 million words. The mean log word frequency is the average of all such values for words which appear in the text being analyzed.

Lexile Usage Conventions top

    When you using the Lexile Analyzer, please note that:
  • “Lexile measure" should always have a capital “L."
  • “Lexile measure" should always have a lower case “m."
  • Refer to “The Lexile® Framework for Reading" (with the registered trademark symbol) the first time that it is mentioned, and then “the Lexile Framework" henceforth.
  • Lexile measures are reported as a number followed by a capital “L" (for Lexile measure). There is no space between the number and the “L" and Lexile measures of 1000 or greater are reported without a comma (e.g., 1050L). All Lexile measures are rounded to the nearest 10L to avoid over-interpretation of the measures.
  • We refer to a Lexile “zone" or “level" as representing the bands on the Lexile® map (e.g., the “700L Zone" goes from 700L to 790L).
  • We refer to a Lexile range as the suggested range of Lexile measures that a reader should read. The Lexile range for a reader is 50L above to 100L below his or her reported Lexile measure. This takes into account measurement error found in the tests administered to students and in the automated measurement of the books. If a student attempts material above his or her Lexile range, the level of challenge may be too great for the student to be able to independently construct very much meaning from the text. Likewise, material below the reader’s Lexile range will provide that student with little comprehension challenge. Material above or below a reader’s Lexile range can be used for specific instructional purposes.
  • Beginning Reader Code: Beginning Reader (BR) is a code given to both texts and students that are below 0L. A Lexile measure of BR100L indicate that the Lexile measure is 100 units below 0L. Just like -10° is higher (warmer) than -30° on a thermometer, a BR100L text is more complex than a BR300L text.
    • If you are an educator or researcher and want to learn more about Lexile measures and beginning readers, visit: Beginning Readers.
    • If you are a company or organization that has partnered with us and would like help communicating the advantages of using Lexile measure in the early grades, check out our Beginning Reader Communications Toolkit.
  • If you are a company or organization and have an agreement with us to use Lexile measures in your product or service, access our branding guidelines at www.metametricsinc.com/branding-guidelines

Frequently Asked Questions

How do I find the Lexile measure of a book that has been measured? top

    There are two ways to find Lexile measure of a book:
  1. You can search for a specific title using the Quick Book Search on Lexile.com. You can also build book lists based on Lexile measure and interests using the “Find a Book" tool on Lexile.com.
  2. If you are looking for Lexile measures for a larger collection of titles for your library the Lexile Titles Database™ is a good resource. The Lexile Titles Database contains certified Lexile measures for over 260,000 English fiction and nonfiction trade book titles from hundreds of publishers. To learn how to license the Lexile Titles Database to provide Lexile measures for titles in your product or service, visit Licensing the Lexile Titles Databse.

How does a book receive a Lexile measure? top

MetaMetrics (developers of the Lexile Framework) measures a book at a publisher’s request. Books are always measured in their entirety. Publishers pay for this service, as well as the right to use the Lexile measure in their marketing materials.

In order to ensure the most accurate Lexile measure, MetaMetrics’ text measurement process includes the following steps, with quality checks at each stage:

How do books receive Lexile certified measures? top

Several publishers and content developers use the Professional Lexile Analyzer and Lexile Analyzer Editor Assistant to help write text at certain reading levels and measure their texts developmental level before submitting for a certified Lexile measure. Upon submission to MetaMetrics, these files have been prepared using our text-preparation guidelines. These files are reviewed by our resource measurement coordinators to assure that the editing guidelines have been met. They are then submitted for Lexile code review and Lexile measures are returned to the publisher. It is only after review by MetaMetrics’ resource measurement team that these measures are deemed “certified" and then available for distribution via marketing materials, websites, and searchable in “Find a Book."

How does the Lexile Analyzer work? top

The Lexile Analyzer works in steps: first by calculating a number of text complexity variables for a given text such as the length of sentences. Text complexity variables are then combined to calculate the Lexile measure of the text. The Lexile measure is an indication of the degree of challenge a particular text will pose for a particular reader. Lexile measures are units of measure just like degrees Fahrenheit.

Like the measurement of temperature, the measurement of text complexity requires different measurement instruments at different locations on the scale. For example, you would use a different instrument to measure a person’s body temperature than you would to measure the temperature inside your oven, but each instrument still provides measures on the same scale.

Similarly, for texts used in the early grades (generally texts intended for grades K thru 2), different text complexity variables are required to provide the most accurate measurement possible. For texts intended for readers above grade 2, a Lexile measure has a semantic and syntactic component represented by MLF (Mean Log Frequency) and LMSL (Log Mean Sentence Length) respectively. Research has shown that word frequency can be used as a proxy for vocabulary difficulty, and sentence length can be used as a proxy for sentence complexity. Word frequency is calculated using the MetaMetrics corpus of over 1.4 billion words intended for K–12 readers. For texts intended for readers in grades 2 and below, nine variables are used to measure four aspects of text important for early-reading: structure, syntax, semantics, and decoding. Structure refers to the systematic patterning and repetition often found in early-grades texts, but not typically found in texts for older readers. Decoding refers to the complexity of the orthographic patterns in the text and is related to the important task of learning to decode printed text into spoken or silently read words.

When a text is analyzed, all of its text complexity variables are calculated and a determination is made automatically as to whether the early-reading text complexity variables are required for analysis. For texts 650L and below, four additional early-reading indicators, relevant for early-grades texts, are reported: structure, syntax, semantics, and decoding.

Consequently, when using the Lexile Analyzer to measure text, you should keep in mind two keys to getting an accurate Lexile measure:

  • All sentences must be automatically recognizable (capital letter at the beginning; end-punctuation at the end; no unconventional spacing or punctuation)
    • End-punctuation recognized by the Lexile Analyzer includes the following: period (.), question mark (?), exclamation point (!), colon (:), semi-colon (;), and ellipses (...)
  • All words must be automatically recognizable (correct spellings, spacing, and punctuation)

What type of editorial errors affect the Lexile Measure of a text? top

Proper file preparation, as detailed in the earlier section “Step 2: Prepare your text for the Lexile Analyzer" is the crucial step for ensuring Lexile measurement accuracy. File preparation errors or oversights, such as missing or incorrect punctuation or sections of unconventional prose or non-prose, may compromise your Lexile Analyzer results and return an estimated Lexile measure too far from the actual Lexile measure to be of use to you.

The measurement impact of editing errors and oversights is more severe the shorter the length of the input file. For this reason, special attention is encouraged when preparing a short passage, article, or children’s text for analysis.

What type of subscriptions are available for the Lexile Analyzer? top

There are two Lexile Analyzer products offered on a paid subscription basis. These products are intended for commercial education companies and content developers. The products include:

  • The Professional Lexile Analyzer®: With the Professional Lexile Analyzer, there are no word limit restrictions for measuring text. The Professional Lexile Analyzer offers early-reading indicator descriptors (for texts with Lexile measures that are 650L and below), as well as a Lexile history log of each measured text.
  • The Lexile Analyzer® Editor Assistant™ is also available on a paid subscription basis. This product is ideal for content developers or companies trying to produce content at specific reading levels. In addition to the features of the Professional Analyzer, the Lexile Analyzer Editor Assistant also offers:
    • A document management system which facilitates file sharing and category tagging between users
    • Editing history
    • Search capabilities
    • Key information on text features that could present more or less of a challenge in early-reading content.

If you are interested in using the Lexile Analyzer for commercial purposes, visit Licensing the Professional Lexile Analyzer.

How do I know which subscription of the Lexile Analyzer I have access to? top

Once you are logged into your account, click on My Account on the top menu bar.

The next screen will show you what the products to which you have access.

You can also see your subscription type on the upper left hand corner once you are logged into your account. For example, if you have access to the Lexile Analyzer Editor Assistant, you will see this: