CONTENT ANALYSIS AND TEXT-MINING TOOL FOR STATA
Stata is a complete, integrated statistical software package created by StataCorp LP (www.stata.com). It provides a wide range of statistical analysis, data management, and graphics. Released in June 2013, version 13 added many new features, including a long string data type allowing one to store along with numerical and categorical data, documents up to 2 billion characters. One could thus create a statistical database with journal abstracts, news transcripts, patents, incident reports, customer feedbacks, interviews and so on.Wordstat-box
WordStat for Stata was created to allow Stata 13 and Stata 15 users running under Windows, to apply text analytics techniques on any string variables stored in a Stata data file. WordStat combines natural language processing, content analysis and statistical techniques to quickly extract topics, patterns and relationships in large amount of text. It can process millions of words in seconds and compare extracted themes across any other numerical, categorical or date variables in the Stata file.
WHAT IT IS USED FOR?
WordStat can be used by anyone who needs to quickly extract and analyze information stored in Stata text variables. It may be used for:
Business intelligence and competitive web sites analysis
Information extraction and knowledge discovery from incident reports, customer complaints
Content analysis of news coverage or scientific literature (scientometrics or bibliometrics studies)
Automatic tagging and classification of documents
Fraud detection, authorship attribution, patent analysis
Taxonomy development and validation
Etc. (for some examples of studies using WordStat, see the Studies page).
EXPLORATORY TEXT MINING
Integrated exploratory text mining and visualization tools such as clustering, multidimensional scaling, proximity plots, and more, to quickly extract themes and automatically identify patterns.
Use existing or create custom dictionaries composed of words, word patterns, phrases and proximity rules. Get computer assistance for building taxonomies with phrase and named-entity extraction, misspelling replacements, integrated thesaurus, etc..
Explore relationships among words or extracted concepts using force-based graphs, multidimensional scaling or circular graphs. Retrieve text segments associated with specific connections.
Illustrate patterns and explore complex phenomena with interactive visualization tools such as bar charts, line charts, heatmaps, word clouds, bubble charts, MDS plots, etc.. Copy and paste charts or saved them to disk in bmp, jpg, or png file formats.
Get a quick overview of the most salient topics from large text collections. A side panel allows one to compare the frequency of specific topics across other variables using bar charts or line charts.
Explore relationships between unstructured text and structured data with statistical and graphical tools (correspondence analysis, heatmaps, bubble charts, etc.).
Develop automatic document classification models by using Naive Bayes and K-Nearest Neighbors. Classification models may then be saved on disk and reapplied on new data.
DOCUMENT CONVERSION WIZARD
The Document conversion wizard allows one to easily import into a new Stata .dta file, documents stored in various file formats (.DOC, HTML, PDF, TXT) and automatically extract numeric and alphanumeric values from structured documents.
A more detailed description of WordStat for Stata features, can be found in the sezione principali caratteristiche di WordStat, click here
Throughout the term of this Agreement and subject to its terms and conditions, Provalis Research Corp. (hereinafter “Provalis”) authorises CLIENT to use the Provalis proprietary computer program(s) identified in the relevant invoice, (hereinafter the “Software”) in consideration for the price mentioned in the relevant invoice.
The CLIENT will use the Software for the purposes described in Section 3.1 and will comply with the conditions described herein.
Subject to the terms and conditions expressed herein, Provalis hereby grants CLIENT a non-exclusive licence to use the Software, for the duration specified in the relevant invoice, in machine readable form, on up to two computers, provided that there is no chance it will be used simultaneously on more than one computer.
3.2 Software Ownership
CLIENT acknowledges that no right, title or interest, other than the right to use the Software in the manner described herein, is transferred by the present Agreement. CLIENT agrees that Provalis is and shall remain owner of all copyright, trade secret, patent, trademark and other proprietary rights in and to the Software. The technical procedures, processes, systems, methods of operation, and concepts which are embodied within the Software are trade secret information of Provalis and/or its licensor(s). This licence is not a sale of a copy of the Software and does not render CLIENT the owner of a copy of the Software.
CLIENT may not make copies of the Software, except those copies which are needed for backup purposes. CLIENT will advise its employees who are granted access to the Software of the restrictions upon duplication, reverse engineering, disclosure and use contained in this Agreement. CLIENT will be liable for any unauthorised copying, reverse engineering, disclosure and/or use by its employees or agents.
The CLIENT shall not, without the express prior written consent of Provalis:
a) use the Software under any name other than that of CLIENT;
b) lease, rent, sell, pledge, convey, assign, sublicense, loan, distribute or otherwise transfer to any third party the Software, in whole or in part, or any copy thereof, or any derived works or adaptations based thereupon;
c) subject to what is allowed in Section 3.1, use the Software licensed under an Academic license in any manner to provide computer services to third parties;
d) transfer to any third party any of CLIENT’s rights under this Agreement or otherwise encumber or offer the Software as security; or
e) modify, translate, convert to another programming language, or prepare derived works or adaptations from the Software.
CLIENT shall not modify, dismantle, or decompile or allow anyone to modify, dismantle, or decompile the Software or any portion thereof in any possible way. CLIENT further agrees that it shall not use the Software to assist in the development or design of a computer program that is intended to provide substantially similar functionality as the Software. The Software shall not be used to develop, nor shall CLIENT market, any conversion utility or aid specific to the Software enabling or facilitating users to convert from the Software or a database created by using the software to an alternative software or alternative database not marketed by Provalis.
4. OBLIGATIONS OF CUSTOMER
CLIENT shall be responsible for ensuring the security of the Software, as to physical and electronic intrusions. In particular, in order to protect the trade secrets included in the Software, CLIENT shall protect the areas of the CLIENT’s premises and computing infrastructure where the Software is installed or stored, with adequate security measures so as to prevent unauthorised persons from gaining access and restrict access to those persons having a need to access the Software.
4.1.2 Damage to Software
CLIENT shall be responsible for any damage to the Software.
CLIENT acknowledges that Provalis shall have no control over the conditions under which CLIENT and its employees use the Software and CLIENT therefore agrees to assume the responsibility of the results obtained from the Software provided to CLIENT by Provalis.
Any notice, request, instruction, legal proceedings or other instrument to be given, served, or provided under this Agreement by either party shall be deemed given and received when in writing and delivered personally or five (5) days after being sent by certified or registered mail (postage prepaid) properly addressed to the intended recipient of Provalis or CLIENT, at the address of each as indicated in the header of this Agreement, provided that either party may change such address, only by written notice to the other party.
6. EXPRESS RIGHTS
6.1 Intellectual Property
Provalis warrants that it has the right to grant this licence to use the Software as contemplated hereunder. Subject to the limitations of Section 10, Provalis will indemnify CLIENT and hold CLIENT harmless against any damages awarded in a final judgement against CLIENT, arising out of any claim, by a third party, that the Software infringes or violates any copyright, trademark or trade secret rights of such third party, in Canada or the United States, provided that:
a) CLIENT notifies Provalis promptly in writing of any notice of any such claim;
b) CLIENT cooperates with Provalis in all reasonable respects in connection with the investigation and defence of any such claim;
c) CLIENT has not prejudiced Provalis position’s in any way, by, without limiting the generality of the foregoing, its admissions and/or declarations; and
d) Provalis has and retains at all time sole control of the defence of any action on any such claim and all negotiations for its settlement or compromise.
Should the Software become, or in Provalis’ opinion be likely to become, the subject of a claim of copyright, patent or 2 trademark infringement or trade secret misappropriation, in Canada or the United States, CLIENT will permit Provalis, at Provalis’ option and expense, either to:
i) procure for CLIENT the right to continue using the Software;
ii) replace or modify the same so that it becomes non-infringing; or
iii) terminate this licence, return to the CLIENT the price of the license minus linear (straight line) depreciation calculated monthly over:
– three (3) years from the date of the purchase of the Software, in the case of perpetual licenses,
– twelve (12) months from the date of the purchase of the Software, in the case of annual licenses,
and accept the return of the Software.
Notwithstanding anything herein to the contrary, however, Provalis shall have no obligation or liability to CLIENT under any provision of this Sub-Section if any copyright, patent or trademark infringement or trade secret misappropriation claim is based upon:
a) the use of other than a current unaltered release of Software if the infringement would have been avoided by the use of a current unaltered release of Software;
b) the combination, operation or use of any Software with any programs, software or equipment not provided by Provalis, if the infringement would have been avoided by the combination, operation or use of Software with other programs, software or equipment;
c) the use of the Software in other than the operating environment specified for it by Provalis if the infringement would have been avoided by use in the operating environment specified by Provalis;
d) the use of or access to the Software in a manner other than that for which it was furnished by Provalis; or
e) the use of any Software which has been modified by or for CLIENT in such a way as to cause it to become infringing.
CLIENT expressly agrees to allow Provalis to disclose the existence of the Agreement and the fact that CLIENT is a client of Provalis.
7.1 Confidential Information
CLIENT acknowledges that the Software as well as any technical information, design concepts, methods, processes, procedures, formulae, or algorithms contained therein constitute trade secrets, are given access to by Provalis to CLIENT in confidence, and contain proprietary and confidential information.
Each party shall use the same care to prevent disclosure of the confidential information of the other party to third parties as it employs to avoid disclosure, publication, or dissemination of its own information of a similar nature, but in no event less than a reasonable standard of care.
CLIENT further agrees that it will not, under any circumstances except as expressly authorised hereby, distribute or disseminate any information contained in or disclosed by the Software, including but not limited to any technical information, design concepts, methods, processes, procedures, formulae, or algorithms, to any person except to those of its employees and/or other users with a need-to-know and whose access is necessary for CLIENT’s use as set forth above. This does not prevent the user from communicating to third parties the results of processing data with the Software.
CLIENT will take appropriate action with such employees and/or other users, to inform them of the trade secret, proprietary and confidential nature of the Software, to obtain their compliance with the terms of this section and to have them accept in writing the confidentiality obligations contained herein or similar confidentiality obligations, of similar stringency. In the event that CLIENT, or any of its present or former employees and/or other users breaches the confidentiality obligations of this Agreement, CLIENT shall be jointly and severally liable to Provalis for any loss incurred by Provalis resulting directly or indirectly from such breach.
The Parties acknowledge and agree that the Software, as well as all other information in whatever medium disclosed to CLIENT hereunder are unique and commercially valuable and that any breach by a party of the terms of this Agreement with respect to protection against disclosure or distribution of such information would result in an irreparable and continuing injury to the other party, for which money damages would be inadequate. In the event of such a breach or anticipated breach by one party, the other party shall be entitled to immediate injunctive relief and to specifically enforce the terms of this Agreement, in addition to any other remedy to which such other party may be entitled at law or in equity.
8. LIMITED WARRANTY
During a period of ninety (90) days from the execution of this license, Provalis will make reasonable efforts to correct any material failure of the Software to so conform or perform substantially in accordance with its documentation, provided that: a) CLIENT promptly gives Provalis detailed written notice, in the form required by Provalis, of any such claimed failure to so conform or perform; b) Provalis is able, using reasonable means, to reproduce and identify the error; c) such failure to so conform or perform is not, in Provalis’ reasonable opinion, a result of any modification of or damage to the Software or its operating environment or of CLIENT’s failure to operate or access to the Software through the proper hardware and software environment; d) such failure to so conform or perform is not, in Provalis’ reasonable opinion, a result of the interaction of the Software with other software or hardware not provided, installed or configured by Provalis; and e) such failure to so conform or perform is not, in Provalis’ reasonable opinion, a result of the Software having been subjected to negligence, unauthorised access or use, or computer or electrical malfunctions.
In the event that Provalis shall fail or be unable for any reason to correct any failure of the Software to substantially conform to or perform substantially in accordance with its documentation, CLIENT may terminate this licence and cease using the Software and benefit from its use, and shall be reimbursed for the price of the license minus linear (straight line) depreciation calculated monthly over:
– three (3) years from the date of the purchase of the Software, in the case of perpetual licenses,
– six (6) months from the date of the purchase of the Software, in the case of annual licenses,
Such remedy shall be the CLIENT’s sole remedy.
9. DISCLAIMER OF WARRANTIES
EXCEPT AS EXPRESSLY PROVIDED IN SECTION 8 ABOVE, ANY WARRANTY, EXPRESS OR IMPLIED, OFFERED BY THE LAW OR USAGE OF TRADE, INCLUDING WITHOUT LIMITING THE FOREGOING, ANY WARRANTIES OF QUALITY, ABSENCE OF LATENT DEFECTS, ACCURACY, PERFORMANCE, SUITABILITY OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE IS HEREBY EXCLUDED. EXCEPT AS EXPRESSLY PROVIDED IN SECTION 8 ABOVE, THE ENTIRE RISK IN CONNECTION WITH THE QUALITY AND THE OPERATION OF THE SOFTWARE IS ASSUMED BY THE CUSTOMER. IN ADDITION, EXCEPT AS EXPRESSLY PROVIDED IN SECTION 8 ABOVE, PROVALIS DOES NOT WARRANT AND ASSUMES NO RESPONSIBILITY FOR THE TIMELINESS, LEGALITY, ACCURACY, RELIABILITY, COMPLETENESS OR UTILITY OF A DECLARATION, OF A NOTICE, OF A SERVICE OR ANY OTHER INFORMATION BASED ON THE RESULTS OF THE USE OF THE SOFTWARE.
10. LIMITATION OF LIABILITY
IN NO EVENT WILL PROVALIS BE LIABLE TO CLIENT OR ANY OTHER PERSON FOR ANY LOST PROFITS, LOST SAVINGS, LOST DATA, OR OTHER INDIRECT,SPECIAL, CONSEQUENTIAL OR INCIDENTAL DAMAGES ARISING OUT OF OR RELATING TO THIS AGREEMENT OR ANY INFORMATION, PRODUCT OR SERVICE FURNISHED OR TO BE FURNISHED BY PROVALIS UNDER THIS AGREEMENT OR THE USE THEREOF, EVEN IF PROVALIS HAS BEEN ADVISED OF THE POSSIBILITY OF SUCH LOSS OR DAMAGE. IT IS SPECIFICALLY UNDERSTOOD BETWEEN THE PARTIES THAT PROVALIS SHALL HAVE NO LIABILITY WHATSOEVER CONCERNING DAMAGES ARISING OUT OF OR RELATING TO THE USE OF THE SOFTWARE. IT IS FURTHER UNDERSTOOD THAT PROVALIS SHALL ALSO HAVE NO LIABILITY WHATSOEVER CONCERNING DAMAGES ARISING OUT OF OR RELATING TO UNAUTHORISED ACCESS, ALTERATION OR DESTRUCTION, BY ANY THIRD PARTY, OF A DATABASE, THE CLIENT’S COMPUTER SYSTEM OR THE CLIENT’S FILES OR DOCUMENTS.
THE AGGREGATE LIABILITY OF PROVALIS UPON ANY CLAIMS HOWSOEVER ARISING OUT OF OR RELATING TO THIS AGREEMENT OR ANY INFORMATION, DATA, PRODUCTS OR SERVICES PROCESSED, FURNISHED OR TO BE FURNISHED BY PROVALIS UNDER THIS AGREEMENT WILL IN ANY EVENT BE ABSOLUTELY LIMITED TO THE AMOUNT PAID TO PROVALIS BY CLIENT UNDER THIS AGREEMENT FOR THIS LICENCE TO USE THE SOFTWARE.
11. FEES AND PAYMENTS
CLIENT shall pay to Provalis the price set in the relevant invoice for the use of the Software, within thirty (30) days after the reception of an invoice from Provalis. Late payment may carry interest at a rate of 1% per month and 12% per year, at Provalis’ discretion. Provalis shall have and retain the right to disable CLIENT’s license in case CLIENT has not paid such amount after ninety (90) days without a valid reason.
12. TERM AND TERMINATION
This Agreement shall be effective commencing on the Effective Date and represents a licence to use the Software installed by Provalis, subject to the conditions described herein, for the duration specified in the relevant invoice.
12.2 Termination for Breach
Either party may terminate this Agreement upon written notice to the other party if the other party materially breaches the terms of this Agreement and such default continues uncorrected for a period of ten (10) days after notice in writing thereof to such other party. The fact that one of the parties is adjudged bankrupt; makes a proposal for the benefit of its creditors; has a receiver appointed; files a petition of bankruptcy; initiates reorganisation proceedings; causes or permits to occur any similar event under the laws of its domicile; or ceases to conduct its operations in normal course of business; or is wound up will be interpreted as a default under the terms of the present Agreement.
12.3 Effect of Termination
Upon termination of this Agreement, CLIENT will, within fifteen (15) days: a) return all whole or partial copies of the Software to Provalis; b) destroy any copy of the Software, any relevant documentation or results and certify to Provalis, in writing, that all copies not returned to Provalis have been destroyed. Upon termination, all licences granted pursuant to this Agreement and any maintenance and support agreement executed by both parties will also cease. CLIENT shall pay immediately any and all sums then owing to Provalis hereunder together with interests, if any, accrued on said sums.
CLIENT undertakes, during the term of this agreement and for a period of twelve (12) months after the termination of this Agreement, not to employ or attempt to employ an employee of Provalis. Should CLIENT be in default with respect to the aforementioned obligation, and should CLIENT not cure this default within three (3) days of the reception of a written notice by Provalis, CLIENT shall pay to Provalis, as anticipated damages and not as a penalty, a sum equivalent to the employee’s total remuneration, including benefits, during the six (6) months preceding its employment by CLIENT.
Sub-Sections 3.2, 3.3, 3.4, 6.1 and Sections 7, 9, 10, 12 and 13 will survive the termination of this Agreement.
13. FINAL PROVISIONS
This Agreement sets forth the entire agreement and understanding between Provalis and CLIENT regarding the subject matter hereof and supersedes any prior representations, advertisements, statements, proposals, negotiations, discussions, understandings, or agreements regarding the same subject matter.
The failure by either party at any time to enforce any of the provisions of this Agreement or any right or remedy available hereunder or at law or in equity, or to exercise any option herein provided, shall not constitute a waiver of such provision, right, remedy or option or in any way affect the validity of this Agreement. The waiver of any default by either party shall not be deemed a continuing waiver, but shall apply solely to the instance to which such waiver is directed.
13.3 No Partnership or Joint Venture
It is expressly understood that this Agreement does not constitute and shall not be construed as constituting a partnership or joint venture between Provalis and CLIENT, and that both parties shall remain independent of one another. Neither party shall have the necessary legal authority to bind the other in its absence with regards to any agreement or contract.
This Agreement is written in the English language, at the request of the parties. / Cette entente est rédigée en langue anglaise, à la demande des parties.
Every provision of this Agreement shall be construed, to the extent possible, so as to be valid and enforceable. If any provision of this Agreement so construed is held by a court of competent jurisdiction to be invalid, illegal or otherwise unenforceable, such provision shall be deemed severed from this Agreement, and all other provisions shall remain in full force and effect.
13.6 Choice of Law and Forum
This Agreement shall in all respects be governed by and interpreted, construed and enforced in accordance with the laws applicable within the Province of Quebec, Canada. Any action between Provalis and CLIENT will be venued in a competent court situated within the Judicial District of Montréal, in the Province of Quebec, Canada, and CLIENT irrevocably submits itself to the personal jurisdiction of such courts for such purpose.
13.7 Force Majeure
Each party shall not be in default or otherwise liable for any delay in or failure of its performance as per this Agreement should the cause of such default be beyond its control or due to a case of force majeure. Are deemed to be force majeure any unpredictable and irresistible event and any foreign cause which presents the same characteristics, notably, without limiting the foregoing, the causes beyond Provalis’ control, fire, electrical failures, failures of a telecommunication network or part thereof, floods and other acts of God, accidents, labour disputes, extra-ordinary supplying difficulties, civil unrest, orders, regulations, laws and other governmental interventions, riots, civil or military interventions and act of war (declared or not).
13.8 Assignment and Binding Effect
Provalis may assign, delegate and/or otherwise transfer this Agreement or its rights and obligations hereunder to any person or entity. CLIENT may not assign, delegate or otherwise transfer this Agreement or any of its rights or obligations hereunder without the prior written consent of Provalis. This Agreement shall be binding upon and inure to the benefit of the parties and their respective successors and permitted assigns.
Copyright © 2004-2018 Provalis Research Corp. All rights reserved. No part of this publication may be reproduced or distributed without the prior written permission of Provalis Research Corp., 1604-1255 Robert Bourassa Blvd, Montreal, QC, H3B 3X3, CANADA.
Operating System: Microsoft Windows XP, 2000, Vista, Windows 7 and 8
Memory: From 256 MB (XP) to 1GB (Vista, Windows 7 and 8)
Disk Space: 40 MB of disk space.
WordStat, along QDA Miner, will run on a Mac OS using virtual machine solution or Boot Camp, and on Linux computers using CrossOver or Wine.
A series of short videos developed by Provalis Research, the developers of QDA Miner, offer excellent online tutorial support for conducting qualitative data analysis using the Provalis products.
© Copyright 2019 Provalis Research
WordStat for Stata was created to allow Stata 13 and Stata 14 users running under Windows, to apply text analytics techniques on any string variables stored in a Stata data file. WordStat combines natural language processing, content analysis and statistical techniques to quickly extract topics, patterns and relationships in large amount of text. It can process millions of words in seconds and compare extracted themes across any other numerical, categorical or date variables in the Stata file.