Context Navigation

source: vis_dev/vis-2.1/share/vis/help/regression_testCmd.txt @ 11

Last change on this file since 11 was 11, checked in by cecile, 13 years ago
Add vis
File size: 16.6 KB

Line
1
2	regression_test - perform regression test with the information given in
3	arguments file
4	_________________________________________________________________
5
6	regression_test [-s <script_file_name>]
7
8	Invokes two set of execution of vis to compare the performance and to
9	check sanity of each execution. Then make a set of tables summarizing
10	results, such as CPU sec, memoty usage and fail/pass of model_check,
11	etc.. This command will read a description of the regression test to
12	be performed from the argument file, which is configuartion file.
13	Besides the configuration file, one needs to prepare two template
14	files and a design list. We now describe these in turn. As running
15	example we shall use a regression test that compares check_invariant
16	with and without guided search.
17
18	The configuration file for our example is shown below, delimited by
19	lines of dashes (which are not part of the file). The order in which
20	the lines appear in the file is immaterial. A line beginning with '#'
21	is a comment, and is ignored by regression_test.
22	----------------------------------------------------------------------
23	DesignDirectory : /projects/development/hsv/vis/Regression
24	DesignList : ./DesignListInv
25	ReferenceVis : /usr/local/vis/i686-o/vis-cu
26	ReferenceVisNickName : BFS
27	ReferenceOutputDir : ./result_inv_bfs
28	RefCommandTemplate : ./TemplateRefInv
29	bRefRun : FALSE
30	NewVis : /usr/local/vis/i686-o/vis-cu
31	NewVisNickName : GS
32	NewOutputDir : ./result_inv_gs
33	bNewRun : FALSE
34	NewCommandTemplate : ./TemplateNewInv
35	bCompareCounterExample : TRUE
36	bDefaultInitCommand : FALSE
37	ResultForPerformance : ./BFSvsGS.tex
38	CompareItem : cpu : elapse:
39	CompareItem : mem : Memory in use:
40	CompareItem : bdd : Peak number of live nodes:
41	----------------------------------------------------------------------
42
43	Each line defines a parameter of the regression test.
44
45	* DesignDirectory gives the path to the directory where the input
46	files are stored. In this case, we are using the central repository
47	under ~hsv/vis/Regression. It's typically a good idea to store the
48	results of regression tests in a separate place from the input files.
49	In our examples, the results are stored in the current directory.
50
51	* DesignList gives the name of the file containing the list of test
52	cases to run. Each line consists of several white space-separated
53	fields describing the files making up the test case. The first field
54	is a path to the files relative to DesignDirectory. The second field
55	is the test case name. The remaining fields are file names. More on
56	this later, when we talk about the design list file in detail.
57
58	* ReferenceVis and NewVis give paths to the two versions of vis to be
59	compared in the regression test. They can be the same, as in this case
60	in which both are the "nightly build."
61
62	* ReferenceVisNickName and NewVisNickName are two identifiers that
63	are used in the LaTeX tables to refer to the two versions that are
64	compared. Short mnemonics are preferable. Avoid using underscores,
65	because they currently cause trouble with LaTeX.
66
67	* ReferenceOutputDir and NewOutputDir give the names of the
68	directories in which the results of the runs will be stored. In each
69	of these directories, vis_reg will create a directory for each test
70	case. The names of these directories are the test case names taken
71	from the design list. Therefore, these names must be unique.
72
73	* RefCommandTemplate and NewCommandTemplate are the templates from
74	which vis_reg produces the scripts used to run the test cases. The
75	mechanism used to derive the scripts from the templates and the design
76	list is described in detail when we talk about the template files. The
77	two templates can be the same file.
78
79	* bRefRun and bNewRun are flags that tell vis_reg which of the two
80	sets of experiments should be run (if any). If one set of results has
81	been already computed, the corresponding flag should be set to FALSE.
82	If both flags are FALSE, vis_reg simply regenerates the summary table.
83
84	* bCompareCounterExample is a flag that instructs vis_reg to scan the
85	output files for counterexamples and record their lengths. For each
86	counterexample, stem and cycle length are found. If this flag is set
87	to FALSE, counterexamples are not analyzed and the corresponding table
88	is not produced.
89
90	* bDefaultInitCommand is a flag that, if set to TRUE, instructs
91	vis_reg to generate a default set of commands to initialize each vis
92	run. This option allows one to save some time in preparing the
93	templates, at the expense of some flexibility.
94
95	* ResultForPerformance is the name of the LaTeX file to which vis_reg
96	writes the summary of the experiments. Its format is described in
97	detail later.
98
99	* CompareItem specifies one item to be monitored in the experiments.
100	Multiple CompareItem lines may appear in the configuration file. Each
101	one consists of two items: an identifier, and a pattern. The
102	identifier is used in the table to denote the item, whereas the
103	pattern is used to extract from the output files the information to be
104	put in the tables.
105
106	In our example, we declare
107
108	CompareItem : mem : Memory in use:
109
110	We instruct regression_test to search for a line beginning with
111	"Memory in use:" in each section of the output file. The token
112	immediately following the occurrence of the pattern is the data item
113	to be put in the table under "mem." so that the resulting LaTeX table
114	contains data on the memmory usage.
115
116	Since the output file is divided into sections delimited by strings
117	"CommandBegin" and "CommandEnd." (See below.), each section is
118	searched for the patterns specified by the CompareItem declarations.
119	If the pattern is matched, the item appears in the table for that
120	section.
121
122	To run the regression_test we need configuration file, template file
123	and design list. In case of template file, we need a two template
124	files, since we want to compare two different run of vis. The syntax
125	of the two templates is identical. We describe the template for "new,"
126	but all we say applies to "ref" as well. The "new" template for our
127	example is shown below.
128	----------------------------------------------------------------------
129
130	echo CommandBegin init
131	read_model
132	flt
133	order_vars
134	part
135	time
136	print_bdd_stats
137	echo CommandEnd init
138	echo CommandBegin ci
139	set hd_frontier_approx_threshold 3500
140	ci -t 3600 -g $HintFile $InvFile
141	time
142	print_bdd_stats
143	echo CommandEnd ci
144	echo CommandBegin ci_cex
145	ci -d 1 -i -t 3600 -g $HintFile $InvFile
146	time
147	print_bdd_stats
148	echo CommandEnd ci_cex
149	----------------------------------------------------------------------
150
151	Before proceeding to its discussion, it is instructive to examine the
152	script that regression_test produces from it for one test case.
153	regression_test relies on vis's shell variables and aliases to
154	customize the script. The script consists of a preamble in which
155	variables and aliases are defined, followed by one or more sections,
156	each corresponding to a major step in the experiment. Statistics will
157	be collected for each step separately. Our example has three sections,
158	whose names are "init," "ci," and "ci_cex." The names of the sections
159	are arbitrary, except that the automatically generated initialization
160	section is always called "init."
161	----------------------------------------------------------------------
162
163	set vis_stdout ./result_inv_gs/am2901/am2901.out
164	set vis_stderr ./result_inv_gs/am2901/am2901.err
165	set BlifFile /projects/development/hsv/vis/Regression/Am2901/am2901.mv
166	set OrdFile /projects/development/hsv/vis/Regression/Am2901/am2901.ord
167	set InvFile /projects/development/hsv/vis/Regression/Am2901/am2901.inv
168	set HintFile
169	/projects/development/hsv/vis/Regression/Am2901/am2901.hint
170	alias read_model "rlmv $BlifFile"
171	alias set_fairness " "
172	alias order_vars "so -s input_and_latch $OrdFile"
173	echo CommandBegin init
174	read_model
175	flt
176	order_vars
177	part
178	time
179	print_bdd_stats
180	echo CommandEnd init
181	echo CommandBegin ci
182	set hd_frontier_approx_threshold 3500
183	ci -t 3600 -g $HintFile $InvFile
184	time
185	print_bdd_stats
186	echo CommandEnd ci
187	echo CommandBegin ci_cex
188	ci -d 1 -i -t 3600 -g $HintFile $InvFile
189	time
190	print_bdd_stats
191	echo CommandEnd ci_cex
192	quit
193	----------------------------------------------------------------------
194
195	The variables and the aliases are defined by vis_reg by examining the
196	fields of the test case description in the design file.
197	regression_test uses file extensions to identify file types. (E.g.,
198	.ord indicates an order file in "input_and_latch" format.) A full list
199	of extensions is given below.
200
201	regression_test defines variables for all the file types actually
202	available for the experiment, and only for those. It also defines
203	three aliases that provide some flexibility in mixing different
204	experiments in the same batch.
205
206	For instance, "read_model" is defined as either
207
208	read_blif_mv $BlifFile
209
210	or
211
212	read_blif $BlifFile
213
214	depending on whether the design list specifies a .mv file or a .blif
215	file as model file.
216
217	The definition of "order_vars" takes into account whether an order
218	file is available or not. Finally, the definition of "set_fairness"
219	depends on whether a .fair file is available. This allows one to mix
220	model checking experiments for which no fairness condition is
221	specified to experiments that do require fairness conditions.
222
223	When bDefaultInitCommand is FALSE, the script is simply the preamble,
224	followed by the unchanged template, follows by the "quit" command,
225	which is always automatically added.
226
227	When bDefaultInitCommand is TRUE, regression_test also adds an
228	automatically generated "init" section between the preamble and the
229	template. This default section is equivalent to
230
231	echo CommandBegin init
232	read_model
233	flt
234	order_vars
235	part
236	time
237	print_bdd_stats
238	echo CommandEnd init
239
240	Notice that one has no control over dynamic reordering during the
241	initialization phase if bDefaultInitCommand is TRUE.
242
243	The template should refer to the defined aliases and variables to
244	tailor itself to each test case. Notice, however, that one is not
245	forced to use "order_vars." If one wants to always use vis's static
246	ordering algorithm, (s)he can simply use "so" instead of "order_vars."
247	Likewise, one can decide to always ignore fairness constraints by not
248	including "set_fairness."
249
250	On the other hand, it is usually a bad idea to explicitly include
251	commands like
252
253	read_blif_mv
254	read_blif
255	so -s input_and_latch
256	read_fairness
257
258	in the templates, because they limit the range of test cases to which
259	they can be applied.
260
261	As we said, the templates, and hence the scripts, are divided in
262	sections. Each section begins and ends with an "echo" command. These
263	commands write delimiters to the output file, which are used by
264	vis_reg to define the scope of the pattern searches. It is important
265	that the tag specified in corresponding pairs of CommandBegin and
266	CommandEnd be the same.
267
268	The calls to "print_bdd_stats" are typically used to add to the output
269	files the information requested by the CompareItem declarations. It is
270	a good idea to specify timeouts for potentially expensive steps.
271
272	The design list file has one line for each test case. A master design
273	list is in ~hsv/vis/Regression/DesignList. Specific design lists can
274	be obtained by commenting out or deleting lines from this master list.
275	One can also create an entirely new list according to the following
276	rules.
277
278	Lines starting with '#' are interpreted as comments and are skipped,
279	as in all files read by vis_reg.
280
281	A test case is a named collection of related files that can be used to
282	run one or more experiments. As mentioned before, each line consists
283	of white space-separated fields. The first is a (relative) path, while
284	the second is the collection name. The remaining fields specify the
285	files in the collection.
286
287	The smallest collection will specify one .mv or one .blif file. This
288	file is called the "model file," and should always appear in the third
289	field of the line. If the model file is the only file specified, one
290	can run reachability analysis, but not much more. Vis_reg understands
291	the file extensions shown in the following table. For each extension,
292	the variable bound to the file name and a short explanation are given.
293
294	.mv $BlifFile model file to be read with rlmv
295	.blif $BlifFile model file to be read with rl
296	.ord $OrdFIle order file to be read with so -s input_and_latch
297	.inv $InvFile invariant file to be read with ci
298	.ctl $CtlFile property file to be read with mc
299	.fair $FairFile fairness file to be read with rf
300	.hint $HintFile hint file to be read with the -g option of mc, ci, and
301	rch
302
303	Only one file of each type can appear in the collection defining one
304	test case. If multiple property files, or different order files are
305	available for one model file, one has to create two test cases for
306	that model. Different models, on the other hand, can share property
307	files, order files, and so on. The master design list provides
308	examples for both these situations.
309
310	It is not possible to specify both a .mv file and a .blif file for one
311	test case. There must be exactly one model file per test case.
312
313	As was mentioned before, it is important that the name of a test case
314	be unique in the entire batch used for a regression test. On the other
315	hand, several test cases may share the same directory, and often do
316	when they have some files in common.
317
318	The result summary created by regression_test is a LaTeX file
319	consisting of several tables. There is one table with performance
320	stats for each section of the output file. In addition, there is an
321	optional counterexample comparison table that summarizes PASS/FAIL
322	information for the properties and the length of counterexamples. For
323	each failing property, the length of the stem and the length of the
324	cycle (if applicable) are reported.
325
326	Each performance table consists of one column for the test case name,
327	followed by as many groups of three columns as there are CompareItem
328	declarations applicable to the section that the table summarizes.
329
330	Each group of three columns gives the data for the reference run, the
331	data for the new run, and the improvement. Vis_reg regards smaller as
332	better. Hence, the improvement is positive if the CPU time, memory
333	occupation, and so on, have decreased from the reference run to the
334	new run. The improvement is computed as
335
336	(value(ref) - value(new)) / value(ref) * 100.
337
338	Notice that this formula tends to make "new" look bad. for instance,
339	if new is twice as fast as ref, the improvement is 50%. If ref is
340	twice as fast, however, the "improvement" is -100%.
341
342	The results of each test case are stored in a separate directory,
343	whose name is obtained by concatenating ReferenceOutputDir (or
344	NewOutputDir) with the test case name. Each directory stores three
345	files, whose names are given by the test case name concatenated with
346	.src, .out, and .err.
347
348	* test_case_name.src stores the script used to run vis.
349	* test_case_name.out stores what vis writes to vis_stdout.
350	* test_case_name.err stores what vis writes to vis_stderr.
351
352	One has to be careful, because vis_reg will overwrite these results
353	directories without warning if run twice with the same
354	ReferenceOutputDir or NewOutputDir and bRefRun or bNewRun set to TRUE.
355
356	With '-s' option, you can create c-shell script file to be able to run
357	experiments in shell. We add this option to solve the problem that are
358	happened when your template file contains child-process by system
359	call.
360	_________________________________________________________________
361
362	Last updated on 20050519 10h16

Note: See TracBrowser for help on using the repository browser.

Download in other formats: