corda

mirror of https://github.com/corda/corda.git synced 2025-03-11 23:13:58 +00:00

Author	SHA1	Message	Date
Joshua Warner	47a7732a81	add jdk-test target, and fix failures The intent of this target is to run our test suite against the installed jre. This should help prevent our VM from diverging in implementation from the jdk. The remainder of this commit fixes the problems that this exposes.	2013-12-06 15:00:02 -07:00
Joshua Warner	39e3850ed8	Merge pull request #120 from dscho/look-behind-test Fix the look-behind test for OpenJDK	2013-12-06 12:19:43 -08:00
Johannes Schindelin	d8d980be9a	Fix the look-behind test for OpenJDK OpenJDK's regex engine can only handle look-behinds of limited sizes. So let's just test for that, not the unbounded one we had before (that our own regex engine handles quite fine, though). This fixes issue #115. Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-06 10:50:34 -06:00
Joshua Warner	d17bbc6d1b	Merge pull request #118 from dicej/master fix exception wrapping for Method.invoke and static initializers	2013-12-06 07:13:41 -08:00
Joel Dice	abe8bc6fda	fix exception wrapping for Method.invoke and static initializers Method.invoke should initialize its class before invoking the method, throwing an ExceptionInInitializerError if it fails, without wrapping said error in an InvocationTargetException. Also, we must initialize ExceptionInInitializerError.exception when throwing instances from the VM, since OpenJDK's ExceptionInInitializerError.getCause uses the exception field, not the cause field.	2013-12-05 22:28:13 -07:00
Joel Dice	451fe159c2	Merge pull request #116 from joshuawarner32/master implement sun.misc.Unsafe.throwException	2013-12-05 20:10:53 -08:00
Joshua Warner	8cda2446d5	implement sun.misc.Unsafe.throwException	2013-12-05 20:28:08 -07:00
Joshua Warner	f7ce0c4207	Merge pull request #114 from dicej/treeset-descending-iterator modify TreeSet.MyIterator to support both ascending and descending itera...	2013-12-05 09:27:10 -08:00
Joel Dice	2000c139ea	modify TreeSet.MyIterator to support both ascending and descending iteration This also fixes a bug such that the remove() method left the iterator in an inconsistent state.	2013-12-04 17:52:27 -07:00
Joshua Warner	bb18637f13	Merge pull request #110 from dscho/memory Implement Runtime#{free,total}Memory()	2013-12-04 13:43:45 -08:00
Joshua Warner	136bc0e40b	Merge pull request #111 from dscho/filechannel-size Add FileChannel#size()	2013-12-04 13:36:07 -08:00
Johannes Schindelin	b1d2f66194	Add FileChannel#size() Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-04 15:23:03 -06:00
Johannes Schindelin	18aeeae0e8	Implement Runtime#{free,total}Memory() Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-04 15:19:06 -06:00
Joshua Warner	fe9ac94629	Merge pull request #105 from dscho/regex Support (the most common subset of) regular expressions	2013-12-04 11:57:26 -08:00
Joshua Warner	a90100ee32	Merge pull request #112 from dscho/get-generic-type Support Field#getGenericType()	2013-12-04 11:22:49 -08:00
Johannes Schindelin	6626b477ad	Replace java.util.regex.* with the new regular expression engine Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-04 12:52:03 -06:00
Johannes Schindelin	e96379ee19	Regex: document the strengths and limitations Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-04 12:52:02 -06:00
Johannes Schindelin	9e7169fe34	Regex: let toString() in the Compiler reconstruct the regex Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-04 12:52:02 -06:00
Johannes Schindelin	c975e25864	Regex: implement counted quantifiers: {<n>,<m>} Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-04 12:52:02 -06:00
Johannes Schindelin	2d83622975	Implement Field#getGenericType() Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-03 16:48:40 -06:00
Johannes Schindelin	0eb2d55da2	Class#getDeclaredClasses(): exclude inner classes of inner classes Inner classes can have inner classes, but getDeclaredClasses() is supposed to list only the immediate inner classes. Example: if class Reflection contains a class Hello that contains a class World, Reflection.class.getDeclaredClasses() must not include World in its result. Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-03 16:48:40 -06:00
Johannes Schindelin	fb6486e276	Regex: implement ^,$,\b and \B Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-03 12:28:11 -06:00
Johannes Schindelin	fe32cce2ad	Regex: support intersection/union of character classes Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-03 12:28:11 -06:00
Johannes Schindelin	b4c768b101	Regex: Test Pattern#split(String) The particular pattern we use to test it is used in ImgLib2, based on this answer on stackoverflow: http://stackoverflow.com/a/279337 Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-03 12:28:11 -06:00
Johannes Schindelin	8ab10a6953	Regex: support special character classes This adds support for character classes such as \d or \W, leaving \p{...} style character classes as an exercise for later. Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-03 12:28:11 -06:00
Johannes Schindelin	098f688cd8	Regex: implement negative look-arounds Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-03 12:28:11 -06:00
Johannes Schindelin	8b611c8075	Regex: support look-behind patterns Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-03 12:28:11 -06:00
Johannes Schindelin	62d1964779	Regex: add a method to reverse the PikeVM program A program for the PikeVM corresponds to a regular expression pattern. The program matches the character sequence in left-to-right order. However, for look-behind expressions, we will want to match the character sequence backwards. To this end, it is nice that regular expression patterns can be reversed in a straight-forward manner. However, it would be nice if we could avoid multiple parsing passes and simply parse even look-behind expressions as if they were look-ahead ones, and then simply reverse the program for that part. Happily, it is not difficult to reverse the program so it is equivalent to matching the pattern backwards. There is one catch, though. Imagine matching the sequence "a" against the regular expression "(a?)a?". If we match forward, the group will match the letter "a", when matching backwards, it will match the empty string. So, while the reverse pattern is equivalent to the forward pattern in terms of "does the pattern match that sequence", but not its sub-matches. For that reason, Java simply ignores capturing groups in look-behind patterns (and for consistency, the same holds for look-ahead patterns). Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-03 12:28:11 -06:00
Johannes Schindelin	85af36ef90	Regex: support lookaheads Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-03 12:28:11 -06:00
Johannes Schindelin	d4a2f58eb5	Regex: implement alternatives Now we support regular expressions like 'A\|B\|C'. Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-03 12:28:11 -06:00
Johannes Schindelin	c3a06a600a	Regex: implement non-capturing groups Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-03 12:28:11 -06:00
Johannes Schindelin	53563c4f8e	Regex: add support for character classes Now we support regular expression patterns a la '[0-9]'. Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-03 12:28:11 -06:00
Johannes Schindelin	ca428c406c	Regex: implement find() Now that we have non-greedy repeats, we can implement the find() (which essentially prefixes the regular expression pattern with '.*?'. Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-03 12:28:11 -06:00
Johannes Schindelin	7da03b0f19	Regex: Implement reluctant '?', '*' and '+' Now that we have reluctant quantifiers, we can get rid of the hardcoded program for the challenging regular expression pattern. Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-03 12:28:11 -06:00
Johannes Schindelin	f979505b3d	Regex: implement * and + operators Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-03 12:28:10 -06:00
Johannes Schindelin	d753edafcd	Regex: support the dot Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-03 12:28:10 -06:00
Johannes Schindelin	e2105670a0	Regex compiler: fall back to TrivialPattern when possible While at it, let's get rid of the unescaping in TrivialPattern which was buggy anyway: special operators such as \b were misinterpreted as trivial patterns. Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-03 12:28:10 -06:00
Johannes Schindelin	04d8955f98	Regex: Implement compiler for regular expression patterns Originally, this developer wanted to (ab)use the PikeVM with a hand-crafted program and an added "callback" opcode to parse the regular expressions. However, this turned out to be completely unnecessary: there are no ambiguities in regular expression patterns, so there is no need to do anything else than parse the pattern, one character at a time, into a nested expression that then knows how to write itself into a program for the PikeVM. For the moment, we still hardcode the program for the regular expression pattern demonstrating the challenge with the prioritized threads because the compiler cannot yet parse reluctant operators. Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-03 12:28:10 -06:00
Johannes Schindelin	26c4bf8d8b	Regex: add a class for matching character classes This will be used to match character classes (such as '[0-9a-f]'), but it will also be used by the regular expression pattern compiler to determine whether a character has special meaning in regular expressions. Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-03 12:28:10 -06:00
Johannes Schindelin	d00f799d2e	Regex: special-case a(a?)(a?)(a??)(a+)(a)a Among other challenges, this regular expression is designed to demonstrate that thread prioritization is finicky: Given the string 'aaaaaa' to match, the first four threads will try to grab the second 'a', the third thread (the one that matched the '(a??)' group) having scheduled the same instruction pointer to the '(a+)' group that the second -- higher-priority -- thread will try to advance to only after processing the '(a??)' group's SPLIT. The second thread must override the third thread in that case, essentially stopping the latter. Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-03 12:28:10 -06:00
Johannes Schindelin	edb48ffec2	Regex: support prioritized threads If we want to match greedy or reluctant regular expressions, we have to make sure that certain threads are split off with a higher priority than others. We will use the ThreadQueues' natural order as priority order: high to low. To support splitting into different-priority threads, let's introduce a second SPLIT opcode: SPLIT_JMP. The latter prefers to jump while the former prefers to execute the opcode directly after the SPLIT opcode. There is a subtle challenge here, though: let's assume that there are two current threads and the higher-priority one wants to jump where the lower-priority one is already. In the PikeVM implementation before this change, queueImmediately() would see that there is already a thread queued for that program counter and not queue the higher-priority one. Example: when matching the pattern '(a?)(a??)(a?)' against the string 'aa', after the first character, the first (high priority) thread will have matched the first group while the second thread matched the second group. In the following step, therefore, the first thread will want to SPLIT_JMP to match the final 'a' to the third group but the second thread already queued that program counter. The proposed solution is to introduce a third thread queue: 'queued'. When queuing threads to be executed after reading the next character from the string to match, they are not directly queued into 'next' but into 'queued'. Every thread requiring immediate execution (i.e. before reading the next character) will be queued into 'current'. Whenever 'current' is drained, the next thread from 'queued' that has not been queued to 'current' yet will be executed. That way, we can guarantee that 1) no lower-priority thread can override a higher-priority thread and 2) infinite loop are prevented. Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-03 12:28:10 -06:00
Johannes Schindelin	63b06ebde8	Regex: optimize matching characters Instead of having an opcode 'CHAR', let's have the opcodes that fall within the range of a char be the opcode 'match this character'. While at it, break the ranges of the different types of opcodes apart into ranges so that related operations are clustered. Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-03 12:28:10 -06:00
Johannes Schindelin	b03283033e	Add a unit test for the regular expression engine We still do not parse the regular expression patterns, but we can at least test that the hardcoded 'a(bb)+a' works as expected. This class will be extended as we support more and more features. Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-03 12:28:10 -06:00
Johannes Schindelin	2073d4bffb	Prepare the Matcher class for multiple groups Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-03 12:28:10 -06:00
Johannes Schindelin	e6ad10de04	Implement Pattern / Matcher classes based on the PikeVM Based on the just-implemented PikeVM, let's test it with a specific regular expression. At this point, no parsing is implemented but instead an explicit program executing a(bb)?a is hardcoded. Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-03 12:28:10 -06:00
Johannes Schindelin	944f5f3567	Start implementing a regular expression engine So far, these are humble beginnings indeed. Based on the descriptions of http://swtch.com/%7Ersc/regexp/regexp2.html I started implementing a Thompson NFA / Pike VM. The idea being that eventually, regular expressions are to be compiled into special-purpose bytecode for the Pike VM that executes a varying number of threads in lock-step over each character of the text to match. The thread count is bounded by the length of the program: two different threads with identical instruction pointer at the same character-to-match would yield exactly the same outcome (and therefore, we can execute just one such thread instead of possibly many). To allow for matching groups, each thread carries a state with it, saving the group offsets acquired so far. Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-03 12:28:10 -06:00
Johannes Schindelin	84829dc390	Refactor Pattern / Matcher classes This makes both the Pattern and the Matcher class abstract so that more specialized patterns than the trivial patterns we support so far can be implemented as convenient subclasses of the respective abstract base classes. To ease development, we work on copies in test/regex/ in the 'regex' package. That way, it can be developed in Eclipse (because it does not interfere with Oracle JRE's java.util.regex.* classes). Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-12-03 12:28:10 -06:00
Joshua Warner	c3e3447c62	Merge pull request #107 from dscho/temp-file Delete temporary test file afterwards	2013-12-02 19:54:38 -08:00
Mike Jensen	a2e1e1eec9	Merge pull request #102 from dscho/proxy-annotations This looks good to me, good work.	2013-12-02 09:13:01 -08:00
Johannes Schindelin	0681531dc0	Test complicated annotation constructs Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>	2013-11-27 10:39:28 -06:00

1 2 3 4 5 ...

3685 Commits